37 views

Uploaded by om98474831879339

- Final Report
- Data ScienceNovember 2017
- A Review of Current Marine Demultiple Techniques
- Artificial Neural Networks in Manufacturing Concepts Applications and Perspectives-hb3
- Uncertain Evaluation for an Ultrasonic data fusion based target differentiation problem using Generalized Aggregated Uncertanty measure 2
- SELF LEARNING FOR AUTONOMOUS SYSTEMS
- optimization
- Singh 2000
- Binder1 55.pdf
- 1009.4983
- VLSI Realization of Switched Hamming Neural Network for 3-Bit Digit Recognition
- Sonar-Based Mobile Robot Navigation Through Supervised Learning on a Neural Net
- Thesis 2
- CS2351 AI UNIT5 Learning
- Optimization of Citric Acid Production Using Ann
- Introduction to Deep Learning Using R
- NeuralNetworks-1
- 2012-01-27 Fast Pixel Pattern Recognition
- 6 Frame Optimization using Neural Network .pdf
- Practical Convolutional Neural Networks_ Implement advanced deep learning models using Python.pdf

You are on page 1of 3

MICHAEL I. JORDAN

Massachusetts Institute of Technology

CHRISTOPHER M. BISHOP

Aston University

Neural networks have emerged as a field of study within AI and engineering via the collaborative efforts of engineers, physicists, mathematicians, computer scientists, and neuroscientists. Although the strands of research are many, there is a basic underlying focus on pattern recognition and pattern generation, embedded within an overall focus on network architectures. Many neural network methods can be viewed as generalizations of classical pattern-oriented techniques in statistics and the engineering areas of signal processing, system identification, optimization, and control theory. There are also ties to parallel processing, VLSI design, and numerical analysis. A neural network is first and foremost a graph, with patterns represented in terms of numerical values attached to the nodes of the graph and transformations between patterns achieved via simple message-passing algorithms. Certain of the nodes in the graph are generally distinguished as being input nodes or output nodes, and the graph as a whole can be viewed as a representation of a multivariate function linking inputs to outputs. Numerical values (weights) are attached to the links of the graph, parameterizing the input/output function and allowing it to be adjusted via a learning algorithm. A broader view of a neural network architecture involves treating the network as a statistical processor, characterized by making particular probabilistic assumptions about data. Patterns appearing on the input nodes or the

Copyright 1996, CRC Press.

output nodes of a network are viewed as samples from probability densities, and a network is viewed as a probabilistic model that assigns probabilities to patterns. The problem of learning the weights of a network is thereby reduced to a problem in statisticsthat of finding weight values that look probable in the light of observed data. The links to statistics have proved important in practical applications of neural networks. Real-world problems are often characterized by complexities such as missing data, mixtures of qualitative and quantitative variables, regimes of qualitatively different functional relationships, and highly nonuniform noise. Neural networks are complex statistical models with the flexibility to address many of these complexities, but as with any flexible statistical model one must take care that the complexity of the network is adjusted appropriately to the problem at hand. A network that is too complex is not only hard to interpret but, by virtue of overfitting the random components of the data, can perform worse on future data than a simpler model. This issue is addressed via statistical techniques such as cross-validation, regularization, and averaging, as well as the use of an increasingly large arsenal of Bayesian methods. Other practical statistical issues that arise include the assignment of degrees of confidence to network outputs (error bars), the active choice of data points (active learning), and the choice between different network struc-

74

Michael I. Jordan and Christopher M. Bishop which are associated with problems in pattern recognition and control theory. Here we give a small selection of examples, focusing on applications in routine use. The problem of recognizing handwritten characters is a challenging one, that has been widely studied as a prototypical example of pattern recognition. Some of the most successful approaches to this problem are based on neural network techniques and have resulted in several commercial applications. Mass screening of medical images is another area in which neural networks have been widely explored, where they form the basis for one of the leading systems for semi-automatic interpretation of cervical smears. As a third example of pattern recognition we mention the problem of verifying handwritten signatures, based on the dynamics of the signature captured during the signing process, where the leading approach to this problem is again based on neural networks. An example of a control application based on neural networks involves the real-time adjustment of the plasma boundary shape in a tokamak fusion experiment, which requires several interdependent parameters to be controlled on time scales of a few tens of microseconds. Neural networks have also been applied to the real-time control of the mirror segments in adaptiveoptics telescopes, used to cancel distortions due to atmospheric turbulence. The prospects for neural networks seem excellent, given the increasing sophistication of the underlying theory, the increasing range of applicability of the techniques, and the growing scale of the applications that are being undertaken. The interdisciplinary nature of research in the field seems certain to persist and to bring new vigor into allied fields. Finally, future progress in theoretical neuroscience will provide a continuing impetus for the development and understanding of network models of intelligence.

tures (model selection). Progress has been made on all these issues by applying and developing statistical ideas. The statistical approach also helps in understanding the capabilities and limitations of network models and in extending their range. Neural networks can be viewed as members of the class of statistical models known as nonparametric, and the general theory of nonparametric statistics is available to analyze network behavior. It is also of interest to note that many neural network architectures have close cousins in the nonparametric statistics literature; for example, the popular multilayer perceptron network is closely related to a statistical model known as projection pursuit, and the equally popular radial basis function network has close ties to kernel regression and kernel density estimation. A more thoroughgoing statistical approach, with close ties to semiparametric statistical modeling, is also available in which not only the input and output nodes of a network but also the intermediate (hidden) nodes are given probabilistic interpretations. The general notion of a mixture model, or more generally a latent variable model, has proved useful in this regard; the hidden units of a network are viewed as unobserved variables that have a parameterized probabilistic relationship with the observed variables (i.e., the inputs, the outputs, or both). This perspective has clarified the links between neural networks and a variety of graphical probabilistic approaches in other fields; in particular, close links have been forged with hidden Markov models, decision trees, factor analysis models, Markov random fields, and Bayesian belief networks. These links have helped to provide new algorithms for updating the values of nodes and the values of the weights in a network; in particular, EM algorithms and stochastic sampling methods such as Gibbs sampling have been used with success. Neural networks have found a wide range of applications, the majority of

ACM Computing Surveys, Vol. 28, No. 1, March 1996

Neural Networks

REFERENCES

BISHOP, C. M. 1995. Neural Networks for Pattern Recognition. Oxford University Press, New York.

75

HAYKIN, S. 1994. Neural Networks: A Comprehensive Foundation. Macmillan, New York. HERTZ, J., KROGH, A., AND PALMER, R. G. 1991. Introduction to the Theory of Neural Computation. Addison-Wesley, Reading, MA.

- Final ReportUploaded byBibinMathew
- Data ScienceNovember 2017Uploaded byChris Morris
- A Review of Current Marine Demultiple TechniquesUploaded bymhouskal
- Artificial Neural Networks in Manufacturing Concepts Applications and Perspectives-hb3Uploaded bytfsm22
- Uncertain Evaluation for an Ultrasonic data fusion based target differentiation problem using Generalized Aggregated Uncertanty measure 2Uploaded byMia Amalia
- SELF LEARNING FOR AUTONOMOUS SYSTEMSUploaded byPraveen Kumar
- optimizationUploaded byDipika Deswal
- Singh 2000Uploaded byVitthal Patnecha
- Binder1 55.pdfUploaded byAbdul Rahman
- 1009.4983Uploaded byThái Sơn
- VLSI Realization of Switched Hamming Neural Network for 3-Bit Digit RecognitionUploaded byEditor IJRITCC
- Sonar-Based Mobile Robot Navigation Through Supervised Learning on a Neural NetUploaded byjejudo
- Thesis 2Uploaded byMadasamy Rathinam R
- CS2351 AI UNIT5 LearningUploaded bykalia_prml
- Optimization of Citric Acid Production Using AnnUploaded byHana Hamid
- Introduction to Deep Learning Using RUploaded byJorgeGazal
- NeuralNetworks-1Uploaded byIrwansyah Ab
- 2012-01-27 Fast Pixel Pattern RecognitionUploaded bysatoen
- 6 Frame Optimization using Neural Network .pdfUploaded bykklisa
- Practical Convolutional Neural Networks_ Implement advanced deep learning models using Python.pdfUploaded byLucas Alves
- Recent Deep LearningUploaded byehsan_civil_62
- Embedded Neural Network.docxUploaded byMohan Raj
- Uncertainty Measurement for Ultrasonic Sensor Fusion Using Generalized Aggregated Uncertainty Measure 1Uploaded byMia Amalia
- UT Dallas Syllabus for econ7309.001.10f taught by Donggyu Sul (dxs093000)Uploaded byUT Dallas Provost's Technology Group
- text7-1_5-16Uploaded bynick4name
- NngUploaded byapi-26172869
- 350-904-1-PB.pdfUploaded byMiguel González Seguerí
- 5000001683-5000002589-1-PB.pdfUploaded byZaya
- Maxchargeperdelay.pdfUploaded byVidya Sagar
- jert_140_07_072905Uploaded bytsheikh6361

- cat-04-2019Uploaded byom98474831879339
- calculus_11_Sequences_and_Series.pdfUploaded byMoonbob Gigi
- MathematicsUploaded byShivantha Hettiarachchi
- RoseUploaded byom98474831879339
- green.pdfUploaded byom98474831879339
- dip0419-1.pdfUploaded byDilshad CM
- Example Questions NumericalUploaded byMoonblades
- Odisha Creativity.pdfUploaded byAshok Kumar Kallo
- Ashta-lakshmi-stotram Malayalam PDF File8748Uploaded byom98474831879339
- Data Sheet IC LM 723Uploaded byWahyu Sulistyo Nugroho
- 1003.pdfUploaded byVaisakVenugopal
- Greatest Common DivisorUploaded byom98474831879339
- Heat transferUploaded byfarshidian
- Diploma Time TableUploaded byom98474831879339
- Instrument Landing System1Uploaded byom98474831879339
- GoddUploaded byom98474831879339
- AmmaUploaded byom98474831879339
- Let 20017Uploaded byom98474831879339
- DsoUploaded byom98474831879339
- ayodyaUploaded byom98474831879339
- keralam thekkadyUploaded byom98474831879339
- House Boating AllappyUploaded byom98474831879339
- keralam1Uploaded byom98474831879339
- MalaUploaded byom98474831879339
- GraphUploaded byom98474831879339
- EcgUploaded byom98474831879339
- Ettumanoor TempleUploaded byom98474831879339
- amma.pdfUploaded byom98474831879339
- GreenUploaded byom98474831879339
- Let 2017 CrashUploaded byom98474831879339

- Venus StepbyStepGuide 4 Light CuringUploaded byPaula Andrea Sanchez Parilo
- human resource management SystemUploaded bykhanrayan82
- _DLC Boot 2018 Final VersionUploaded bydidier
- 2011FRM金程模拟考题Uploaded byLiu Zhilin
- Dna Computing and Security(2)Uploaded byVinay Kumar Rajput
- Breaking Out 3Uploaded byKuru Kshetran
- Quantitative Reasoning Final Exam (QTN-561)Uploaded bybrandonlmurphy
- Bimetal Desingers GuideUploaded byAnjana2893
- Kinatic ChainsUploaded byPushkar Nath
- Computer FundamentalsnotesUploaded byManohar Banoth
- Belief, Truth and Knowledge by D. M. ArmstrongUploaded byDon Arthur
- Display More Than One Internal Table in ALV Using Object Oriented ABAP ProgrammingUploaded bymdwaris
- Impacto Del EjercicioUploaded byangelitau
- Ece Matlab Major Project ListUploaded byEDUGENE TECHNOLOGIES
- FHWA-ICT-12-002Uploaded byUsman Rafiq
- A Programmed Review of Engineering FundamentalsUploaded byOzgur Tung
- Gr7_SBUploaded byJamilah Salleh
- JUE-75CsecurityUploaded byVincent Sarpakis
- Offset Dependent Mis-Tie Analysis at Seismic Line IntersectionUploaded bysialmuhammad456778
- taskingC196manualUploaded byusmancheema81
- 3_4 CSE-2 AY-2017-18(1).pdfUploaded byShantha Kumar
- 10thDeceUploaded byOmaru Nimaga
- WAITING LINES AND CUSTOMER SATISFACTIONUploaded byM S Sridhar
- artigo Poços inteligentesUploaded byipixuna
- alebraheemAMS17-20-2012Uploaded byaldo
- Simplified Linear MMSE algorithm for OFDM sytemsUploaded bysriblue7
- motorbustransfertutorialrev20080715-160401012509Uploaded bycyyguy3k
- A Multivariate Statistical Modeling of Geochemical Factors of Soils, Sediments and Ground Water M.tech Thesis_Sabyasachi RoutUploaded bySabyasachi Rout
- Prop Demand JW HerbertUploaded byspurupuntu
- Bear- jfebear-tp-l-001_blanking_dimensions_rev14.pdfUploaded byRyan Llaneta