Professional Documents
Culture Documents
Syllabus
Mod-I : Pattern Recognition Overview
1
2/21/2017
Books
Text Books:
1. Robort Schalkoff - Pattern Recognition, Statistical, Structural and
Neural Approach, John Wiley Indian Edition.
Reference Books :
1. R. U. Duda Pattern Classification, John Wiley, Indian Edition, 2006.
3. E. Gose, R.J., & S.J. Pattern Recognition and Image Analysis, PHI
Learning Pvt Ltd.
2
2/21/2017
To analyze the working of heart, doctor can recognize the ECG pattern
made by Heart beat and detect any misfire in pattern produced by
heart pumping.
3
2/21/2017
4
2/21/2017
Shape Discrimination
5
2/21/2017
Texture Discrimination
6
2/21/2017
make sound and reasonable decisions about the categories of the patterns.
7
2/21/2017
8
2/21/2017
Given some examples of complex signals and the correct decisions for
them, make decisions automatically for a stream of future examples
Ripley
Before 1960 PR was mostly the output of theoretical research in the area
of statistics.
9
2/21/2017
10
2/21/2017
Application Areas of PR
Machine Vision System (Inspector): Image Analysis
Character Recognition System (OCR) : Image Analysis
Computer aided diagnosis
Speech and Audio recognition (NLP)
Data mining and Knowledge discovery (Information Retrieval)
Biometrics: Faces, Iris, fingerprints, handwriting etc
Bioinformatics : DNA
Seismic analysis: Volcano eruption, earth quake etc
Radar signal classification and analysis
Medical domain: ECG, medical diagnosis
Remote sensing: Weather forecasting, Estimation of Glacier
melting etc
In general, extracting the hidden pattern and trend.
Application Areas of PR
11
2/21/2017
What is a Pattern?
Pattern is an abstract entity.
Examples of patterns
12
2/21/2017
What is a Feature?
13
2/21/2017
Objects are represented as points in feature space; the result is a scatter plot
Feature vectors ... used in StatPR, NeurPR
Classification
14
2/21/2017
Feature Extraction
Feature as measurements extracted from data may require
significant computational effort (e.g., extracting shape
properties of 3D objects)
Extracted features may be noisy ... may have errors
The quality of a feature vector is related to its ability to
discriminate examples from different classes
Examples from the same class should have similar feature values,
while from different classes save different feature values
Feature Selection
selection of features from the set of available
Features based on.
computationally feasible
good discriminative power
good descriptive power
15
2/21/2017
Pattern Distortion
Measurements may be noisy ... color varies with
lighting, shape varies with viewing angle, etc.
16
2/21/2017
17
2/21/2017
Some definitions
Recognition: It is the ability to classify. In PR problems, dont
know class is dummy c+1st class
Classification: It assigns input data to one or more of c pre-
specified classes based on extraction of significant features or
attributes and the analysis of these attributes
Description It is alternative to classification where structural
description of the input pattern is desired.
A pattern Class: it is a set of patterns known to originate from
the same source in C.
Noise: It results from non-ideal circumstances
Distortion in the input pattern(measurement errors)
Error in preprocessing
Feature extraction errors
Training data errors
18
2/21/2017
Decision Rule:
gm(X) > gi(X) where i = 1, 2, 3, c
and i =/= m
Discriminant functions
g1(X) = || X-X1||
g2(X) = || X- X2 ||
Corresponding partition
of R2
Decision Rule:
X belongs to class R1 if g1(X) < g2(X)
X on the decision boundary if g1(X) == g2(X)
X belongs to class R2 Otherwise
19
2/21/2017
Pre-processing: (Segmentation)
Removal of noise in data
Isolation of patterns of interest from the background
Classification:
Using features and learned models to assign a pattern to a
category
Post-processing:
Evaluation of confidence in decisions
Exploitation of context to improve performance
Combination of experts
20
2/21/2017
test Feature
Preprocessing Classification
pattern Measurement
training Feature
pattern Preprocessing Extraction/ Learning
Selection
Training Mode
21
2/21/2017
Fish species
22
2/21/2017
Clear that the populations of salmon and sea bass are indeed distinct.
The space of all sh is quite large. Each dimension is dened by some property of
the sh, most of which we cannot even measure with the camera.
23
2/21/2017
24
2/21/2017
Machine Learning
Programming computers to use example data or
past experience.
25
2/21/2017
Preprocessing
For lighting conditions, position of fish on the conveyor belt, camera noise,
etc, just to segmentation to separate fish from background
-So, we can use length as a feature and sort the sea bass and salmon
according to a threshold on length.
-So, from the segmented image we estimate the length of the fishes
26
2/21/2017
Assume we also observed that sea bass are typically longer than
salmon.
27
2/21/2017
Decision rule:
Classify the fish as a sea bass
if its feature vector falls above
the decision boundary shown,
and as salmon otherwise
28
2/21/2017
29
2/21/2017
30
2/21/2017
Generalization
A good classifier should be able to generalize, i.e. perform well on unseen
data
As consequence:
We are better off with a slightly poorer performance on the training
examples, if this means that our classifier will have better performance
on novel patterns.
Generalization
The decision boundary shown may represent the optimal tradeoff between
accuracy on the training set and on new patterns
31
2/21/2017
32
2/21/2017
Performance evaluation:
Classification Error rate (Pe): The percentage of
misclassified test samples is taken as an estimate
of the error rate. i.e. new patterns that are
assigned to the wrong class.
Bayesian Decision
Theory: considers the
ideal case in which the
probability structure
underlying the classes is
known perfectly. This is
rarely true in practice,
but it allows to
determine the optimal
classifier, against which
we can compare all
classifiers.
33
2/21/2017
34
2/21/2017
Approaches of PR
Statistical
-Patterns classified based on an underlying statistical model of the features
-The statistical model is defined by a family of class-conditional probability
density functions (x/ ) (Probability of feature vector given class )
Neural
-Classification is based on the response of a network of processing units (neurons) to an
input stimuli (pattern)
-Knowledge is stored in the connectivity and strength of the synaptic weights
-Trainable, non-algorithmic, black-box strategy
-Very attractive since it requires minimum a priori knowledge
-With enough layers and neurons, ANNs can create any complex decision region
Syntactic
-Patterns classified based on measures of structural similarity
-Knowledge is represented by means of formal grammars or relational descriptions
(graphs)
-Used not only for classification, but also for description
-Typically, syntactic approaches formulate hierarchical descriptions of complex patterns
built up from simpler sub patterns
35
2/21/2017
Comparisons of approaches
36
2/21/2017
Thanks
37