Unit 04 Copy
Unit 04 Copy
2
What is a pattern?
Watanable [163] defines a pattern as
“ the opposite of a chaos; it is an entity, vaguely defined, that
could be given a name”
3
Other Patterns
Insurance, credit card applications - applicants are characterized by
- # of accidents, make of car, year of model
- Income, # of dependents, credit worthiness, mortgage amount
Dating services
- Age, hobbies, income, etc. establish your “desirability”
Web documents
- Key words based descriptions (e.g., documents containing
“terrorism”, “Osama” are different from those containing “football”,
“NFL”).
Housing market
- Location, size, year, school district, etc.
4
Pattern Class
A collection of “similar” (not necessarily identical) objects
- Inter-class variability
5
Pattern Class Model
Gaussian)
6
Classification vs Clustering
- Classification (known categories)
- Clustering (creation of new categories)
Category “A”
Clustering
Category “B” (Unsupervised
Classification (Recognition) Classification)
(Supervised Classification)
7
7
Pattern Recognition
Key Objectives:
8
Emerging PR Applications
Problem Input Output
Speech recognition Speech waveforms Spoken words, speaker
identity
Non--destructive testing
Non Ultrasound, eddy current, Presence/absence of flaw,
acoustic emission waveforms type of flaw
Detection and diagnosis of EKG, EEG waveforms Types of cardiac conditions,
disease classes of brain conditions
9
Emerging PR Applications (cont’d)
Problem Input Output
Web search Key words specified by a user Text relevant to the user
Online handwriting retrieval Query word written by a user Occurrence of the word in
the database
10
10
Main PR Areas
Template matching
- The pattern to be recognized is matched against a stored
template while taking into account all allowable pose (translation
and rotation) and scale changes.
Statistical pattern recognition
- Focuses on the statistical properties of the patterns (i.e.,
probability densities).
Structural Pattern Recognition
- Describe complicated objects in terms of simple primitives and
structural relationships.
Syntactic pattern recognition
- Decisions consist of logical rules or grammars.
Artificial Neural Networks
- Inspired by biological neural network models. 11
11
Statistical Pattern Recognition
Pattern Feature
Preprocessing Classification
extraction
Recognition
Training
Feature
Preprocessing Learning
Patterns selection
+
Class labels
12
12
Structural Pattern Recognition
Describe complicated objects in terms of simple primitives and
structural relationships.
Decision-making when features are non-numeric or structural
Scene
Object Background
N
L
X D E M N
T
M
Z
T X Y Z
D E 13
Training
Grammatical,
Primitive
Preprocessing structural
selection
Patterns inference
+
Class labels
14
Chromosome Grammars
15
16
16
Artificial Neural Nodes
Nodes in neural networks are nonlinear, typically
analog
x1 w1
x2
Y (output)
xd wd
17
17
Multilayer Perceptron
Feed-forward nets with one or more layers (hidden) between the input and
output nodes
A three-layer net can generate arbitrary complex decision regions
. .
. . . .
. . .
c outputs
d inputs First hidden layer Second hidden layer
NH1 input units NH2 input units
These nets can be trained by the back-propagation training algorithm
18
18
Comparing Pattern Recognition Models
Template Matching
Structural / Syntactic
Statistical
19
Speech Recognition
20
They can also serve as the input to further linguistic processing in order to
achieve speech understanding
21
Signal processing:
- Convert the audio wave into a sequence of feature vectors
Speech recognition:
- Decode the sequence of feature vectors into a sequence of words
Semantic interpretation:
- Determine the meaning of the recognized words
Dialog Management:
- Correct errors and help get the task done
Response Generation
- What words to use to maximize user understanding
Speech synthesis (Text to Speech):
- Generate synthetic speech from a ‘marked-up’ word string
22
Transcription
Information access
Problem solving
24
Sphinx system
- speaker-independent
- continuous speech
- large vocabulary
ATIS system
- context management
25
Reduce fraud
26
Speech Recognition
Figure out what a person is saying.
Speaker Verification
Authenticate that a person is who she/he claims to be.
Limited speech patterns
Speaker Identification
Assigns an identity to the voice of an unknown person.
Arbitrary speech patterns
27
28
29
30
32
33
34
2000 NetBytel launched the world's first voice enabler, which includes
an on-line ordering application with real-time Internet integration for
Office Depot.
35
36
38
39
When recognising speech, the sounds uttered are allocated to the closest
corresponding output, and the character for that output is printed.
42
o a a h r æ l ø y y j i
o o a h r r r g g y j i
o o m a r m n m n j i i
l o u h v vm n n h hj j j
l u v v p d d t r h hi j
. . u v tk k p p p r k s
. . v k pt t p t p h s s
43
44
For simplicity, you can limit your task to the recognition of digits from 0
to 9. Each digit is represented by a 5 ´ 9 bit map
45
1 2 3 4 5
6 7 8 9 10
11 12 13 14 15
16 17 18 19 20
21 22 23 24 25
26 27 28 29 30
31 32 33 34 35
36 37 38 39 40
41 42 43 44 45
46
The output layer has 10 neurons – one neuron for each digit to be
recognized.
47
48
0 1 1 0
1 2 2 1
1 3
3 0
1
1 4
4 0
0 5 2
5 0
3
6 0
1 41 4
7 0
1 42 5
1 43
8 0
1 44 9 0
1 45 10 0
49