Introduction to Pattern Recognition
Chapter 1 (Duda et al.)
What is a Pattern?
What is Pattern Recognition?
What is a Pattern? (cont’d)
• Loan/Credit card applications
  • Income, # of dependents, mortgage amount → credit-worthiness classification.
• Dating services
  • Age, hobbies, income → “desirability” classification.
• Web documents
  • Key-word based descriptions (e.g., documents containing “football”, “NFL”) → document classification.
Pattern Class
• A collection of “similar” objects.
[Figure: example images of the “female” and “male” pattern classes]
How do we model a Pattern Class?
• Typically, using a statistical model:
  • a probability density function (e.g., Gaussian); see the sketch below.
[Figure: Gaussian class-conditional densities for the “male” and “female” classes in a gender-classification example]
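A minimal sketch of this idea in Python, assuming a single made-up “height” feature and illustrative Gaussian parameters (none of these numbers come from the slides):

```python
import numpy as np

def gaussian_pdf(x, mu, sigma):
    """Evaluate the Gaussian density N(mu, sigma^2) at x."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

# Illustrative class-conditional models p(x | class); the feature
# ("height" in cm) and all parameters are assumptions for this sketch.
p_x_given_female = lambda x: gaussian_pdf(x, 165.0, 7.0)
p_x_given_male   = lambda x: gaussian_pdf(x, 178.0, 7.5)

x = 172.0  # a new pattern
print("male" if p_x_given_male(x) > p_x_given_female(x) else "female")
```

Comparing the two class-conditional densities at x (equal priors assumed here) is exactly the kind of decision the density plot on this slide depicts.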
How do we model a Pattern Class? (cont’d)
• Key challenges:
  • Intra-class variability
  • Inter-class variability
Classification vs Clustering
• Classification (known categories): supervised classification (recognition).
• Clustering (unknown categories): unsupervised classification. The two settings are contrasted in the sketch below.
[Figure: points from Category “A” and Category “B”, shown under both classification and clustering]
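A minimal numpy contrast on synthetic data: in classification the category labels are given, while in clustering a procedure such as this bare-bones 2-means loop must discover the groups on its own (all data and parameters here are made up):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal([0.0, 0.0], 0.5, (50, 2))   # points from Category "A"
B = rng.normal([3.0, 3.0], 0.5, (50, 2))   # points from Category "B"
X = np.vstack([A, B])

# Classification (supervised): labels are known at training time.
y = np.array([0] * 50 + [1] * 50)

# Clustering (unsupervised): no labels; 2-means alternates between
# assigning points to the nearest centroid and re-estimating centroids.
centroids = X[rng.choice(len(X), size=2, replace=False)]
for _ in range(10):
    d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    assign = d.argmin(axis=1)
    centroids = np.array([X[assign == k].mean(axis=0) if np.any(assign == k)
                          else centroids[k] for k in range(2)])
print(centroids)  # should land near (0, 0) and (3, 3)
```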
Pattern Recognition Applications

Handwriting Recognition

License Plate Recognition

Biometric Recognition

Fingerprint Classification

Face Detection

Gender Classification

Autonomous Systems

Medical Applications

Land Classification (from aerial or satellite images)

“Hot” Applications
• Recommendation systems
  • Amazon, Netflix
• Targeted advertising

The Netflix Prize
Main Classification Approaches
x: input vector (pattern); y: class label
• Generative
  – Model the joint probability, p(x, y)
  – Make predictions by using Bayes’ rule to calculate p(y|x)
  – Pick the most likely label y
• Discriminative
  – Estimate p(y|x) directly (e.g., learn a direct map from inputs x to the class labels y)
  – Pick the most likely label y
A sketch of the generative route follows below.
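A minimal sketch of the generative approach in Python, using made-up priors and 1-D class-conditional Gaussians for the fish example that follows; a discriminative model would instead fit p(y|x) directly (e.g., logistic regression) without ever modeling p(x, y):

```python
import numpy as np

def gaussian_pdf(x, mu, sigma):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

priors = {"salmon": 0.6, "sea bass": 0.4}                  # p(y), assumed
params = {"salmon": (11.0, 1.0), "sea bass": (14.0, 1.5)}  # (mu, sigma) of p(x|y), assumed

def posterior(x):
    """Bayes' rule: p(y|x) = p(x|y) p(y) / sum over y' of p(x|y') p(y')."""
    joint = {y: gaussian_pdf(x, *params[y]) * priors[y] for y in priors}
    evidence = sum(joint.values())
    return {y: joint[y] / evidence for y in joint}

post = posterior(12.5)
print(max(post, key=post.get), post)  # pick the most likely label
```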
Complexity of PR – An Example
Problem: sorting incoming fish on a conveyor belt.
Assumption: two kinds of fish:
(1) sea bass
(2) salmon
Pre-processing Step (Example)
Feature Extraction
• Assume a fisherman told us that a sea bass is generally longer than a salmon.
“Length” Histograms
[Figure: length histograms for sea bass and salmon, with decision threshold l*]
“Average Lightness” Histograms
• Consider a different feature, such as “average lightness”.
[Figure: lightness histograms for the two classes, with decision threshold x*]
• It seems easier to choose the threshold x*, but we still cannot make a perfect decision; a simple threshold search is sketched below.
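A minimal sketch of picking a 1-D threshold x* by brute force: sweep candidate thresholds and keep the one that minimizes empirical error. The lightness values below are synthetic; note that even the best threshold leaves errors because the histograms overlap:

```python
import numpy as np

rng = np.random.default_rng(1)
salmon  = rng.normal(4.0, 1.0, 100)   # synthetic "average lightness" values
seabass = rng.normal(7.0, 1.0, 100)

def error(t):
    # Decision rule: classify as sea bass if lightness > t, else salmon.
    return (np.sum(seabass <= t) + np.sum(salmon > t)) / 200.0

candidates = np.linspace(0.0, 11.0, 500)
x_star = min(candidates, key=error)
print(f"x* = {x_star:.2f}, training error = {error(x_star):.2%}")
```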
Multiple Features
• To improve recognition accuracy, we might have to use more than one feature at a time.
• Single features might not yield the best performance.
• Using combinations of features might yield better performance.
x = (x1, x2), where x1: lightness and x2: width
• How many features should we choose?
Classification
• Partition the feature space into two regions by finding the decision boundary that minimizes the error (see the sketch below).
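A minimal sketch of finding such a boundary, here a straight line fitted by least squares to synthetic (lightness, width) data; least squares is just one illustrative way to place the boundary, not a method the slides prescribe:

```python
import numpy as np

rng = np.random.default_rng(2)
salmon  = rng.normal([4.0, 3.0], 0.8, (100, 2))   # (lightness, width), synthetic
seabass = rng.normal([7.0, 5.0], 0.8, (100, 2))
X = np.vstack([salmon, seabass])
y = np.array([-1.0] * 100 + [1.0] * 100)

# Augment with a bias term and solve for w in the least-squares sense.
Xb = np.hstack([X, np.ones((200, 1))])
w, *_ = np.linalg.lstsq(Xb, y, rcond=None)

pred = np.sign(Xb @ w)   # decision rule: which side of the line w.x + b = 0
print("training error:", np.mean(pred != y))
```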
PR System – Two Phases
[Figure: block diagram of the training phase and the test phase]
Sensors & Preprocessing
• Sensing:
  • Use a sensor (camera or microphone) for data capture.
  • PR performance depends on the bandwidth, resolution, sensitivity, and distortion of the sensor.
• Pre-processing (sketched below):
  • Removal of noise in the data.
  • Segmentation (i.e., isolation of patterns of interest from the background).
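A toy pre-processing sketch on a synthetic 1-D signal: moving-average smoothing for noise removal, then a simple threshold to segment the pattern of interest from the background (real pipelines are task-specific; this only illustrates the two steps named above):

```python
import numpy as np

rng = np.random.default_rng(3)
signal = np.zeros(200)
signal[80:120] = 1.0                        # the "pattern of interest"
noisy = signal + rng.normal(0.0, 0.3, 200)  # additive sensor noise

kernel = np.ones(9) / 9                     # moving-average (noise removal)
smoothed = np.convolve(noisy, kernel, mode="same")

mask = smoothed > 0.5                       # segmentation by thresholding
idx = np.flatnonzero(mask)
print("segmented span:", idx[0], "to", idx[-1])
```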
Training/Test data
• How do we know that we have collected an adequately large and representative set of examples for training/testing the system?
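There is no general guarantee, but a standard sanity check is to hold out part of the collected examples and measure performance on them. A minimal shuffle-and-split sketch on synthetic data (the 70/30 ratio is an arbitrary illustrative choice):

```python
import numpy as np

rng = np.random.default_rng(4)
X = rng.normal(size=(100, 2))              # 100 collected patterns
y = (X[:, 0] + X[:, 1] > 0).astype(int)    # synthetic labels

idx = rng.permutation(len(X))              # shuffle before splitting
train, test = idx[:70], idx[70:]
X_train, y_train = X[train], y[train]
X_test,  y_test  = X[test],  y[test]
print(len(X_train), "training examples,", len(X_test), "test examples")
```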
Feature Extraction
• How to choose a good set of features?
• Discriminative features
How Many Features?
• Does adding more features always improve performance?
  • It might be difficult and computationally expensive to extract certain features.
  • Correlated features might not improve performance.
  • “Curse” of dimensionality.
Curse of Dimensionality
• Adding too many features can, paradoxically, lead to a worsening of performance.
• Divide each of the input features into a number of intervals, so that the value of a feature can be specified approximately by saying in which interval it lies. The number of resulting cells grows exponentially with the number of features, and each cell needs training examples to be characterized (see the sketch below).
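The cell-counting argument in a few lines: with M intervals per feature, a d-dimensional feature space has M**d cells, so the data requirement grows exponentially with d (M = 10 is an arbitrary illustrative choice):

```python
M = 10  # intervals per feature
for d in (1, 2, 3, 5, 10):
    print(f"d = {d:2d}: {M**d:>14,} cells to populate with examples")
```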
Missing Features
• Certain features might be missing (e.g., due to occlusion).
• How should we train the classifier with missing features?
• How should the classifier make the best decision with missing features?
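One simple (and crude) option at decision time is imputation: fill each missing value with that feature's training mean. More principled approaches marginalize the missing features out of the class-conditional densities. A minimal sketch with made-up numbers:

```python
import numpy as np

X_train = np.array([[5.0, 2.0],     # (lightness, width) training patterns,
                    [6.0, 3.0],     # purely illustrative values
                    [7.0, 2.5]])
feature_means = X_train.mean(axis=0)

x = np.array([np.nan, 2.8])         # lightness occluded, width observed
x_filled = np.where(np.isnan(x), feature_means, x)
print(x_filled)                     # missing entry replaced by the training mean
```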
Complexity
• We can get perfect classification performance on the training data by choosing sufficiently complex models.
• Complex models are tuned to the particular training samples rather than to the characteristics of the true model; this is known as overfitting.
More on model complexity
• Consider the following 10 sample points (blue circles), generated with some noise.
• The green curve is the true function that generated the data.
More on model complexity (cont’d)
Polynomial curve fitting: polynomials of various orders, shown as red curves, fitted to the set of 10 sample points.
More on complexity (cont’d)
Polynomial curve fitting: 9th-order polynomials fitted to 15 and 100 sample points.
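A numerical sketch of the same phenomenon, assuming the familiar sin(2πx) ground truth used in the standard version of this figure: a 9th-order polynomial can interpolate all 10 noisy samples (near-zero training error), fitting the noise rather than the function, while with more samples the high-order fit is tamed:

```python
import numpy as np

rng = np.random.default_rng(5)

def fit_and_report(n, order):
    x = np.linspace(0.0, 1.0, n)
    t = np.sin(2 * np.pi * x) + rng.normal(0.0, 0.2, n)  # noisy samples
    coeffs = np.polyfit(x, t, deg=order)                 # polynomial fit
    rms = np.sqrt(np.mean((np.polyval(coeffs, x) - t) ** 2))
    print(f"n = {n:3d}, order = {order}: training RMS error = {rms:.4f}")

for n in (10, 15, 100):
    fit_and_report(n, order=9)
```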
Ensembles of Classifiers
• Performance can be improved using a “pool” of classifiers (see the sketch below).
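A minimal majority-vote sketch over a pool of toy threshold rules (the rules and numbers are invented for illustration; real ensembles combine independently trained classifiers, as in bagging or boosting):

```python
import numpy as np

def majority_vote(pool, x):
    """Combine a pool of binary classifiers (labels 0/1) by majority vote."""
    votes = np.array([clf(x) for clf in pool])
    return int(votes.sum() > len(votes) / 2)

pool = [
    lambda x: int(x[0] > 5.0),          # toy rule on feature 0 ("lightness")
    lambda x: int(x[1] > 4.0),          # toy rule on feature 1 ("width")
    lambda x: int(x[0] + x[1] > 9.0),   # toy rule on their sum
]
print(majority_vote(pool, np.array([6.2, 3.5])))  # -> 1 (two of three vote 1)
```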
PR System (cont’d)
• Post-processing:
• Exploit context to improve performance.
Cost of misclassifications

Cost of misclassifications (cont’d)
Computational Complexity
• How does an algorithm scale with the number of:
  • features
  • patterns
  • categories
• Consider tradeoffs between computational complexity and performance.
Would it be possible to build a “general purpose” PR system?
• Humans have the ability to switch rapidly and seamlessly between different pattern recognition tasks.
• It is very difficult to design a system that is capable of performing a variety of classification tasks.
  • Different decision tasks may require different features.
  • Different features might yield different solutions.
  • Different tradeoffs exist for different tasks.