Challenges Motivating Deep Learning
Sargur N. Srihari
[email protected]
Topics in Machine Learning Basics

1. Learning Algorithms
2. Capacity, Overfitting and Underfitting
3. Hyperparameters and Validation Sets
4. Estimators, Bias and Variance
5. Maximum Likelihood Estimation
6. Bayesian Statistics
7. Supervised Learning Algorithms
8. Unsupervised Learning Algorithms
9. Stochastic Gradient Descent
10. Building a Machine Learning Algorithm
11. Challenges Motivating Deep Learning

Topics in “Motivations”
• Shortcomings of conventional ML
1. The curse of dimensionality
2. Local constancy and smoothness regularization
3. Manifold learning


Challenges Motivating DL
• Simple ML algorithms work very well on a wide variety of important problems
• However, they have not succeeded in solving the central problems of AI, such as recognizing speech or recognizing objects
• Deep learning was motivated by the failure of traditional algorithms to generalize well on such tasks

Curse of dimensionality

• The number of possible distinct configurations of a set of variables increases exponentially with the number of variables
– Poses a statistical challenge
• Ex: 10 regions of interest with one variable
– We need to track 100 regions with two variables
– And 1,000 regions with three variables (see the sketch below)
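A minimal sketch of this exponential growth, assuming each variable is simply discretized into 10 bins (the bin count and loop range are illustrative, not from the slides):

# Minimal sketch: with each variable discretized into 10 bins,
# the number of distinct configurations to track grows as 10**d.
for d in range(1, 6):
    print(f"{d} variable(s): {10 ** d} regions")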
Local Constancy & Smoothness Regularization
• Prior beliefs
– To generalize well, ML algorithms need prior beliefs
• Can take the form of probability distributions over parameters
• Or can influence the function itself, while parameters are influenced only indirectly
• Or can bias the algorithm towards preferring a certain class of functions
– These biases may not be expressed in terms of a probability distribution

• Most widely used prior is smoothness


– Also called local constancy prior
– States that the function we learn should not change very much within a small region

Local Constancy Prior


• Function should not change very much within a
small region
• Many simpler algorithms rely exclusively on this
prior to generalize well
– Thus they fail to scale to the statistical challenges of AI tasks
• Deep learning introduces additional (explicit
and implicit) priors in order to reduce
generalization error on sophisticated tasks
• We now explain why smoothness alone is insufficient

Specifying smoothness
• Several methods encourage learning a function f* that satisfies the condition f*(x) ≈ f*(x + ε)
– For most configurations x and small change ε
• If we know a good answer for input x then that
answer is good in the neighborhood of x
• An extreme example is k-nearest neighbor
– Points having the same set of nearest neighbors all
have the same prediction
– For k = 1, the number of regions cannot exceed the number of training examples (see the sketch below)
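A minimal 1-nearest-neighbor sketch of this behavior, using illustrative toy data (not from the slides): the prediction is piecewise constant, so every query closest to the same training point gets that point's label, giving at most one region per training example.

import numpy as np

X_train = np.array([[0.0], [1.0], [2.0]])   # three training inputs (toy data)
y_train = np.array([0, 1, 1])               # their labels

def predict_1nn(x):
    # index of the closest training example decides the prediction,
    # so with k = 1 there are at most len(X_train) distinct regions
    i = np.argmin(np.linalg.norm(X_train - x, axis=1))
    return y_train[i]

for q in [0.1, 0.4, 0.9, 1.6, 2.3]:
    print(q, "->", predict_1nn(np.array([q])))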

Kernel machines and smoothness


• Kernel machines interpolate between training
set outputs associated with nearby training
examples
• With local kernels: k(u,v) is large when u=v
and decreases as u and v grow further apart
• Can be thought of as a similarity function that
performs template matching
– By measuring how closely test example x
resembles training example x(i)
• Much of deep learning is motivated by the limitations of template matching (see the kernel sketch below)
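A sketch of one common local kernel, the Gaussian (RBF) kernel, chosen here as an assumed example (the slides do not name a specific kernel): k(u, v) is largest when u = v and decays as u and v move apart, so the prediction amounts to template matching against training examples.

import numpy as np

def rbf_kernel(u, v, sigma=1.0):
    # local similarity: equals 1.0 when u == v, decays towards 0 as u, v move apart
    return np.exp(-np.sum((u - v) ** 2) / (2 * sigma ** 2))

x_test = np.array([0.2, 0.1])                             # toy test example
templates = [np.array([0.0, 0.0]), np.array([3.0, 3.0])]  # toy training examples
print([round(rbf_kernel(x_test, t), 4) for t in templates])
# the nearby template gets a much larger similarity than the distant one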

Decision Trees and Smoothness


• Decision trees also suffer from the limitations of exclusively smoothness-based learning
– They break input space into as many regions as
there are leaves and use a separate parameter in
each region
– For n leaves, at least n training samples are required
– Many more are needed for statistical confidence (see the sketch below)

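An illustrative sketch of the leaves-versus-examples relationship, assuming scikit-learn is available (the library and toy data are assumptions, not from the slides): an unpruned tree that fits the training data exactly splits the input space into one region per leaf, and the number of leaves cannot exceed the number of training examples.

import numpy as np
from sklearn.tree import DecisionTreeClassifier

X = np.arange(8, dtype=float).reshape(-1, 1)  # 8 toy training inputs
y = np.array([0, 1, 0, 1, 1, 0, 1, 0])        # toy labels

tree = DecisionTreeClassifier().fit(X, y)     # unpruned by default
print("training examples:", len(X))
print("leaf regions:     ", tree.get_n_leaves())  # never exceeds the number of examples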

No. of examples and no. of regions


• All of the above methods require:
– O(k) examples to distinguish O(k) regions
– O(k) parameters, with O(1) parameters associated with each of the O(k) regions
• Nearest-neighbor: each training sample (circle) defines at most one region
– The y value associated with each example defines the output for all points within its region


More regions than examples


• Suppose we need more regions than examples
• Two questions of interest
1. Is it possible to represent a complicated function efficiently?
2. Is it possible for the estimated function to
generalize well for new inputs?
• Answer to both is yes
– O(2^k) regions can be defined with O(k) examples
• By introducing dependencies between regions through assumptions about the data-generating distribution
Core idea of deep learning

• Assume data was generated by a composition of factors, at multiple levels in a hierarchy
– Many other similarly generic assumptions can further improve deep learning algorithms
• These mild assumptions allow an exponential gain in the relationship between the number of examples and the number of regions that can be distinguished
– An example of a distributed representation is a
vector of n binary features
• It can take 2^n configurations (see the sketch below)
– Whereas in a symbolic
representation, each input
is associated with a single
symbol (or category)
– Here h1, h2, and h3 are three binary features
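A minimal sketch of the counting argument, with n = 3 chosen as an illustrative value (not from the slides): three binary features jointly distinguish 2^3 = 8 configurations, whereas a purely symbolic representation would need 8 separate symbols to cover the same inputs.

from itertools import product

n = 3  # illustrative number of binary features (h1, h2, h3)
distributed = list(product([0, 1], repeat=n))
print(len(distributed), "configurations from", n, "binary features")

# a symbolic (one-symbol-per-input) representation needs one entry per configuration
symbolic = [f"category_{i}" for i in range(2 ** n)]
print(len(symbolic), "separate symbols for the same coverage")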

Manifold Learning
• An important concept underlying many ideas in machine learning
• A manifold is a connected region
– Mathematically, it is a set of points associated with a neighborhood around each point
– Locally, it appears to be a Euclidean space
• E.g., we experience the world as a 2-D plane, while it is actually a spherical manifold in 3-D space


Manifold in Machine Learning


• Although a manifold is precisely defined in mathematics, in machine learning the term is used more loosely:
– A connected set of points that can be approximated well by considering only a small number of degrees of freedom, embedded in a higher-dimensional space (see the sketch below)
Figure: training data lying near a 1-D manifold in a 2-D space; the solid line indicates the underlying manifold that the learner should infer

In machine learning we allow the dimensionality of the manifold to vary from one point to another. This often happens when a manifold intersects itself, as in a figure-eight.
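A small sketch of such data, with an illustrative curve and noise level chosen here (not from the slides): a single degree of freedom t traces out a 1-D manifold embedded in 2-D space, and the samples lie near it.

import numpy as np

rng = np.random.default_rng(0)
t = rng.uniform(0.0, 2.0 * np.pi, size=200)        # one degree of freedom
curve = np.stack([np.cos(t), np.sin(2.0 * t)], 1)  # 1-D manifold in 2-D (a figure-eight)
data = curve + 0.02 * rng.normal(size=curve.shape) # samples lying near the manifold
print(data.shape)  # (200, 2): 2-D coordinates with ~1-D underlying structure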
Manifold learning surmounts R^n


• It is sometimes hopeless to learn functions with variations across all of R^n
• Manifold learning algorithms surmount this obstacle by assuming most of R^n consists of invalid inputs
– And that interesting inputs occur only along a collection of manifolds
• Introduced for continuous data and unsupervised learning, the probability concentration idea can be generalized to discrete data and supervised settings

Manifold hypothesis for Images


• The manifold assumption is justified because:
– Probability distributions over images are highly concentrated
– Uniformly sampled images look like static noise, never like structured images (see the sketch below)
– Although there is a non-zero probability of generating a face this way, it is essentially never observed in practice
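A small sketch of the uniform-sampling thought experiment, with an illustrative image size and count (not from the slides): images drawn uniformly over pixel values are pure static, because structured images occupy a vanishingly small region of pixel space.

import numpy as np

rng = np.random.default_rng(0)
# 5 "images" of 64x64 pixels with values drawn uniformly from 0..255
random_images = rng.integers(0, 256, size=(5, 64, 64), dtype=np.uint8)
print(random_images.shape)  # every one of these looks like static noise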

Manifold hypothesis justified in the Text domain


• If you generate a document by choosing characters at random, there is a near-zero probability of producing meaningful text
• Natural language sequences occupy a small
volume of total space of sequences of letters


Manifolds traced by transformations


• Manifolds can be traced by making small
transformations
• Example: the manifold structure of a dataset of human faces


Manifolds discovered for Human Faces


• A variational autoencoder discovers an underlying two-dimensional coordinate system:
1. Rotation
2. Emotion

