0% found this document useful (0 votes)

61 views34 pages

Deep Learning Basics for Beginners

Uploaded by

attackontitans.blacklover

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

61 views34 pages

Deep Learning Basics for Beginners

Uploaded by

attackontitans.blacklover

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 34

Introduction to deep

learning

By: DIVAKAR KESHRI

PhD NIT TRICHY
About this course
• Introduction to deep learning
• basics of ML assumed
• mostly high-school math
• much of theory, many details skipped
• 1st day: lectures + small-scale exercises using
notebooks.csc.fi
• 2nd day: experiments using GPUs at Puhti-AI
• Slides at: https://tinyurl.com/yyej6rxl
• Other materials at GitHub:
https://github.com/csc-training/intro-to-dl
• Gitter chat at:
https://gitter.im/csc_training/intro-to-dl
• Focus on text and image classification, no fancy
stuff
Further resources
• This course is largely “inspired by”: “Deep
Learning with Python” by François Chollet
• Recommended textbook: “Deep learning”
by Goodfellow, Bengio, Courville
• Lots of further material available online, e.g.:
http://cs231n.stanford.edu/ http://course.fast.ai/
https://developers.google.com/machine-learning/crash-course/

www.nvidia.com/dlilabs http://introtodeeplearning.com/
https://github.com/oxford-cs-deepnlp-2017/lectures,
https://jalammar.github.io/
• Academic courses
What is artificial
intelligence?

Artificial intelligence is the ability of a computer to

perform tasks commonly associated with intelligent
beings.
What is machine
learning?

Machine learning is the study of algorithms that

learn from examples and experience instead of
relying on hard-coded rules and make predictions
on new data.
What is deep learning?

Deep learning is a subfield of machine learning

focusing on learning data representations as
successive layers of increasingly meaningful
representations.
Image from https://blogs.nvidia.com/blog/2016/07/29/whats-difference-artificial-intelligence-machine-learning-deep-learning-ai/
“Traditional” machine learning:

handcrafted learned
cat
features classifier

Deep, “end-to-end” learning:

learned learned learned

learned
low-level mid-level high-level cat
classifier
features features features
From: Wang & Raj: On the Origin of Deep Learning (2017)
Main types of machine
learning
Main types of machine learning

• Supervised learning
cat
• Unsupervised learning
• Self-supervised dog
learning
• Reinforcement
learning
Main types of machine learning

• Supervised learning

• Unsupervised
learning
• Self-supervised
learning
• Reinforcement
learning
Main types of machine learning

• Supervised learning

• Unsupervised learning
• Self-supervised
learning
• Reinforcement
learning

Image from https://arxiv.org/abs/1710.10196

Main types of machine learning

• Supervised learning

• Unsupervised learning
• Self-supervised
learning
• Reinforcement
learning

Animation from https://yanpanlau.github.io/2016/07/10/FlappyBird-Keras.html

Fundamentals of machine
learning
Data
• Humans learn by observation
and unsupervised learning
• model of the world /
common sense reasoning
• Machine learning needs lots
of (labeled) data to
compensate
Data

• Tensors: generalization of matrices

to n dimensions (or rank, order, degree)
• 1D tensor: vector
• 2D tensor: matrix
• 3D, 4D, 5D tensors
• numpy.ndarray(shape, dtype)
• Training – validation – test split (+
adversarial test)
• Minibatches
• small sets of input data used at a time
• usually processed independently Image from:
https://arxiv.org/abs/1707.08945
Model – learning/training – inference

http://playground.tensorflow.org/

• parameters 𝜃 and hyperparameters

Optimization
• Mathematical optimization:
“the selection of a best element
(with
regard to some criterion) from some
set of available alternatives”
(Wikipedia)
• Main types:
• Learning asiterative,
finite-step, an optimization
heuristic
By Rebecca Wilson (originally posted to Flickr as Vicariously) [CC BY 2.0], via Wikimedia Commons

problem
loss regularization
• cost function:
Optimization

Image from: Li et al. “Visualizing the Loss Landscape of Neural Nets”, arXiv:1712.09913
Gradient descent

• Derivative and minima/maxima of

functions
• Gradient: the derivative of a multivariable
function
• Gradient descent:

• (Mini-batch) stochastic gradient

descent (and its variants)
Image from: https://towardsdatascience.com/gradient-descent-algorithm-and-its-variants-10f652806a3
Over- and underfitting, generalization,
regularization
• Models with lots of parameters
can easily overfit to training
data
• Generalization: the quality of
ML model is measured on new,
unseen samples
• Regularization: any method*
to prevent overfitting
• simplicity, sparsity, dropout, early
stopping
• *) other than adding more data By Chabacano [GFDL or CC BY-SA 4.0], from Wikimedia Commons
Deep learning
Anatomy of a deep neural network

• Layers
• Input data and targets
• Loss function
• Optimizer
Layers
• Data processing modules
• Many different kinds exist
• densely connected
• convolutional
• recurrent
• pooling, flattening, merging,
normalization, etc.
• Input: one or more tensors
output: one or more tensors
• Usually have a state, encoded as
weights
• learned, initially random
• When combined, form a network or
a model
Input data and targets

• The network maps the input

data X to predictions Y′
• During training, the
predictions Y′ are compared
to true targets Y using the
loss function

cat
dog
Loss function
• The quantity to be minimized (optimized) during
training
• the only thing the network cares about
• there might also be other metrics you care
about
• Common tasks have “standard” loss functions:
• mean squared error for regression
• binary cross-entropy for two-class
classification
• categorical cross-entropy for multi-class
classification
• etc.
Optimizer
• How to update the
weights based on the
loss function
• Learning rate
(+scheduling)
• Stochastic gradient
descent, momentum,
and their variants
• RMSProp is usually a
good first choice
• more info:
http://ruder.io/optimizing-gradient-d
escent/ Animation from: https://imgur.com/s25RsOr
Anatomy of a deep neural network
Deep learning frameworks
Deep learning frameworks
+

• Actually tools for defining static or

dynamic general-purpose +
computational graphs
• Automatic differentiation ✕ ✕
• Seamless CPU / GPU usage
• multi-GPU, distributed
x y 5
• Python/numpy or R interfaces
• instead of C, C++, CUDA or HIP
• Open source
Deep learning Lasagne Keras TF Estimator torch.nn Gluon

frameworks
Theano TensorFlow CNTK PyTorch MXNet Caffe

CUDA, cuDNN
MKL, MKL-DNN
• Keras is a high-level HIP, MIOpen

neural networks API

• we will use TensorFlow GPUs CPUs
as the compute backend
• included in TensorFlow 2 as tf.keras
• https://keras.io/ ,
https://www.tensorflow.org/guide/keras
• PyTorch is:
• a GPU-based tensor library
• an efficient library for dynamic neural networks

Introduction To Deep Learning
No ratings yet
Introduction To Deep Learning
34 pages
Deep Learning Course Introduction
No ratings yet
Deep Learning Course Introduction
34 pages
Main
No ratings yet
Main
17 pages
Deep Learning Introduction Class
No ratings yet
Deep Learning Introduction Class
46 pages
Machine Learning Vs Deep Learning
No ratings yet
Machine Learning Vs Deep Learning
2 pages
1c Machinelearning
No ratings yet
1c Machinelearning
50 pages
Introduction To ML
No ratings yet
Introduction To ML
34 pages
Machine Learning vs Deep Learning
No ratings yet
Machine Learning vs Deep Learning
2 pages
Machinelearning VSDeep Learning
No ratings yet
Machinelearning VSDeep Learning
1 page
AI & ML Basics for Beginners
No ratings yet
AI & ML Basics for Beginners
39 pages
Introduction To Machine Learning For Beginners: Ayush Pant
No ratings yet
Introduction To Machine Learning For Beginners: Ayush Pant
28 pages
Artificial Inteligence PDF
No ratings yet
Artificial Inteligence PDF
328 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
7 pages
Unit-I Deep Learning Techniques
No ratings yet
Unit-I Deep Learning Techniques
20 pages
DL Unit - I CSD Iv
No ratings yet
DL Unit - I CSD Iv
19 pages
OCI ML Fundations
No ratings yet
OCI ML Fundations
9 pages
Lectures On Machine Learning
100% (1)
Lectures On Machine Learning
69 pages
Python Machine Learning Machine Learning and Deep Learning From Scratch Illustrated With Python Scikit Learn Keras Theano and Tensorflow 1211083261
No ratings yet
Python Machine Learning Machine Learning and Deep Learning From Scratch Illustrated With Python Scikit Learn Keras Theano and Tensorflow 1211083261
53 pages
Jntuk r20 Unit-I Deep Learning Techniques (WWW - Jntumaterials.co - In)
No ratings yet
Jntuk r20 Unit-I Deep Learning Techniques (WWW - Jntumaterials.co - In)
23 pages
Deep Learning Midsem Merged Previous Batch
No ratings yet
Deep Learning Midsem Merged Previous Batch
423 pages
Session 2 - Machine Learning Fundamental
No ratings yet
Session 2 - Machine Learning Fundamental
25 pages
Unit 3 Introduction To Deep Learning Part 1
No ratings yet
Unit 3 Introduction To Deep Learning Part 1
7 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
22 pages
Unit I
No ratings yet
Unit I
10 pages
Unit 1
No ratings yet
Unit 1
46 pages
Module 1 DL Snotes
No ratings yet
Module 1 DL Snotes
11 pages
Machine Learning and Deep Learning Basics
No ratings yet
Machine Learning and Deep Learning Basics
36 pages
Deep Learning Unit-II
No ratings yet
Deep Learning Unit-II
19 pages
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
100% (1)
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
51 pages
Deep Learning: A Visual Guide
No ratings yet
Deep Learning: A Visual Guide
53 pages
Intro to Machine Learning Basics
No ratings yet
Intro to Machine Learning Basics
71 pages
00 Pytorch and Deep Learning Fundamentals PDF
No ratings yet
00 Pytorch and Deep Learning Fundamentals PDF
44 pages
Deep Learning Review and Discussion of Its Future
No ratings yet
Deep Learning Review and Discussion of Its Future
7 pages
JNTUK R20 B.Tech CSE 4-1 Deep Learning Techniques Unit 1 Notes
No ratings yet
JNTUK R20 B.Tech CSE 4-1 Deep Learning Techniques Unit 1 Notes
15 pages
Lecture Notes: Introduction To Machine Learning For The Sciences
No ratings yet
Lecture Notes: Introduction To Machine Learning For The Sciences
80 pages
I MSC DS ML Notes
No ratings yet
I MSC DS ML Notes
109 pages
Lect 4-Introduction To Deep Learning
No ratings yet
Lect 4-Introduction To Deep Learning
33 pages
Intro Part1
No ratings yet
Intro Part1
50 pages
Unit - 1 Deep Learning Techniques
No ratings yet
Unit - 1 Deep Learning Techniques
18 pages
Advanced Machine Learning Tutorial
No ratings yet
Advanced Machine Learning Tutorial
37 pages
Modern Deep Learning Foundation by Barak or
No ratings yet
Modern Deep Learning Foundation by Barak or
144 pages
Deep Learning File
No ratings yet
Deep Learning File
58 pages
Unit 2 Introduction To Deep Learning & Architectures
No ratings yet
Unit 2 Introduction To Deep Learning & Architectures
90 pages
Machine Learning Semester Paper
No ratings yet
Machine Learning Semester Paper
31 pages
DL Unit 1
No ratings yet
DL Unit 1
21 pages
Asg202508161528241103 0 238
No ratings yet
Asg202508161528241103 0 238
5 pages
Deep Learning With Tensorflow
100% (1)
Deep Learning With Tensorflow
70 pages
Intro To Machine Learning
100% (1)
Intro To Machine Learning
250 pages
Deep Learning
No ratings yet
Deep Learning
100 pages
ITR Roll No.20
No ratings yet
ITR Roll No.20
3 pages
Salman Technical Seminar
No ratings yet
Salman Technical Seminar
24 pages
Lecture 01 Introduction
No ratings yet
Lecture 01 Introduction
58 pages
ML Microst
No ratings yet
ML Microst
264 pages
Lec 1,2
No ratings yet
Lec 1,2
69 pages
Machine Learning Using Python
No ratings yet
Machine Learning Using Python
12 pages
Deep Learning Introduction
No ratings yet
Deep Learning Introduction
14 pages
Unit 1
No ratings yet
Unit 1
26 pages
ME3435E ADDTE Lect27 Machine Learning For Signal Processing 19.03.25
No ratings yet
ME3435E ADDTE Lect27 Machine Learning For Signal Processing 19.03.25
34 pages
RL Class Mtech
No ratings yet
RL Class Mtech
67 pages
Deep Learning Unit2
No ratings yet
Deep Learning Unit2
43 pages
Deep Learning and Its Applications
No ratings yet
Deep Learning and Its Applications
33 pages
FL 1
No ratings yet
FL 1
25 pages
FCE Result 2015 Students Book
68% (25)
FCE Result 2015 Students Book
178 pages
Humanistic Theory
No ratings yet
Humanistic Theory
2 pages
Korean Verb Conjugation Guide
No ratings yet
Korean Verb Conjugation Guide
3 pages
Peng Et Al (201-WPS Office
No ratings yet
Peng Et Al (201-WPS Office
2 pages
Personality Types and Megabytes: Student Attitudes Toward C0Mputer Mediated Communication (CMC) in The Language Classroom
No ratings yet
Personality Types and Megabytes: Student Attitudes Toward C0Mputer Mediated Communication (CMC) in The Language Classroom
19 pages
ظاهرة التقديم والتأخير ...
No ratings yet
ظاهرة التقديم والتأخير ...
56 pages
Htkbook
No ratings yet
Htkbook
354 pages
Module 3 Reflection Fba
No ratings yet
Module 3 Reflection Fba
1 page
Language Learning Process in Early Childhood
100% (3)
Language Learning Process in Early Childhood
34 pages
A Detailed Lesson Plan in Grade 10
No ratings yet
A Detailed Lesson Plan in Grade 10
9 pages
Module 1 Reflection 1
No ratings yet
Module 1 Reflection 1
2 pages
Theories of Spoken Word Recognition, PIA
No ratings yet
Theories of Spoken Word Recognition, PIA
58 pages
The Philosophical Diseases of Medicine and Their Cure
No ratings yet
The Philosophical Diseases of Medicine and Their Cure
435 pages
4mat Lesson Plan Final
No ratings yet
4mat Lesson Plan Final
4 pages
Clownery
No ratings yet
Clownery
2 pages
Epistemological Beliefs and Attitudes Toward Inclusion in Pre-Service Teachers
No ratings yet
Epistemological Beliefs and Attitudes Toward Inclusion in Pre-Service Teachers
10 pages
Final Exam AG2 - Virtual 202206 Avanzado 5 17-45-19 - 15
No ratings yet
Final Exam AG2 - Virtual 202206 Avanzado 5 17-45-19 - 15
18 pages
Veronica 2022
No ratings yet
Veronica 2022
13 pages
F IDEA - Its Meaning, Its Distinction From Phantasm
No ratings yet
F IDEA - Its Meaning, Its Distinction From Phantasm
3 pages
10 - Chapter-2, Review of Related Literature
No ratings yet
10 - Chapter-2, Review of Related Literature
30 pages
Lesson 7 Language, Culture and Society
No ratings yet
Lesson 7 Language, Culture and Society
3 pages
THESIS
No ratings yet
THESIS
27 pages
Lect5 (Models of Supervision in Education)
100% (4)
Lect5 (Models of Supervision in Education)
7 pages
Plinth To Paramount by Neetu Singh Downloaded From Edumo - in
50% (2)
Plinth To Paramount by Neetu Singh Downloaded From Edumo - in
424 pages
Grade 1 Leadership Seminar Review
100% (1)
Grade 1 Leadership Seminar Review
5 pages
2016 Teens 1 Specifications: o LIVE BEAT 1 Units 1 To 9 (Student's Book & Workbook)
No ratings yet
2016 Teens 1 Specifications: o LIVE BEAT 1 Units 1 To 9 (Student's Book & Workbook)
1 page
QSEN Competencies for Nurses
No ratings yet
QSEN Competencies for Nurses
34 pages
Employee Training in NTC
100% (2)
Employee Training in NTC
15 pages
English 5 Curriculum: Q1 Weeks 1-10
100% (1)
English 5 Curriculum: Q1 Weeks 1-10
4 pages

Deep Learning Basics for Beginners

Uploaded by

Deep Learning Basics for Beginners

Uploaded by

Introduction to deep

By: DIVAKAR KESHRI

Artificial intelligence is the ability of a computer to

Machine learning is the study of algorithms that

Deep learning is a subfield of machine learning

Deep, “end-to-end” learning:

learned learned learned

Image from https://arxiv.org/abs/1710.10196

Animation from https://yanpanlau.github.io/2016/07/10/FlappyBird-Keras.html

• Tensors: generalization of matrices

• parameters 𝜃 and hyperparameters

• Derivative and minima/maxima of

• (Mini-batch) stochastic gradient

• The network maps the input

• Actually tools for defining static or

neural networks API

You might also like