Deep Learning Neural Network - Lecture 1
Neural Network
2019/2/25 1
Course Details
• Contents:
• Introduction
• Programming frameworks
• Applications, data collection, data preprocessing, features selection
• Neural Network and Deep Learning Architecture
• Convolutional Neural Network
• Sequence Model
• Introduction to Reinforcement Learning
Today's discussion focuses on:
• Recap of last week's discussion and project ideas for the final project
Final project ideas
Project timing
• Proposal
• Progress and plan for next step
• Poster presentation (short)
• Final report
Summary of last week
• Mitchell’s definition
• “A computer program is said to learn from experience E with respect to some class of tasks T and
performance measure P, if its performance at tasks in T, as measured by P, improves with
experience E”.
• Machine learning performance P
• Accuracy = (TP + TN) / (TP + TN + FP + FN)
• Confusion matrix (Precision, Recall, F1 measures)
  • Precision = TP / (TP + FP)
  • Recall = TP / (TP + FN)
  • F1 = 2 · (P · R) / (P + R)
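The measures above can be sketched directly from the four confusion-matrix counts. The counts below are hypothetical, chosen only to illustrate the formulas:

```python
# Accuracy, precision, recall, and F1 from confusion-matrix entries.
def metrics(tp, tn, fp, fn):
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)          # TP / (TP + FP)
    recall = tp / (tp + fn)             # TP / (TP + FN)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1

# Hypothetical counts for a binary classifier:
acc, p, r, f1 = metrics(tp=80, tn=90, fp=10, fn=20)
print(acc, p, r, f1)   # accuracy 0.85, recall 0.8
```

Note that F1 is the harmonic mean of precision and recall, so it is only high when both are high.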
• A = 99.9%: high accuracy alone can be misleading when one class is rare (example from Wikipedia)
• Common machine learning tasks T
• Classification
• Regression
• Machine translation
• Anomaly detection
• Machine learning experience E
• Supervised
• Unsupervised
• Some machine learning algorithms interact with the environment
(feedback in the loop) – reinforcement learning
Underfitting and overfitting
Neural Network and Deep Learning Architecture
• Introduction
• Basics of Neural Network Architecture
• One Layer Neural Network
• Deep Neural Network
What is a Neural Network?
[Figure: a single neuron mapping the size of a house (input) to its price (output)]
Sensory representation in the brain
• [BrainPort; Welsh & Blasch, 1997; Nagel et al., 2005; Constantine-Paton & Law, 2009]
Housing Price Prediction
• Inputs: size (x1), #bedrooms (x2), zip code (x3), wealth (x4); output: the price y
• Training data: pairs (x, y)
Supervised Learning
• Standard NN
• Recurrent NN
• Convolutional NN
What drives deep learning?
[Figure: performance vs. amount of data; with a small training set the methods are hard to tell apart, while larger networks pull ahead as the data grows]
• Trend: the Gartner hype cycle graph for analyzing the history of artificial neural network technology
Break
Binary Classification
• Input: an image x; output: a label y ∈ {0, 1}
• The red, green, and blue channels of the image are matrices of pixel intensities (values such as 255, 231, 134, 42, 22, …)
• Unrolling all pixel values into a single feature vector x gives dimension n_x = 64 × 64 × 3 = 12288
• The m training vectors are stacked column-wise into the matrix X
Notation
• A single training example: (x, y), with x ∈ ℝ^{n_x} and y ∈ {0, 1}
• m training examples stacked column-wise: X ∈ ℝ^{n_x × m}
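The notation above can be sketched in NumPy. The three random 64×64×3 "images" here are hypothetical stand-ins for real training data:

```python
# Build the design matrix X (shape n_x x m) from m images,
# flattening each image into a column vector of length 64*64*3 = 12288.
import numpy as np

m = 3
images = np.random.rand(m, 64, 64, 3)   # m hypothetical RGB images
X = images.reshape(m, -1).T             # each column is one unrolled example
y = np.array([[0, 1, 1]])               # labels y in {0, 1}, shape (1, m)

print(X.shape)   # (12288, 3)
```

Storing examples as columns makes a forward pass over all m examples a single matrix product.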
Logistic Regression
• Given x, output ŷ = P(y = 1 | x), where x ∈ ℝ^{n_x} and 0 ≤ ŷ ≤ 1
• Sigmoid: σ(z) = 1 / (1 + e^{−z})
• Parameters: w ∈ ℝ^{n_x}, b ∈ ℝ
• Output: ŷ = σ(wᵀx + b)
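A minimal sketch of the model above; the values of w, b, and x are hypothetical:

```python
# Logistic regression output: y_hat = sigma(w^T x + b).
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

n_x = 4
w = np.zeros((n_x, 1))                       # parameters w in R^{n_x}
b = 0.0                                      # parameter b in R
x = np.array([[1.0], [2.0], [0.5], [3.0]])   # one input example

y_hat = sigmoid(w.T @ x + b)   # with w = 0, b = 0 this is sigma(0)
print(y_hat.item())            # 0.5
```

Because σ maps any real z into (0, 1), the output can be read as a probability P(y = 1 | x).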
Logistic Regression cost function
• Model: ŷ = σ(wᵀx + b), where σ(z) = 1 / (1 + e^{−z}) and z^{(i)} = wᵀx^{(i)} + b
• Squared-error loss L(ŷ, y) = ½(ŷ − y)² is not used here, because it makes the optimization non-convex
• Loss (error) function: L(ŷ, y) = −(y log ŷ + (1 − y) log(1 − ŷ))
  • if y = 1: L(ŷ, y) = −log ŷ
  • if y = 0: L(ŷ, y) = −log(1 − ŷ)
• Cost function: J(w, b) = (1/m) Σᵢ₌₁ᵐ L(ŷ^{(i)}, y^{(i)}) = −(1/m) Σᵢ₌₁ᵐ [y^{(i)} log ŷ^{(i)} + (1 − y^{(i)}) log(1 − ŷ^{(i)})]
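The cross-entropy cost can be sketched over a small batch; the predictions and labels below are hypothetical:

```python
# Cross-entropy cost J over m examples, averaging the per-example losses.
import numpy as np

def cost(a, y):
    m = y.shape[0]
    # -(1/m) * sum[ y*log(a) + (1-y)*log(1-a) ]
    return -np.sum(y * np.log(a) + (1 - y) * np.log(1 - a)) / m

a = np.array([0.9, 0.2, 0.8])   # predictions y_hat^(i)
y = np.array([1.0, 0.0, 1.0])   # labels y^(i)
print(cost(a, y))
```

Each term penalizes confident wrong predictions heavily: as a → 0 with y = 1, the loss −log a → ∞.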
Gradient Descent
• ŷ = σ(wᵀx + b), where σ(z) = 1 / (1 + e^{−z})
• J(w, b) = (1/m) Σᵢ₌₁ᵐ L(ŷ^{(i)}, y^{(i)}) = −(1/m) Σᵢ₌₁ᵐ [y^{(i)} log ŷ^{(i)} + (1 − y^{(i)}) log(1 − ŷ^{(i)})]
[Figure: the convex cost surface J(w, b) over the parameters w and b; gradient descent steps downhill toward the minimum]
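The update rule w := w − α · dJ/dw can be sketched on a one-dimensional toy cost. The cost J(w) = (w − 3)² is hypothetical, chosen so the minimum is known to be w = 3:

```python
# Gradient descent on a simple convex toy cost J(w) = (w - 3)^2.
alpha = 0.1          # learning rate
w = 0.0              # initial parameter
for _ in range(200):
    dw = 2 * (w - 3)        # derivative dJ/dw
    w = w - alpha * dw      # update: w := w - alpha * dw
print(w)                    # converges toward 3
```

Each step moves w opposite the gradient; on a convex cost this walk reaches the global minimum.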
Gradient descent
• a = σ(z), and with z = wx + b the prediction is ŷ = σ(wx + b)
• Remember the chain rule of differentiation:
  • if z = xy, then ∂z/∂x = y
  • if z = u + b with u = wx, then ∂z/∂w = (∂z/∂u)(∂u/∂w) = 1 · x = x
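The chain rule above can be checked numerically. For ŷ = σ(wx + b), the analytic derivative dŷ/dw = σ′(z) · x should match a finite difference; the values of w, x, b here are hypothetical:

```python
# Verify the chain rule d(sigma(wx + b))/dw = sigma'(z) * x numerically.
import math

def sigma(z):
    return 1.0 / (1.0 + math.exp(-z))

w, x, b = 0.5, 2.0, -1.0
z = w * x + b
analytic = sigma(z) * (1 - sigma(z)) * x   # sigma'(z) = sigma(z)(1 - sigma(z))

eps = 1e-6
numeric = (sigma((w + eps) * x + b) - sigma((w - eps) * x + b)) / (2 * eps)
print(analytic, numeric)   # the two values agree closely
```

This finite-difference check is a standard way to debug hand-derived gradients.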
Optimization algorithms
Logistic regression
[Computation graph: inputs x1, x2, x3, x4 with weights w1, …, w4 and bias b (constant input +1) feed z, then a = σ(z), then the loss L]
• Forward: z = wᵀx + b, ŷ = a = σ(z)
• Loss: L(a, y) = −(y log(a) + (1 − y) log(1 − a))
• Backward (derivatives of L):
  • da = dL/da = −y/a + (1 − y)/(1 − a)
  • dz = dL/dz = (dL/da)(da/dz) = a − y
  • dw1 = (dL/da)(da/dz)(dz/dw1) = x1 · dz
  • dw2 = x2 · dz
  • db = dz
• Updates: w1 := w1 − α · dw1, w2 := w2 − α · dw2, b := b − α · db
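The backward pass above can be sketched for a single two-feature example; the input, weight, and label values are hypothetical:

```python
# One-example backward pass: da, dz = a - y, dw_k = x_k * dz, db = dz.
import math

def sigma(z):
    return 1.0 / (1.0 + math.exp(-z))

x1, x2 = 1.0, 2.0
w1, w2, b = 0.3, -0.2, 0.1
y = 1.0

z = w1 * x1 + w2 * x2 + b       # forward: z = 0.0 here
a = sigma(z)                    # a = 0.5

da = -y / a + (1 - y) / (1 - a)   # dL/da
dz = a - y                        # dL/dz, via dz = da * sigma'(z)
dw1, dw2, db = x1 * dz, x2 * dz, dz

# sanity check: dz must equal da * a * (1 - a)
assert abs(dz - da * a * (1 - a)) < 1e-12
print(dz, dw1, dw2, db)           # -0.5 -0.5 -1.0 -0.5
```

The shortcut dz = a − y is why logistic regression's gradient code stays so compact.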
Logistic regression
• J(w, b) = (1/m) Σᵢ₌₁ᵐ L(a^{(i)}, y^{(i)})
• ∂J/∂w1 = (1/m) Σᵢ₌₁ᵐ ∂L(a^{(i)}, y^{(i)})/∂w1 = (1/m) Σᵢ₌₁ᵐ dw1^{(i)}
• The per-example derivatives dw1^{(i)}, dw2^{(i)}, db^{(i)} are averaged over the m training examples
Logistic regression
• One pass over the m training examples (initialize J = 0, dw1 = 0, dw2 = 0, db = 0):
  for i = 1 to m:
      z = wᵀx^(i) + b
      a = σ(z)
      J += −(y^(i) log(a) + (1 − y^(i)) log(1 − a))
      dz = a − y^(i)
      dw1 += x1^(i) * dz
      dw2 += x2^(i) * dz
      db += dz
  J = J/m; dw1 = dw1/m; dw2 = dw2/m; db = db/m
• Updates: w1 := w1 − α · dw1, w2 := w2 − α · dw2, b := b − α · db
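The per-example loop above can be vectorized so one matrix product handles all m examples at once. This is a sketch on hypothetical toy data (random points labeled by whether their coordinates sum to a positive number):

```python
# Vectorized logistic-regression training: forward, backward, and update
# over all m examples per iteration.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
n_x, m = 2, 100
X = rng.normal(size=(n_x, m))                       # one example per column
Y = (X[0] + X[1] > 0).astype(float).reshape(1, m)   # toy separable labels

w = np.zeros((n_x, 1))
b = 0.0
alpha = 0.5   # learning rate

for _ in range(500):
    A = sigmoid(w.T @ X + b)   # forward: a^(i) for all i at once
    dZ = A - Y                 # dz = a - y, per example
    dw = X @ dZ.T / m          # averaged gradient for w
    db = np.sum(dZ) / m        # averaged gradient for b
    w -= alpha * dw            # w := w - alpha * dw
    b -= alpha * db            # b := b - alpha * db

train_acc = np.mean((sigmoid(w.T @ X + b) > 0.5) == Y)
print(train_acc)               # high accuracy on this toy data
```

Replacing the explicit for-loop over examples with matrix operations is the motivation for the vector-valued functions discussed next.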
Vector Valued functions