
Machine Learning

Neural Networks

Slides mostly adapted from Tom Mitchell, and Han and Kamber
Introduction to Artificial Neural Networks
Neural networks to the rescue
 Neural network: an information processing paradigm inspired by biological nervous systems, such as the brain
 Structure: a large number of highly interconnected processing elements (neurons) working together
 Like people, they learn from experience (by example)
Neural networks to the rescue
 Neural networks are configured for a specific application, such as pattern recognition or data classification, through a learning process
 In a biological system, learning involves adjustments to the synaptic connections between neurons
 The same holds for artificial neural networks (ANNs)
Inspiration from Neurobiology
 A neuron: a many-inputs / one-output unit
 The output can be excited or not excited
 Incoming signals from other neurons determine whether the neuron shall excite ("fire")
 The output is subject to attenuation in the synapses, which are junction parts of the neuron
Synapse concept
 The synapse's resistance to the incoming signal can be changed during a "learning" process [Hebb, 1949]

Hebb's Rule:
If an input of a neuron repeatedly and persistently causes the neuron to fire, a metabolic change happens in the synapse of that particular input to reduce its resistance
Mathematical representation
The neuron calculates a weighted sum of its inputs and compares it to a threshold. If the sum is higher than the threshold, the output is set to 1, otherwise to -1. This thresholding is the neuron's non-linearity.
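As a concrete illustration, here is a minimal Python sketch of such a threshold unit (the function name and example values are ours, not from the slides):

```python
# A threshold neuron: compare the weighted sum of inputs to a threshold.
def neuron_output(inputs, weights, threshold):
    """Return 1 if the weighted sum exceeds the threshold, otherwise -1."""
    weighted_sum = sum(w * x for w, x in zip(weights, inputs))
    return 1 if weighted_sum > threshold else -1

print(neuron_output([1, 0], [0.3, 0.3], 0.5))  # -1, since 0.3 <= 0.5
print(neuron_output([1, 1], [0.3, 0.3], 0.5))  # 1, since 0.6 > 0.5
```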
A simple perceptron
 It’s a single-unit network
 Change each weight by an amount proportional to the difference between the desired output and the actual output:

ΔWi = η · (D − Y) · Ii

where Ii is the input, Y the actual output, D the desired output, and η the learning rate

Perceptron Learning Rule


Example: A simple single-unit
adaptive network
 The network has 2 inputs and one output. All are binary. The output is
 1 if W0·I0 + W1·I1 + Wb > 0
 0 if W0·I0 + W1·I1 + Wb ≤ 0
 We want it to learn simple OR: output a 1 if either I0 or I1 is 1 (see the sketch below)
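Below is a hedged Python sketch of training this unit on OR with the update rule ΔWi = η(D − Y)Ii from the earlier slide; the bias Wb is treated as a weight on a constant input of 1, and the initial weights and learning rate are our own assumptions:

```python
eta = 0.1            # learning rate (assumed)
w = [0.0, 0.0, 0.0]  # [W0, W1, Wb], initialized to zero (assumed)

# OR training data: (I0, I1) -> desired output D
data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]

for epoch in range(20):
    for (i0, i1), d in data:
        y = 1 if w[0] * i0 + w[1] * i1 + w[2] > 0 else 0
        # Delta rule applied to each weight (the bias input is the constant 1)
        w[0] += eta * (d - y) * i0
        w[1] += eta * (d - y) * i1
        w[2] += eta * (d - y) * 1

# OR is linearly separable, so the perceptron convergence theorem
# guarantees the weights settle on a correct separator
for (i0, i1), d in data:
    y = 1 if w[0] * i0 + w[1] * i1 + w[2] > 0 else 0
    print((i0, i1), "->", y, "(desired:", d, ")")
```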
Learning
 From experience: examples / training data
 The strength of the connection between two neurons is stored as a weight value for that specific connection
 Learning the solution to a problem = changing the connection weights


Artificial Neural Networks
Adaptive interaction between individual neurons
Power: the collective behavior of interconnected neurons

The hidden layer learns to recode (or to provide a representation of) the inputs: associative mapping
Evolving networks
 A continuous process of:
 Evaluate output
 Adapt weights
 Take new inputs
 ANN "learning": the evolution settles into a stable state of the weights, but the neurons continue working: the network has "learned" to deal with the problem
Where are NNs used?

 Recognizing and matching complicated, vague, or incomplete patterns
 When data is unreliable
 Problems with noisy data

Typical tasks:
Prediction
Classification
Data association
Data conceptualization
Filtering
Planning
Applications
 Prediction: learning from past experience
 pick the best stocks in the market
 predict weather
 identify people with cancer risk
 Classification
 Image processing
 Predict bankruptcy for credit card companies
 Risk assessment
Applications
 Recognition
 Pattern recognition: SNOOPE (bomb detector in U.S.
airports)
 Character recognition
 Handwriting: processing checks
 Data association
 Not only identify the characters that were scanned, but also detect when the scanner is not working properly
Applications
 Data Conceptualization
 infer grouping relationships
e.g. extract from a database the names of those most likely to
buy a particular product.
 Data Filtering
e.g. take the noise out of a telephone signal, signal smoothing
 Planning
 Unknown environments
 Sensor data is noisy
 Fairly new approach to planning
Artificial Neural Networks
 Computational models inspired by the human brain:
 Algorithms that try to mimic the brain
 Massively parallel, distributed systems made up of simple processing units (neurons)
 Synaptic connection strengths among neurons are used to store the acquired knowledge
 Knowledge is acquired by the network from its environment through a learning process
History
 Late 1800s: neural networks appear as an analogy to biological systems
 1960s and 70s: simple neural networks appear
 They fall out of favor because the perceptron is not effective by itself, and there were no good algorithms for multilayer nets
 1986: the backpropagation algorithm appears
 Neural networks have a resurgence in popularity
 More computationally expensive
Applications of ANNs
 ANNs have been widely used in various domains
for:
 Pattern recognition
 Function approximation
 Associative memory
Properties
 Inputs are flexible
 any real values
 highly correlated or independent
 The target function may be discrete-valued, real-valued, or a vector of discrete or real values
 Outputs are real numbers between 0 and 1
 Resistant to errors in the training data
 Long training time
 Fast evaluation
 The function produced can be difficult for humans to interpret
When to consider neural networks
 Input is high-dimensional, discrete or real-valued
 Output is discrete or real-valued
 Output is a vector of values
 Possibly noisy data
 Form of target function is unknown
 Human readability of the result is not important

Examples:
 Speech phoneme recognition
 Image classification
 Financial prediction
A Neuron (= a perceptron)

[Figure: a perceptron with inputs x0 ... xn, weights w0 ... wn, a summation node with bias -t, an activation function f, and output y]

 The n-dimensional input vector x is mapped into the variable y by means of the scalar product and a nonlinear function mapping:

y = sign( Σi=0..n wi·xi − t )
Perceptron
 Basic unit in a neural network
 Linear separator

 Parts
 n inputs, x1 ... xn
 Weights for each input, w1 ... wn
 A bias input x0 (constant) and associated weight w0
 Weighted sum of inputs, y = w0·x0 + w1·x1 + ... + wn·xn
 A threshold (activation) function,
i.e., output 1 if y > t, -1 if y ≤ t
Artificial Neural Networks (ANN)
 The model is an assembly of inter-connected nodes and weighted links
 The output node sums up each of its input values according to the weights of its links
 The output node's sum is compared against some threshold t

[Figure: perceptron model as a black box with input nodes X1, X2, X3, link weights w1, w2, w3, and output node Y with threshold t]

Perceptron Model:

Y = I( Σi wi·Xi − t )   or   Y = sign( Σi wi·Xi − t )
Types of connectivity

 Feedforward networks
 These compute a series of transformations
 Typically, the first layer is the input and the last layer is the output
 Recurrent networks
 These have directed cycles in their connection graph. They can have complicated dynamics.
 More biologically realistic

[Figure: a feedforward stack of input units, hidden units, and output units]
Different Network Topologies
 Single-layer feed-forward networks
 Input layer projecting onto the output layer

[Figure: single-layer network, input layer connected directly to output layer]
Different Network Topologies
 Multi-layer feed-forward networks
 One or more hidden layers. Input projects only from previous layers onto a layer.

[Figure: a 2-layer (1-hidden-layer) fully connected network: input layer, hidden layer, output layer]
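To make the layer structure concrete, here is a minimal Python sketch of a forward pass through a 1-hidden-layer network built from the threshold units of the earlier perceptron slides; all sizes, weights, and thresholds are made-up illustrations:

```python
# One threshold unit: fire (1) if the weighted input sum exceeds threshold t.
def unit(inputs, weights, t):
    return 1 if sum(w * x for w, x in zip(weights, inputs)) > t else 0

x = [1, 0, 1]                              # input layer (3 units)
hidden = [unit(x, [0.5, 0.5, -0.2], 0.2),  # hidden layer (2 units)
          unit(x, [-0.3, 0.8, 0.3], 0.0)]
y = unit(hidden, [0.6, 0.6], 0.5)          # output layer (1 unit)
print(hidden, y)
```

Each layer's outputs feed only into the next layer, which is what makes the network feed-forward.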
Different Network Topologies
 Multi-layer feed-forward networks

[Figure: input layer, multiple hidden layers, output layer]
Different Network Topologies
 Recurrent networks
 A network with feedback, where some of its inputs are connected to some of its outputs (discrete time)

[Figure: recurrent network, with output-layer activity fed back into the input layer]
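A minimal sketch of one discrete-time step of such a feedback unit, where the previous output is fed back as an extra input; the weights and the sigmoid activation (introduced on a later slide) are our own illustrative choices:

```python
import math

def sigmoid(s):
    return 1.0 / (1.0 + math.exp(-s))

def step(x, y_prev, w_in=0.8, w_back=0.5):
    """New output from the current input x and the fed-back previous output."""
    return sigmoid(w_in * x + w_back * y_prev)

y = 0.0
for x in [1.0, 0.0, 0.0, 1.0]:  # an input sequence over discrete time
    y = step(x, y)
    print(round(y, 3))
```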
Algorithm for learning ANN
 Initialize the weights (w0, w1, …, wk)
 Adjust the weights in such a way that the output of the ANN is consistent with the class labels of the training examples
 Error function:

E = Σi ( Yi − f(wi, Xi) )²

 Find the weights wi that minimize the above error function
 e.g., gradient descent, backpropagation algorithm
Optimizing concave/convex functions

 Maximum of a concave function = minimum of a convex function
 Gradient ascent (concave) / gradient descent (convex)

Gradient ascent rule: repeatedly move the weights in the direction of the gradient of the function being maximized, w ← w + η · ∂f/∂w; gradient descent flips the sign to minimize (see the sketch below)
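As a concrete instance, here is a hedged Python sketch of gradient descent on the squared error E from the previous slide for a linear unit f(w, x) = w·x; the data and learning rate are our own illustrative choices:

```python
eta = 0.05        # learning rate (assumed)
w = [0.0, 0.0]    # initial weights (assumed)

# Training pairs (x, y) consistent with y = 2*x1 - 1*x2
data = [([1.0, 0.0], 2.0), ([0.0, 1.0], -1.0), ([1.0, 1.0], 1.0)]

for step in range(200):
    # Gradient of E = sum_i (y_i - w.x_i)^2 with respect to w_j is
    # -2 * sum_i (y_i - w.x_i) * x_ij
    grad = [0.0, 0.0]
    for x, y in data:
        err = y - sum(wj * xj for wj, xj in zip(w, x))
        for j in range(len(w)):
            grad[j] += -2.0 * err * x[j]
    # Descent: move against the gradient
    w = [wj - eta * gj for wj, gj in zip(w, grad)]

print(w)  # approaches [2.0, -1.0], the error-minimizing weights
```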


Multi-layer Networks
 Linear units are inappropriate
 No more expressive than a single layer
 Introduce non-linearity
 A threshold is not differentiable
 Use the sigmoid function
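The sigmoid squashes the weighted sum into (0, 1) and, unlike the hard threshold, has a simple derivative, which is what gradient-based learning needs. A small sketch (standard facts, not specific to these slides):

```python
import math

def sigmoid(s):
    # Squashes any real s into the interval (0, 1)
    return 1.0 / (1.0 + math.exp(-s))

def sigmoid_derivative(s):
    # Differentiable everywhere: sigmoid'(s) = sigmoid(s) * (1 - sigmoid(s))
    y = sigmoid(s)
    return y * (1.0 - y)

print(sigmoid(0.0))             # 0.5
print(sigmoid_derivative(0.0))  # 0.25, the maximum slope
```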
Backpropagation
 Iteratively process a set of training tuples & compare the network's
prediction with the actual known target value
 For each training tuple, the weights are modified to minimize the mean
squared error between the network's prediction and the actual target
value
 Modifications are made in the “backwards” direction: from the output
layer, through each hidden layer down to the first hidden layer, hence
“backpropagation”
 Steps
 Initialize weights (to small random #s) and biases in the network

 Propagate the inputs forward (by applying activation function)

 Backpropagate the error (by updating weights and biases)

 Terminating condition (when error is very small, etc.)
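A compact, hedged Python sketch of these steps for a tiny 2-input, 2-hidden-unit, 1-output network trained on XOR; the network size, learning rate, seed, and epoch count are our own assumptions:

```python
import math, random

random.seed(0)

def sigmoid(s):
    return 1.0 / (1.0 + math.exp(-s))

# Step 1: initialize weights and biases to small random numbers
w_h = [[random.uniform(-0.5, 0.5) for _ in range(2)] for _ in range(2)]
b_h = [random.uniform(-0.5, 0.5) for _ in range(2)]
w_o = [random.uniform(-0.5, 0.5) for _ in range(2)]
b_o = random.uniform(-0.5, 0.5)

data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]  # XOR
eta = 0.5  # learning rate (assumed)

for epoch in range(10000):
    for x, target in data:
        # Step 2: propagate the inputs forward through hidden and output layers
        h = [sigmoid(w_h[j][0] * x[0] + w_h[j][1] * x[1] + b_h[j]) for j in range(2)]
        o = sigmoid(w_o[0] * h[0] + w_o[1] * h[1] + b_o)
        # Step 3: backpropagate the error; sigmoid'(s) = y * (1 - y)
        delta_o = (target - o) * o * (1 - o)
        delta_h = [h[j] * (1 - h[j]) * w_o[j] * delta_o for j in range(2)]
        # Update weights and biases: output layer first, then hidden layer
        for j in range(2):
            w_o[j] += eta * delta_o * h[j]
        b_o += eta * delta_o
        for j in range(2):
            for i in range(2):
                w_h[j][i] += eta * delta_h[j] * x[i]
            b_h[j] += eta * delta_h[j]

# Terminating condition here is simply a fixed number of epochs
for x, target in data:
    h = [sigmoid(w_h[j][0] * x[0] + w_h[j][1] * x[1] + b_h[j]) for j in range(2)]
    o = sigmoid(w_o[0] * h[0] + w_o[1] * h[1] + b_o)
    print(x, "->", round(o, 2), "(target:", target, ")")
```

With an unlucky weight initialization the network can get stuck in a local minimum; rerunning with a different seed usually fixes that.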

Neural Network as a Classifier
 Weakness
 Long training time
 Require a number of parameters typically best determined empirically,
e.g., the network topology or “structure.”
 Poor interpretability: Difficult to interpret the symbolic meaning behind
the learned weights and of “hidden units” in the network
 Strength
 High tolerance to noisy data
 Ability to classify untrained patterns
 Well-suited for continuous-valued inputs and outputs
 Successful on a wide array of real-world data
 Algorithms are inherently parallel
 Techniques have recently been developed for the extraction of rules
from trained neural networks
Artificial Neural Networks (ANN)

[Figure: black box with input nodes X1, X2, X3, each linked to output node Y with weight 0.3 and threshold t = 0.4]

X1 X2 X3 | Y
1  0  0  | 0
1  0  1  | 1
1  1  0  | 1
1  1  1  | 1
0  0  1  | 0
0  1  0  | 0
0  1  1  | 1
0  0  0  | 0

Y = I( 0.3·X1 + 0.3·X2 + 0.3·X3 − 0.4 > 0 )

where I(z) = 1 if z is true, 0 otherwise
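A quick check (ours, not from the slides) that this unit reproduces the truth table above: it fires exactly when at least two inputs are 1.

```python
from itertools import product

for x1, x2, x3 in product([0, 1], repeat=3):
    y = 1 if 0.3 * x1 + 0.3 * x2 + 0.3 * x3 - 0.4 > 0 else 0
    print(x1, x2, x3, "->", y)
```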
General Structure of an ANN

[Figure: a feed-forward network with input layer (x1 ... x5), a hidden layer, and an output layer producing y. Detail of neuron i: inputs I1, I2, I3 with weights wi1, wi2, wi3 are summed into Si, and an activation function g(Si) with threshold t produces the output Oi.]

Training an ANN means learning the weights of the neurons
