Artificial neural networks, inspired by the human brain, are among the most powerful
learning models in the field of machine learning.
In the past few years, deep artificial neural networks have proven to perform
surprisingly well on complex tasks such as speech recognition (converting speech
to text), machine translation, and image and video classification. Such models are
also commonly called deep learning models.
A biological neuron works as follows: it receives signals through its dendrites, and
these signals are either amplified or inhibited as they pass along its axon to the
dendrites of other neurons.
Artificial neural networks are a collection of many simple devices called artificial
neurons. The network ‘learns’ to perform certain tasks, such as recognising a cat, by
training the neurons to ‘fire’ in a certain way when given a particular input, such as an
image of a cat. In other words, the network learns to inhibit or amplify the input
signals in order to perform a certain task, such as recognising an animal, speaking a
word or identifying a tree.
Neural networks are applied across various domains, such as images and videos
(computer vision), text and speech. The terms ‘deep learning’ and ‘neural
networks’ are often used interchangeably.
Perceptron
A perceptron acts like a tool that enables you to predict an outcome based on
multiple factors, where each decision factor holds a different ‘weight’. The perceptron
takes different factors as input signals, attaches a weight to each one based on its
importance and performs basic operations on them to produce an output.
In other terms, the perceptron takes a weighted sum of multiple inputs (along with a
bias) as the cumulative input and applies an output function on the cumulative
input to get the output, which then assists in making a decision.
Here, the x_i's represent the inputs, the w_i's represent the weights associated with
the inputs and b is the bias.
A neat and concise way to represent the weighted sum of w and x is the dot product
of the transpose of the weight vector w and the input vector x. Now, let’s
understand this concept of taking the dot product of the transpose of the weight
vector and the input vector.
The transpose of w is w^T = [w1 w2 .... wk], a row vector of size 1 x k. Taking the dot
product of w^T with x and then adding the bias b, you get the following equation:
Cumulative Input = w^T . x + b = w1x1 + w2x2 + ...... + wkxk + b
We then apply the step function to the cumulative input. According to the step
function, if this cumulative weighted sum of inputs is > 0, the output is 1/yes;
otherwise, it is 0/no.
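To make this concrete, here is a minimal sketch of a perceptron in Python; the input
values, weights and bias used below are illustrative assumptions, not part of the
original material:

```python
import numpy as np

def perceptron(x, w, b):
    """Perceptron: weighted sum of inputs plus bias, passed through a step function."""
    cumulative_input = np.dot(w, x) + b      # w^T . x + b
    return 1 if cumulative_input > 0 else 0  # step function: 1/yes if > 0, else 0/no

# Illustrative example with three input factors
x = np.array([1.0, 0.0, 1.0])   # input signals
w = np.array([0.6, -0.4, 0.3])  # weights reflecting the importance of each factor
b = -0.5                        # bias

print(perceptron(x, w, b))      # prints 1, since 0.6 + 0.3 - 0.5 = 0.4 > 0
```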
Single Neuron
Neural networks are a collection of artificial neurons arranged in a particular
structure. Now, you will learn how a single artificial neuron works. A neuron is very
similar to a perceptron. In perceptrons, the activation function used is the step
function, whereas in the case of ANNs, the activation functions are non-linear
functions such as the sigmoid function.
Please take a look at the structure of an artificial neuron in the image given below.
Here, ‘a’ represents the inputs, ‘w’ represents the weights associated with the
inputs and ‘b’ represents the bias of the neuron.
Multiple Artificial Neurons
In a neural network, multiple artificial neurons are arranged in different layers. The
first layer is known as the input layer, and the last layer is called the output layer.
The layers in between these two are the hidden layers. The number of neurons in the
input layer is equal to the number of attributes/features in the data set, and those in
the output layer are determined by the number of classes of the target variable (for
a classification problem). For a regression problem, the number of neurons in the
output layer is 1 (a numeric value). Please take a look at the image given below to
understand the topology of neural networks in the case of classification and
regression problems.
Basic Structure of Artificial Neural Networks
To summarise, the six main things that must be specified for any neural network are
as follows:
1. Input layer
2. Output layer
3. Hidden layers
4. Network topology or structure
5. Weights and biases
6. Activation functions
Input
The most important point to note is that the inputs can only be numeric. For
different types of input data, you can use different ways to convert the inputs to a
numeric form. The commonly used types of input for ANNs are given below:
1. Structured data: The type of data that we use in standard machine learning
algorithms has multiple features and is available in two dimensions, such
that the data can be represented in a tabular format. This type of data can be
used as an input for training ANNs.
2. Text data: For text data, you can use a one-hot vector or word embeddings
corresponding to a certain word (see the sketch after this list).
3. Image: Images are naturally represented as arrays of numbers and can, thus,
be fed into the network directly. These numbers are the raw pixels of an
image. In images, pixels are arranged in rows and columns (an array of pixel
elements).
4. Speech: In the case of a speech/voice input, the basic input unit is in the form
of phonemes. These are distinct units of speech in any language. The speech
signal is in the form of waves, and to convert these waves into numeric inputs,
you need to use the Fourier transform. The input after this conversion will be
numeric, so you will be able to feed it into a neural network.
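As a simple illustration of converting text to a numeric form, here is a minimal
sketch of one-hot encoding over a small vocabulary; the vocabulary and the example
word are illustrative assumptions:

```python
import numpy as np

# Illustrative vocabulary; in practice, this is built from the training corpus
vocabulary = ["cat", "dog", "tree", "bird"]

def one_hot(word, vocabulary):
    """Return a one-hot vector with 1 at the word's index and 0 elsewhere."""
    vector = np.zeros(len(vocabulary))
    vector[vocabulary.index(word)] = 1.0
    return vector

print(one_hot("tree", vocabulary))   # [0. 0. 1. 0.]
```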
Output
Depending on the nature of the task, the outputs of neural networks can either be in
the form of classes (if it is a classification problem) or numeric (if it is a regression
problem).
One of the commonly used output functions is the softmax function for
classification. Please take a look at the graphical representation of the softmax
function shown below.
A softmax output is similar to what we get from a multiclass logistic function, which
is commonly used to compute the probability of an output belonging to one of
multiple classes. It is given by the following formula:
p_i = e^(w_i . x') / (e^(w_0 . x') + e^(w_1 . x') + ... + e^(w_(c-1) . x'))
where c is the number of classes (i.e., the number of neurons in the output layer), x' is
the input to the network and the w_i's are the weights associated with the inputs.
Let’s consider the case where the output layer has three neurons, all of which
have the same input x′ (coming from the previous layers in the network). The
weights associated with them are represented as w0, w1 and w2. In such a case, the
probability of the input belonging to each of the classes is as follows:
p_i = e^(w_i . x') / (e^(w_0 . x') + e^(w_1 . x') + e^(w_2 . x')), for i = 0, 1, 2
Also, from these expressions, it is evident that the sum p0 + p1 + p2 is 1 and that
p0, p1, p2 ϵ (0,1).
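A minimal numerical sketch of the softmax computation for the three-neuron case is
given below; the weight and input values are illustrative assumptions:

```python
import numpy as np

def softmax(scores):
    """Softmax: exponentiate each score and normalise so the outputs sum to 1."""
    exps = np.exp(scores)
    return exps / np.sum(exps)

# Illustrative weights of the three output neurons and a shared input x'
W = np.array([[0.2, -0.1],    # w0
              [0.5,  0.3],    # w1
              [-0.4, 0.8]])   # w2
x_prime = np.array([1.0, 2.0])

scores = W @ x_prime          # w_i . x' for each output neuron
p = softmax(scores)

print(p)            # three probabilities, each in (0, 1)
print(p.sum())      # sums to 1
```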
We have seen the softmax function as a commonly used output function in
multiclass classification. Now, you will learn how this function translates to the
sigmoid function in the special case of binary classification.
In the case of a sigmoid output, only one neuron is present in the output layer, since
if there are two classes with probabilities p0 and p1, we know that p0 + p1 = 1. Hence,
we need to compute the value of only one of p0 and p1. In other words, the sigmoid
function is just a special case of the softmax function (since binary classification is a
special case of multiclass classification).
We can derive the sigmoid function from the softmax function as follows. Assume
that the softmax function has two output neurons with outputs p0 and p1. Consider
only p1, and divide both its numerator and its denominator by the numerator. If we
then replace (w1 − w0) with a new weight w, we get the sigmoid function. The full
derivation is sketched below.
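The derivation can be written out as follows; this is a sketch using the same notation
as above, where x' is the shared input and w0, w1 are the weights of the two output
neurons:

```latex
% Softmax with two output neurons (binary classification)
p_0 = \frac{e^{w_0 \cdot x'}}{e^{w_0 \cdot x'} + e^{w_1 \cdot x'}},
\qquad
p_1 = \frac{e^{w_1 \cdot x'}}{e^{w_0 \cdot x'} + e^{w_1 \cdot x'}}

% Divide the numerator and the denominator of p_1 by its numerator e^{w_1 \cdot x'}
p_1 = \frac{1}{1 + e^{(w_0 - w_1) \cdot x'}}
    = \frac{1}{1 + e^{-(w_1 - w_0) \cdot x'}}

% Writing w = w_1 - w_0 gives the sigmoid function
p_1 = \frac{1}{1 + e^{-w \cdot x'}}
```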
Internal Workings of a Neuron
The visual representation of how inputs are fed into a neuron and how we obtain
outputs using activation functions is shown below.
In the image, you can see that x1, x2 and x3 are the inputs, and their weighted sum
along with a bias is fed into the neuron to give the calculated result as the output.
The weights are applied on each of the inputs, and along with the bias, the
cumulative input is fed to the neuron. An activation function is then applied to the
cumulative input to obtain the neuron’s output. In the previous segment, you learnt
about some of the activation functions, such as softmax and sigmoid. In the next
segment, we will explore more types of activation functions. These functions apply
non-linearity to the cumulative input, enabling the neural network to identify
complex non-linear patterns present in the data.
An in-depth representation of the cumulative input (z) is shown below.
In this image, z is the cumulative input. You can see how the weights affect the
inputs depending on their magnitudes. z is the dot product of the weights and inputs
plus the bias.
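A minimal sketch of this computation for a single neuron with a sigmoid activation is
given below; the input and weight values are illustrative assumptions:

```python
import numpy as np

def sigmoid(z):
    """Sigmoid activation: squashes the cumulative input into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative inputs x1, x2, x3 with their weights and a bias
x = np.array([0.5, -1.0, 2.0])
w = np.array([0.4, 0.7, -0.2])
b = 0.1

z = np.dot(w, x) + b   # cumulative input: dot product of weights and inputs plus the bias
a = sigmoid(z)         # neuron output after applying the activation function

print(z, a)
```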
Previously, you have learnt how a neuron takes an input and performs some
operations on it to give the output. The output is obtained through an activation
function.
Activation functions introduce non-linearity into the network, thus making the network
capable of solving very complex problems. The problems that neural networks are
used for require the ANN to recognise complex patterns and trends in the data. If no
non-linearity were introduced, the output would just be a linear function of the input
vector, which would not help us capture the more complex patterns present in the
data.
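To see why this matters, here is a small sketch showing that stacking layers without
a non-linear activation collapses into a single linear transformation; the matrices
used are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

# Two "layers" without any activation function: h = W1 x, y = W2 h
W1 = rng.normal(size=(4, 3))
W2 = rng.normal(size=(2, 4))
x = rng.normal(size=3)

y_two_layers = W2 @ (W1 @ x)

# The same result comes from a single linear layer with W = W2 W1,
# so depth adds nothing unless a non-linearity is applied in between.
W = W2 @ W1
y_one_layer = W @ x

print(np.allclose(y_two_layers, y_one_layer))   # True
```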
Please take a look at the image provided below that shows the graphical
representation of a linear function and one of the possible representations of a
non-linear function.
The main conditions that you need to keep in mind while choosing activation
functions are that they should be:
● Non-linear
● Continuous
● Monotonically increasing
The different commonly used activation functions are given below; a small sketch of
each follows the list.
1. Sigmoid
2. Hyperbolic Tangent (Tanh)
3. Rectified Linear Unit (ReLU)
4. Leaky Rectified Linear Unit (Leaky ReLU)
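As a rough sketch, the four activation functions listed above can be written in Python
as follows (applied element-wise to NumPy arrays; the sample inputs are illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))       # output in (0, 1)

def tanh(z):
    return np.tanh(z)                      # output in (-1, 1)

def relu(z):
    return np.maximum(0.0, z)              # 0 for negative inputs, identity otherwise

def leaky_relu(z, alpha=0.01):
    return np.where(z > 0, z, alpha * z)   # small slope alpha for negative inputs

z = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
for f in (sigmoid, tanh, relu, leaky_relu):
    print(f.__name__, f(z))
```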
Parameters and Hyperparameters of Neural Networks
During training, the neural network learning algorithm fits various models to the
training data and selects the best model. The learning algorithm is trained with a
predefined fixed set of hyperparameters associated with a network structure. Some
of these are given below:
● Number of layers
● Number of neurons in the input, hidden and output layers
● Learning rate (the step size taken each time we update the weights and
biases of an ANN)
● Number of epochs (the number of times the entire training data set passes
through the neural network)
The purpose of training is to obtain optimum weights and biases, which form the
parameters of the network.
The notations that you will come across are as follows:
1. W represents the weight matrix.
2. b stands for bias.
3. x represents the input.
4. y represents the ground truth label.
5. p represents the probability vector of the predicted output for the
classification problem, and h^L represents the predicted output for the regression
problem (where L represents the number of layers).
6. h also represents the output of the hidden layers, with an appropriate superscript.
The output of the second neuron in the nth hidden layer is denoted by h_2^n.
7. z represents the accumulated input to a layer. The accumulated input to the
third neuron of the nth hidden layer is z_3^n.
8. The bias of the first neuron of the third layer is represented as b_1^3.
9. The superscript represents the layer number. The weight matrix connecting
the first hidden layer to the second hidden layer is denoted by W^2.
10. The subscript represents the index of an individual neuron in a given layer. The
weight connecting the first neuron of the first hidden layer to the third neuron
of the second hidden layer is denoted by w_31^2.
Assumptions for Simplifying Neural Networks
Commonly used neural network architectures make the following simplifying
assumptions (a small sketch putting them together follows the list):
1. The neurons in an ANN are arranged in layers, and these layers are arranged
sequentially.
2. The neurons within the same layer do not interact with each other.
3. Inputs are fed to the network through the input layer, and the outputs are sent
out from the output layer.
4. Neurons in consecutive layers are densely connected, i.e., all neurons in layer l
are connected to all neurons in layer l+1.
5. Every neuron in a neural network has a bias associated with it, and each
interconnection has a weight associated with it.
6. All neurons in a particular hidden layer use the same activation function.
Different hidden layers can use different activation functions, but in a hidden
layer, all neurons use the same activation function.
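Putting these assumptions together, here is a minimal sketch of a dense feedforward
network with one hidden layer, using the notation above; the layer sizes, random
weights and the input values are illustrative assumptions:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    exps = np.exp(z - np.max(z))   # subtract the max for numerical stability
    return exps / np.sum(exps)

rng = np.random.default_rng(42)

# Illustrative sizes: 3 input features, 4 hidden neurons, 2 output classes
n_input, n_hidden, n_output = 3, 4, 2

# Parameters: one weight matrix and one bias vector per layer (W^1, b^1, W^2, b^2)
W1 = rng.normal(size=(n_hidden, n_input));  b1 = np.zeros(n_hidden)
W2 = rng.normal(size=(n_output, n_hidden)); b2 = np.zeros(n_output)

x = np.array([0.2, -1.3, 0.8])   # one data point with 3 features

# Hidden layer: every neuron sees every input (dense), all use the same activation
z1 = W1 @ x + b1
h1 = sigmoid(z1)

# Output layer: softmax turns the scores into class probabilities
z2 = W2 @ h1 + b2
p = softmax(z2)

print(p, p.sum())   # two probabilities that sum to 1
```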