0% found this document useful (0 votes)

138 views

A Concise Introduction To Machine Learni PDF

This document provides a concise introduction to machine learning with artificial neural networks. It defines machine learning as a field that allows computers to learn without being explicitly programmed. The document outlines the main types of machine learning problems - supervised learning, unsupervised learning, and reinforcement learning. It then defines artificial neural networks as mathematical models inspired by biological neural networks that are able to perform both classification and regression. The key components of artificial neurons are described, including dendrites, cell bodies, axons and synapses. Neural networks can be represented as graphs consisting of connected nodes.

Uploaded by

Nitesh agrahari

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

138 views

A Concise Introduction To Machine Learni PDF

Uploaded by

Nitesh agrahari

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

A Concise Introduction to Machine Learning

with Artificial Neural Networks

Oleksandr Zaytsev
May 2016

1 Introduction
In this paper I will try to give a concise and comprehensive introduction to the theory of Artificial
Neural Networks.

2 Machine Learning
In recent years Machine Learning became one of the most promising and rapidly developing
fields in Computer Science. It tackles the problems that classical programming and sometimes
also humans can’t handle. In this section I will give the short introduction to the field of Machine
Learning.
In his book Information Theory, Inference, and Learning Algorithms [3] David MacKay writes:
Machine learning allows us to tackle tasks that are too dicult to solve with fixed
programs written and designed by human beings. From a scientic and philosophical
point of view, machine learning is interesting because developing our understand-
ing of machine learning entails developing our understanding of the principles that
underlie intelligence.

Definition The intuitive definition of machine learning was given by Arthur Samuel in 1959:

Machine Learning is a field of study that gives computers the ability to learn without
being explicitly programmed.

This definition is nice and easy to understand. Though, to work with machine learning as a
scientific field (indeed, it is a field of computer science, strongly related to some mathematical
fields, such as computational statistics and mathematical optimization), we need a more for-
mal definition. One can be found in Tom Mitchell’s book Machine Learning (1997) [4]. This
definition is widely known and often reffered to as a well-posed learning problem:

A computer program is said to learn from experience E with respect to some class of
tasks T and performance measure P , if its performance at tasks in T , as measured
by P , improves with experience E.

1
3 NEURAL NETWORKS

Machine learning problems By the way we measure performance P on task T machine

learning tasks can be divided into three main classes:

• Supervised Learning - the agent receives the set of examples with labels (”right an-
swers”) to learn from
• Unsupervised Learning - no explicit feedback is provided. The agent should learn
patterns in an unlabeled dataset
• Reinforcement Learning - the agent receives series of reinforcements - rewards or pun-
ishments (for example, winning or loosing the chess game)

Here are the formal definitions for the problems of supervised and unsupervised machine learn-
ing1 .

Task of supervised learning Given a training set of m example input-output pairs

(x(1) , y (1) ), (x(2) , y (2) ), ..., (x(m) , y (m) )

where each y (j) is generated by an unknown function y (j) = f (x(j) )

Goal: discover the function h that approximates function f .

Task of unsupervised learning Given a large unlabeled dataset of m input examples

x(1) , x(2) , ..., x(m)

where each x(i) ∈ Rn .

Goal: learn a model for the inputs and then apply it to a specific machine learning task.

3 Neural Networks
An artificial neural networks is one of the most developed and widely used algorithms of machine
learning. It is the mathematical model of brain’s activity that is able to tackle both problems of
classification and regression. Neural network can function as a model of supervised, unsupervised
or reinforcement learning.

Definition Simon Haykin [1] offers the following definition:

A neural network is a massively parallel distributed processor made up of simple
processing units, which has a natural propensity for storing experiential knowledge
and making it available for use. It resembles brain in two respects:
1. Knowledge is aquired by network from its environment through a learning pro-
cess.
2. Interneuron connection strengths, known as synaptic weights, are used to store
the acquired knowledge.
1 Formal definitions of supervised and unsupervissed learning problems are inspired by [6] and [5] respectively

2
4 MODEL OF A NEURON

Since their invention in 50-s neural networks have been used to model human brain and approach
the goal of creating human-like artificial intelligence. Nowadays it is more common to think of
neural networks as of the statistical models that perform well on some extremely complicated
tasks. For example, Hastie et. al. [7] view neural networks as nonlinear statistical models, the
two-stage regression or classification models. David MacKay [3] sees them as parallel distributed
computational systems consisting of many interacting simple elements. And Goodfellow et. al.
(MIT) [2] write the following

Modern neural network research is guided by many mathematical and engineering

disciplines, and the goal of neural networks is not to perfectly model the brain. It
is best to think of feedforward networks as function approximation machines that
are designed to achieve statistical generalization, occasionally drawing some insights
from what we know about the brain, rather than as models of brain function.

4 Model of a Neuron
A biological neural network (brain) consists of cells called neurons. Human brain is composed
of about 10 billion neurons, each connected to about 10,000 other neurons. The same applies
to artificial neural network - they consists of many artificial neurons - mathematical models of
biological ones. I will start this section by describing the structure of a biological neuron. Then
I will provide a formal description of an artificial neuron as a mathematical model.

Biological neurons According to the medical definition, provided by Merriam-Webster dic-

tionary2 ,
Neuron is a cell that carries messages between the brain and other parts of the body
and that is the basic unit of the nervous system.
Figure 1 shows the simplified schematic of a biological neuron. Each neuron consists of a cell
body, or soma, that contains a cell nucleus. Branching out from the cell body are a number
of fibers called dendrites and a single long fiber called the axon. The axon stretches out for
a long distance, much longer than the scale in this diagram indicates. Typically, an axon is
1 cm long (100 times the diameter of the cell body), but can reach up to 1 meter. A neuron
makes connections with 10 to 100,000 other neurons at junctions called synapses. Signals are
propagated from neuron to neuron by a complicated electrochemical reaction. The signals
control brain activity in the short term and also enable long-term changes in the connectivity
of neurons. These mechanisms are thought to form the basis fur learning in the brain. Each
neuron receives electrochemical inputs from other neurons at the dendrites. If the sum of these
electrical inputs is sufficiently powerful to activate the neuron, it transmits an electrochemical
signal along the axon, and passes this signal to the other neurons whose dendrites are attached
at any of the axon terminals. It is important to note that a neuron fires only if the total signal
received at the cell body exceeds a certain level. The neuron either fires or it doesn’t, there
aren’t different grades of firing.
2 http://www.merriam-webster.com/dictionary/neuron

3
4 MODEL OF A NEURON

Figure 1: A schematic of biological neuron

Artificial neurons In fact, artificial neuron can be viewed as a parametric function of n

inputs gw : X n → Y , where X is the space of all possible input values, Y - the space of output
values and w - the vector of numeric weights w1 , w2 , ..., wn , the tunable parameters that [...]. In
other words, function gw is [defined with the rule]

y = gw (x) (1)
n n
where x ∈ X , y ∈ Y and w ∈ R .
Function gw is a superposition of two functions, sw : X n → R and f : R → Y :
gw (x) = f (sw (x))
where sw is defined as follows:
n
X
sw (x) = wi xi
i=0
f is a nonlinear activation function in case of classification and an identity function ∀xf (x) = x
in case of regression.

Figure 2: Model of artifitial neuron

4
5 STRUCTURE AND REPRESENTATION

5 Structure and Representation

Graphical representation Neural networks are composed of nodes (neurons), connected by
direct links (synaptic connections). If we think of neurons as vertices and synaptic connections
as edges, neural networks can be represented by weighted directed graphs called network
diagrams.
If the network is feedforward (without cycles), the graph will be k-partide, where k is the number
of layers. If it is also fully-connected, it will be a complete k-partide graph. See Figure 3 for
specific examples.

Figure 3: (1) Recurrent neural network represented by directed graph. (2) 3-layered feedfor-
ward neural network represented by 3-partide directed graph. (3) 3-layered fully-connected
feedforward neural network represented by a complete 3-partide directed graph

Network topologies As it was already said, neural networks consist of neurons. These
neurons are connected with directed links (synaptic connections) with numeric weights that
determine the strength and sign of the connection.
Neurons are grouped into layers. First layer is called input layer, the last one - output layer.
All the layers between input and output layers are called hidden layers. Number of hidden
layers is one of the tunable metaparameters that define the architecture of a neural network.
According to [7], typically the number of hidden units is somewhere in the range of 5-100.
To satisfy the linear model of a regression each layer, except for the output one, has an additional
bias unit b = 1.
For a specific example of neural network architecture see Figure 4.
By the type of connections neural network can be either feedforward or recurrent:
• Feedforward network - has connections only in one direction (outputs of neurons from
layer k can be connected only to neurons of layers k + c where c > 0). The network
diagram of a feedforward netwond forms a directed acyclic graph.
• Recurrent network - feeds its outputs to its own inputs. This network has at least one
cycle (at least one connection from neuron in layer k to neuron in layer k − c where c ≥ 0).
If in neural network with N layers ∀k : 0 ≤ k ≤ N − 1 every neuron in layer k is connected to
all the neurons of layer k + 1, the network is called fully-connected.
Figure 4 depicts the schematic of a feedforward neural network with one hidden layer.

5
6 NETWORK TRAINING

Figure 4: Feedforward artificial neural network with one hidden layer. Number of neurons in
each layer: 3 (2-dimentional input + bias unit) in the input layer, 4 in the hidden layer, 1 in
the output layer (1-demensional output)

6 Network Training
I will describe the training of a neural network with backpropagation of errors.
As the number of hidden layers grows, the problem arises, as we can only compute the error of
output layer by finding the deviation of hypothesis hw from the desired output y.
The fitness function for regression is the sum of squared errors
K X
X n
J(w) = (yi − fk (xi ))2 (2)
k=1 i=1

For classification it is defined as

n
X
J(w) = yik log fk (xi (3)
i=1

The task of learning is to minimize the fitness function J(w). There are different learning algo-
rithms for doing that. In this paper I will explain only the idea of the classical backpropagation
algorithm.
Backpropagation is an abbreviation for ”backward propagation of errors”. According to [7],
backpropagation is a two-pass procedure, used to compute the gradients for the updates in
gradient descent algorithm:

• Forward pass -the current weights are fixed and the predicted values are computed

6
REFERENCES REFERENCES

• Backward pass - the errors δki are computed and then backpropagated to give errors
sij . Both sets of errors are then used to compute the gradients for updates.

The main advantage of backpropagation is that it has local nature, and thus it can be efficiently
implemented on a parallel architecture computer.

References
[1] Simon Haykin. Neural Networks. A Comprehensive Foundation. Prentice Hall, second edi-
tion, 2005.
[2] Yoshua Bengio Ian Goodfellow and Aaron Courville. Deep learning. Book in preparation
for MIT Press, 2016.
[3] David J. C. MacKay. Information Theory, Inference, and Learning Algorithms. Cambridge
University Press, 2003.

[4] Tom Mitchell. Machine Learning. McGraw-Hill Science/Engineering/Math, 1997.

[5] Andrew Y. Ng Rajat Raina, Anand Madvavan. Large-scale deep unsupervised learning using
graphics processors. 2009.
[6] Peter Norvig Stuart Russell. Artificial Intelligence. A Modern Approach. Prentice Hall,
third edition, 2010.
[7] Jerome Friedman Trevor Hastie, Robert Tibshirani. The Elements of Statistical Learning.
Data Mining, Inference and Prediction. Springer, second edition, 2013.

Science Quiz Bee Reviewer
90% (20)
Science Quiz Bee Reviewer
26 pages
Emotional Design Donald Norman
No ratings yet
Emotional Design Donald Norman
268 pages
Nervous System Anatomy and Physiology Lecture
81% (16)
Nervous System Anatomy and Physiology Lecture
105 pages
Artificial Intelligence Artificial Neural Networks - : Introduction
No ratings yet
Artificial Intelligence Artificial Neural Networks - : Introduction
43 pages
Histology Table Reviewer
No ratings yet
Histology Table Reviewer
4 pages
Gaurav Hivre Report
No ratings yet
Gaurav Hivre Report
32 pages
Neural Networks
No ratings yet
Neural Networks
28 pages
Introduction To ANN
No ratings yet
Introduction To ANN
47 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
14 pages
Neural Networks: Tricea Wade (99-002187) Swain Henry (02-006844)
No ratings yet
Neural Networks: Tricea Wade (99-002187) Swain Henry (02-006844)
11 pages
Chap 1
No ratings yet
Chap 1
20 pages
Introduction To Neural Networks: Training Learn Generalization
No ratings yet
Introduction To Neural Networks: Training Learn Generalization
46 pages
Introduction To Neural Networks
100% (1)
Introduction To Neural Networks
46 pages
Unit 5
No ratings yet
Unit 5
75 pages
brain and Neuron
No ratings yet
brain and Neuron
18 pages
Ict L2 PDF
No ratings yet
Ict L2 PDF
49 pages
2 DeepLearning
No ratings yet
2 DeepLearning
46 pages
Neural Networks
No ratings yet
Neural Networks
32 pages
This Document Is About Artificial Inteligence.
No ratings yet
This Document Is About Artificial Inteligence.
81 pages
ML - Chapter 5 - Neural Network
No ratings yet
ML - Chapter 5 - Neural Network
64 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
75 pages
Artifcial Neural Network": "A Project On
No ratings yet
Artifcial Neural Network": "A Project On
31 pages
ANN Lecture Note1F
No ratings yet
ANN Lecture Note1F
50 pages
M.tech DL
No ratings yet
M.tech DL
221 pages
Unit 1
No ratings yet
Unit 1
25 pages
L2-ANN
No ratings yet
L2-ANN
27 pages
NN Lecture1 Introduction
No ratings yet
NN Lecture1 Introduction
40 pages
Neural and Fuzzy Systems
No ratings yet
Neural and Fuzzy Systems
27 pages
NN-BNU1
No ratings yet
NN-BNU1
31 pages
Neural Network
No ratings yet
Neural Network
37 pages
ANN Material
No ratings yet
ANN Material
99 pages
Machine Learning Using Neural Networks: Presentation By: C. Vinoth Kumar SSN College of Engineering
No ratings yet
Machine Learning Using Neural Networks: Presentation By: C. Vinoth Kumar SSN College of Engineering
24 pages
Deep learning
No ratings yet
Deep learning
14 pages
Artificial Neural Networks
No ratings yet
Artificial Neural Networks
48 pages
Neural Networks
No ratings yet
Neural Networks
75 pages
UNIT II Basic On Neural Networks
No ratings yet
UNIT II Basic On Neural Networks
36 pages
Neural Networks
No ratings yet
Neural Networks
32 pages
UNIT1_C
No ratings yet
UNIT1_C
21 pages
ANN Unit 3
No ratings yet
ANN Unit 3
26 pages
Artificial Neural Network: Synapses Weight The Individual Parts of Information
No ratings yet
Artificial Neural Network: Synapses Weight The Individual Parts of Information
8 pages
Neural Network – Overview
No ratings yet
Neural Network – Overview
37 pages
1.1 What Is A Neural Network?
No ratings yet
1.1 What Is A Neural Network?
3 pages
1 Perceptron1
No ratings yet
1 Perceptron1
25 pages
Ann mod1
No ratings yet
Ann mod1
106 pages
Introduction To Neural Networks
No ratings yet
Introduction To Neural Networks
51 pages
ANN Unit 1
No ratings yet
ANN Unit 1
77 pages
Neuron, Neueral Network
No ratings yet
Neuron, Neueral Network
35 pages
Unit 1
No ratings yet
Unit 1
29 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
31 pages
AI Mod4 Session 8 Best Fit Line & ANN
No ratings yet
AI Mod4 Session 8 Best Fit Line & ANN
39 pages
Neural Networks
No ratings yet
Neural Networks
16 pages
ml 6th (2)
No ratings yet
ml 6th (2)
21 pages
Introduction of Neural Network
No ratings yet
Introduction of Neural Network
31 pages
Minin Handout
No ratings yet
Minin Handout
13 pages
Neural Networks - Comprehensive Foundation (Introduction)
No ratings yet
Neural Networks - Comprehensive Foundation (Introduction)
47 pages
Neural Networks
No ratings yet
Neural Networks
36 pages
Anns
No ratings yet
Anns
19 pages
Unit-1 (1)
No ratings yet
Unit-1 (1)
89 pages
Contents:: 1. Introduction To Neural Networks
No ratings yet
Contents:: 1. Introduction To Neural Networks
27 pages
Content Library Read
No ratings yet
Content Library Read
25 pages
ML Unit-5 Final
No ratings yet
ML Unit-5 Final
23 pages
Ann2018 L2 PDF
No ratings yet
Ann2018 L2 PDF
18 pages
Neural Network
No ratings yet
Neural Network
85 pages
Neural Networks
From Everand
Neural Networks
Sasha Kurzweil
No ratings yet
1-Simple Linearregression
No ratings yet
1-Simple Linearregression
8 pages
EE534 Line Codes
No ratings yet
EE534 Line Codes
66 pages
EE633 - Online - Lect4 - OFDM - Part - 2
No ratings yet
EE633 - Online - Lect4 - OFDM - Part - 2
9 pages
ARQ Protocols and Reliable Data Transfer
No ratings yet
ARQ Protocols and Reliable Data Transfer
48 pages
EE533 - Online - Lect5 - OFDM - Part - 3
No ratings yet
EE533 - Online - Lect5 - OFDM - Part - 3
9 pages
EE533 Summary Topics Pre Lockdown
No ratings yet
EE533 Summary Topics Pre Lockdown
10 pages
Power Control in Single-Cell Massive MIMO Systems
No ratings yet
Power Control in Single-Cell Massive MIMO Systems
17 pages
Ee534 DLL HDLC
No ratings yet
Ee534 DLL HDLC
41 pages
Effects of Pilot Contamination and Non-Synchronous Transmission
No ratings yet
Effects of Pilot Contamination and Non-Synchronous Transmission
12 pages
Power Control in Multi-Cell Massive MIMO Systems
No ratings yet
Power Control in Multi-Cell Massive MIMO Systems
20 pages
Data Link Layer: EE534 Dr. Sonali Chouhan Dept of EEE, IITG
No ratings yet
Data Link Layer: EE534 Dr. Sonali Chouhan Dept of EEE, IITG
53 pages
Report Final Propeller PDF
No ratings yet
Report Final Propeller PDF
31 pages
Ee534 - Udp - TCP Transport Layer Protocol
No ratings yet
Ee534 - Udp - TCP Transport Layer Protocol
108 pages
LMMSE (Linear Minimum Mean Square Estimation) Detection
No ratings yet
LMMSE (Linear Minimum Mean Square Estimation) Detection
7 pages
Recap: UL Pilots and Channel Estimation in Multiple Cells
No ratings yet
Recap: UL Pilots and Channel Estimation in Multiple Cells
10 pages
EE 590 - Linear Algebra & Optimization Assignment 2: Steepest Descent Method
No ratings yet
EE 590 - Linear Algebra & Optimization Assignment 2: Steepest Descent Method
1 page
Anatomy and Physiology of Nervous System
100% (1)
Anatomy and Physiology of Nervous System
32 pages
Chapter - 6: Tissues
No ratings yet
Chapter - 6: Tissues
20 pages
Chapter 7 - NERVOUS SYSTEM PDF
No ratings yet
Chapter 7 - NERVOUS SYSTEM PDF
58 pages
Unit 5 - AI PROJECT CYCLE
No ratings yet
Unit 5 - AI PROJECT CYCLE
16 pages
The Strategic Game of ? and ?: John R. Boyd
No ratings yet
The Strategic Game of ? and ?: John R. Boyd
61 pages
X CBSE Prelim-1 Science 2
No ratings yet
X CBSE Prelim-1 Science 2
7 pages
Herbarium. Herbs and Supplements For Lyme Disease - Lymeherbs
No ratings yet
Herbarium. Herbs and Supplements For Lyme Disease - Lymeherbs
30 pages
Medical Electronics: Mr. Deepak P. Associate Professor ECE Department Sngce
No ratings yet
Medical Electronics: Mr. Deepak P. Associate Professor ECE Department Sngce
128 pages
2 Information Brochure
No ratings yet
2 Information Brochure
40 pages
science Ncert short notes
No ratings yet
science Ncert short notes
14 pages
Circardian Rhythm
No ratings yet
Circardian Rhythm
15 pages
Y2023 Lecture 4 Bioelectricity
No ratings yet
Y2023 Lecture 4 Bioelectricity
59 pages
(Ebook) Biological psychology by Toates F.M. ISBN 9780273734994, 0273734997 download
100% (1)
(Ebook) Biological psychology by Toates F.M. ISBN 9780273734994, 0273734997 download
39 pages
Brain Structure and Function PDF
No ratings yet
Brain Structure and Function PDF
35 pages
Bio Psy
No ratings yet
Bio Psy
3 pages
In Vitro Models of Neurodegenerative Diseases
No ratings yet
In Vitro Models of Neurodegenerative Diseases
18 pages
Action Potential: DR Raghuveer Choudhary Associate Professor
100% (2)
Action Potential: DR Raghuveer Choudhary Associate Professor
63 pages
Chapter 29 The Nervous System
No ratings yet
Chapter 29 The Nervous System
64 pages
2023 AL Subject Report
No ratings yet
2023 AL Subject Report
269 pages
Module 5 The Powers of The MInd
100% (6)
Module 5 The Powers of The MInd
28 pages
Module 2.3. - Muscular System
No ratings yet
Module 2.3. - Muscular System
148 pages
Neurotransmitters-Key Factors in Neurological and Neurodegenerative Disorders of
No ratings yet
Neurotransmitters-Key Factors in Neurological and Neurodegenerative Disorders of
10 pages
CONGENITAL ABNORMALITIES OF THE CNS
No ratings yet
CONGENITAL ABNORMALITIES OF THE CNS
7 pages
Grade 10 3rd Quarter Science Reviewer
No ratings yet
Grade 10 3rd Quarter Science Reviewer
19 pages
Important Instructions For The School Principal: (Not To Be Printed With The Question Paper)
No ratings yet
Important Instructions For The School Principal: (Not To Be Printed With The Question Paper)
15 pages
Microbiology of Clostridium Tetani and Wound Classification
No ratings yet
Microbiology of Clostridium Tetani and Wound Classification
3 pages

A Concise Introduction To Machine Learni PDF

Uploaded by

A Concise Introduction To Machine Learni PDF

Uploaded by

A Concise Introduction to Machine Learning

with Artificial Neural Networks

Machine learning problems By the way we measure performance P on task T machine

Task of supervised learning Given a training set of m example input-output pairs

(x(1) , y (1) ), (x(2) , y (2) ), ..., (x(m) , y (m) )

where each y (j) is generated by an unknown function y (j) = f (x(j) )

Task of unsupervised learning Given a large unlabeled dataset of m input examples

x(1) , x(2) , ..., x(m)

where each x(i) ∈ Rn .

Definition Simon Haykin [1] offers the following definition:

Modern neural network research is guided by many mathematical and engineering

Biological neurons According to the medical definition, provided by Merriam-Webster dic-

Figure 1: A schematic of biological neuron

Artificial neurons In fact, artificial neuron can be viewed as a parametric function of n

Figure 2: Model of artifitial neuron

5 Structure and Representation

For classification it is defined as

[4] Tom Mitchell. Machine Learning. McGraw-Hill Science/Engineering/Math, 1997.

You might also like