CNN Regularization

CO 44251: DEEP LEARNING

Trishna Saikia
Pruning
• It is a technique used in deep learning to reduce the size of a neural network by eliminating less important units
(neurons) or connections between them, with the goal of improving efficiency and reducing overfitting.

• The term "pruning" refers to the process of selectively removing parts of a model that are considered less
significant to its performance.
Weight pruning

• Set individual weights in the weight matrix to zero. This corresponds to deleting connections, as in the figure.

• Here, to achieve sparsity of k%, we rank the individual weights in the weight matrix W according to their magnitude, and then set the smallest k% to zero (see the sketch below).
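
A minimal NumPy sketch of this magnitude-based weight pruning, assuming a dense weight matrix W; the function name and the k parameter are illustrative, not part of any particular library:

```python
import numpy as np

def weight_prune(W, k):
    """Set the smallest k% of weights (by magnitude) to zero."""
    W = W.copy()
    magnitudes = np.abs(W).ravel()
    n_prune = int(magnitudes.size * k / 100)        # how many weights to remove
    if n_prune == 0:
        return W
    threshold = np.sort(magnitudes)[n_prune - 1]    # largest magnitude among the smallest k%
    W[np.abs(W) <= threshold] = 0.0                 # delete (zero out) those connections
    return W

# Example: 50% sparsity on a random 4x4 weight matrix
print(weight_prune(np.random.randn(4, 4), k=50))
```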

Unit/Neuron pruning

• Set entire columns of the weight matrix to zero, in effect deleting the corresponding output neuron.

• Here, to achieve sparsity of k%, we rank the columns of the weight matrix according to their L2-norm and delete the smallest k% (see the sketch below).

For more details: https://towardsdatascience.com/pruning-deep-neural-network-56cae1ec5505
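
A corresponding NumPy sketch of unit/neuron pruning under the same assumptions, ranking columns by their L2-norm:

```python
import numpy as np

def unit_prune(W, k):
    """Zero out the k% of columns (output neurons) with the smallest L2-norm."""
    W = W.copy()
    col_norms = np.linalg.norm(W, axis=0)           # L2-norm of each column
    n_prune = int(W.shape[1] * k / 100)             # how many neurons to remove
    if n_prune > 0:
        weakest = np.argsort(col_norms)[:n_prune]   # indices of the weakest columns
        W[:, weakest] = 0.0                         # delete those output neurons
    return W

# Example: prune 25% of the output neurons of a random 8x8 weight matrix
print(unit_prune(np.random.randn(8, 8), k=25))
```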
Stochastic pooling

• Stochastic pooling operates over local regions of the input feature map, just like other pooling methods.
• However, instead of deterministically selecting the maximum value (max pooling) or the average value (average
pooling), it selects a value based on a probability distribution derived from the values in the pooling region.
• This introduces a layer of randomness, acting as a regularizer and helping the network to avoid overfitting.

The Stochastic Pooling Process:

1. Pooling Region: Define a local region (e.g., 2x2 or 3x3) in the input feature map.
2. Probability Calculation: Sum all the values in the pooling region and divide each value by this sum to obtain the probability of selecting that value.
3. Sampling: Randomly select one activation from the region according to these probabilities; the selected value becomes the pooled output (see the sketch below).
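
A minimal NumPy sketch of this process over non-overlapping regions, assuming a 2D feature map with non-negative activations (e.g., after ReLU) whose dimensions are divisible by the pooling size; the function name and arguments are illustrative:

```python
import numpy as np

def stochastic_pool(feature_map, size=2):
    """Sample one activation per size x size region, with probability proportional to its value."""
    H, W = feature_map.shape
    out = np.zeros((H // size, W // size))
    for i in range(0, H, size):
        for j in range(0, W, size):
            region = feature_map[i:i + size, j:j + size].ravel()
            total = region.sum()
            if total > 0:
                probs = region / total              # probability of selecting each value
                out[i // size, j // size] = np.random.choice(region, p=probs)
            # if the region is all zeros, the pooled output stays 0
    return out

# Example: pool a random non-negative 4x4 feature map into a 2x2 output
fmap = np.abs(np.random.randn(4, 4))
print(stochastic_pool(fmap, size=2))
```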
Benefits of Stochastic Pooling:

• Regularization: By introducing randomness, stochastic pooling serves as a form of regularization, reducing overfitting
and enhancing the generalization capability of the model.

• Robustness: It prevents the network from becoming overly reliant on specific features, making it more robust to
variations and noise in the input data.

• Feature Diversity: Stochastic pooling can capture a more diverse set of features compared to deterministic pooling
methods, potentially leading to richer representations.
Synthetic Data

• Synthetic data is essentially artificial data created algorithmically.


• It is designed to mimic the characteristics of real-world data without containing any actual information.
• Used widely in data science and machine learning, synthetic data enables algorithms to be tested and improved without
risking the privacy or security of real-world data.
• It can also be used to augment existing datasets, especially in cases where the original data is limited or biased.

Demo: https://openai.com/index/sora/
Early Stopping

It is a regularization technique for deep neural networks that stops training when parameter updates no longer yield improvements on a validation set.

Process:

• During training, a model iteratively updates its weights to minimize a loss function, improving its performance on the
training data.
• However, after a certain point, the model may start to "overfit," learning patterns that are specific to the training data and
not generalizable to new data.
• To track the model's generalization ability, the training process typically includes a validation set (a subset of data not used
for training).
• The performance of the model on the validation set is monitored after each epoch (a complete pass through the training
data).
• As training progresses, the model’s accuracy on the training data may continue to improve, but the validation accuracy might
plateau and then start to degrade. This indicates that the model is overfitting to the training data.
• Early stopping prevents this by stopping the training when the validation performance stops improving for a specified
number of epochs (patience). The model’s weights at the point of best validation performance are retained.
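
A toy, self-contained sketch of the patience logic described above. The training and validation functions here are stand-ins (the validation curve is simulated so that it improves and then degrades), not a real training loop:

```python
import copy
import numpy as np

rng = np.random.default_rng(0)

def train_one_epoch(weights):
    """Stand-in for one pass over the training data (hypothetical helper)."""
    return weights - 0.1 * rng.normal(size=weights.shape)

def validation_loss(epoch):
    """Stand-in for held-out validation loss: improves early, then degrades (overfitting)."""
    return (epoch - 10) ** 2 / 100.0 + rng.normal(scale=0.01)

weights = rng.normal(size=4)
best_loss, best_weights = float("inf"), None
patience, bad_epochs = 5, 0

for epoch in range(100):
    weights = train_one_epoch(weights)              # update weights on the training data
    val_loss = validation_loss(epoch)               # monitor generalization after each epoch

    if val_loss < best_loss:                        # validation still improving
        best_loss, best_weights = val_loss, copy.deepcopy(weights)
        bad_epochs = 0
    else:
        bad_epochs += 1
        if bad_epochs >= patience:                  # no improvement for `patience` epochs: stop
            break

weights = best_weights                              # retain the best-validation checkpoint
print(f"Stopped at epoch {epoch}, best validation loss {best_loss:.3f}")
```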
Weight Decay

• It is a regularization technique used in deep learning to prevent overfitting by penalizing large weights in the model.
• It works by adding a penalty term to the loss function during training, encouraging the model to learn smaller, more
generalized weights.
• This helps in improving the model’s generalization on unseen data.

❖ In weight decay, a regularization term is added to the original loss function (e.g., mean squared error or cross-entropy
loss). The modified loss function becomes:

$$L_{new} = L_{original} + \lambda \sum_{i} w_i^2$$

Where,
➢ $L_{original}$ is the original loss function.
➢ $w_i$ represents the model’s weights.
➢ $\lambda$ is a hyperparameter that controls the strength of the regularization.
➢ The added term $\lambda \sum_{i} w_i^2$ encourages the weights $w_i$ to be smaller, penalizing large weight values.
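
A minimal NumPy sketch of adding this L2 penalty to an existing loss; the mean-squared-error example, the toy data, and the value of $\lambda$ are illustrative assumptions:

```python
import numpy as np

def weight_decay_loss(original_loss, weights, lam):
    """L_new = L_original + lambda * sum_i w_i^2."""
    return original_loss + lam * np.sum(weights ** 2)

# Example: mean squared error on toy data, plus the decay term with lambda = 0.01
rng = np.random.default_rng(0)
x = rng.normal(size=(10, 3))
true_w = np.array([0.4, -1.0, 1.5])
y = x @ true_w
w = np.array([0.5, -1.2, 2.0])                      # current model weights
mse = np.mean((x @ w - y) ** 2)                     # original loss
print(weight_decay_loss(mse, w, lam=0.01))
```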
Controlling Weight Decay:

• If 𝜆 is too small, the regularization effect will be negligible, and the model may overfit.
• If 𝜆 is too large, the weights will shrink too much, and the model might underfit (not learning the data well enough).
• A carefully tuned 𝜆 helps strike a balance between overfitting and underfitting.

Purpose of Weight Decay

• Prevents Overfitting: Models with very large weights tend to fit the training data too closely, leading to poor generalization
on unseen data. By penalizing large weights, weight decay reduces overfitting.

• Improves Generalization: Encouraging smaller weights can lead to simpler models that generalize better on new data.

• Smoothing the Loss Landscape: Weight decay has the effect of smoothing the model’s loss landscape, making it less
sensitive to small variations in the data, which can help in avoiding overfitting.
Thank You
