
Chapter 1

Making the most of your data - Data Augmentation and Transfer Learning

1.1 Data Augmentation

Horizontal Flip

Figure 1.1: Horizontal flip. Mirror the image left-to-right about its vertical axis (a reflection, not a 180-degree rotation).

Random Crops/Scales

Figure 1.2: Random Crops/Scales

Training: sample random crops / scales

Specific example: how ResNet trains (a code sketch follows the list):


1. Pick random L in range [256, 480]

2. Resize training image, short side = L

3. Sample random 224 x 224 patch
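The notes do not tie this recipe to any particular framework; as a minimal sketch, assuming Python with PIL, the three steps might look like this (the helper name is made up for illustration):

import random
from PIL import Image

def resnet_train_crop(img: Image.Image) -> Image.Image:
    """Scale jitter plus a random 224 x 224 crop, following the steps above."""
    # 1. Pick random L in range [256, 480]
    L = random.randint(256, 480)
    # 2. Resize the training image so that its short side equals L
    w, h = img.size
    if w < h:
        img = img.resize((L, round(h * L / w)))
    else:
        img = img.resize((round(w * L / h), L))
    # 3. Sample a random 224 x 224 patch
    w, h = img.size
    left = random.randint(0, w - 224)
    top = random.randint(0, h - 224)
    return img.crop((left, top, left + 224, top + 224))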

Testing: average a fixed set of crops

Specific example: how ResNet tests (see the sketch after this list):

1. Resize image at 5 scales: 224, 256, 384, 480, 640

2. For each scale, take ten 224 x 224 crops: the 4 corners and the center, plus their horizontal flips
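A hedged sketch of this test-time averaging, assuming torchvision's standard transforms (TenCrop produces exactly the 4 corner and center crops together with their flips):

import torch
from torchvision import transforms

SCALES = [224, 256, 384, 480, 640]   # short-side sizes used at test time

def multiscale_tencrop_predict(model, img):
    """Average class scores over 10 crops at each of the 5 scales."""
    model.eval()
    per_scale_scores = []
    with torch.no_grad():
        for s in SCALES:
            crops = transforms.Compose([
                transforms.Resize(s),      # resize so the short side equals s
                transforms.TenCrop(224),   # 4 corners + center, plus their flips
            ])(img)
            batch = torch.stack([transforms.ToTensor()(c) for c in crops])
            per_scale_scores.append(model(batch).mean(dim=0))
    return torch.stack(per_scale_scores).mean(dim=0)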

Color Jitter

Figure 1.3: Color Jitter

• Simple: Randomly jitter contrast

• Complex (see the sketch after this list):

1. Apply PCA to all [R, G, B] pixel values in the training set

2. Sample a color offset along the principal component directions

3. Add the offset to all pixels of a training image
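A minimal NumPy sketch of this PCA-based color jitter (the "fancy PCA" scheme from the AlexNet paper); the function names and the sigma value are illustrative assumptions:

import numpy as np

def fit_color_pca(pixels):
    """Step 1: PCA over all [R, G, B] pixel values of the training set.

    pixels: float array of shape (num_pixels, 3), pooled over the whole training set.
    """
    cov = np.cov(pixels.astype(np.float64), rowvar=False)   # 3x3 RGB covariance
    eigvals, eigvecs = np.linalg.eigh(cov)                   # principal directions of color space
    return eigvals, eigvecs

def pca_color_jitter(image, eigvals, eigvecs, sigma=0.1):
    """Steps 2-3: sample one offset along the principal directions and add it to every pixel."""
    alphas = np.random.normal(0.0, sigma, size=3)   # one random scale per component
    offset = eigvecs @ (alphas * eigvals)           # a single RGB offset for this image
    return image + offset                           # image: (H, W, 3); offset broadcasts over pixels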

Get Creative

In the end you have to think about your specific case: decide which transformations you want the model to be robust to and add them to the training pipeline: translation, rotation, stretching, shearing, lens distortions, ...
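In practice such transformations are usually chained into a single training-time pipeline; a possible sketch with torchvision transforms (the specific choices and parameters are only an example, not a recommendation):

from torchvision import transforms

train_transform = transforms.Compose([
    transforms.RandomHorizontalFlip(),
    transforms.RandomResizedCrop(224, scale=(0.5, 1.0)),                  # random crops/scales
    transforms.ColorJitter(brightness=0.4, contrast=0.4, saturation=0.4), # color jitter
    transforms.RandomAffine(degrees=10, translate=(0.1, 0.1), shear=5),   # rotation, translation, shearing
    transforms.ToTensor(),
])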

1.2 Transfer Learning

In practice, very few people train an entire Deep Network from scratch (with random initializa-
tion), because it is relatively rare to have a dataset of sufficient size. Instead, it is common to
pretrain a ConvNet on a very large dataset (e.g. ImageNet, which contains 1.2 million images
with 1000 categories), and then use the ConvNet either as an initialization or a fixed feature
extractor for the task of interest. Pretrained deep networks are usually trained on large compute clusters and then made available for others to download. The three major Transfer Learning scenarios look as follows:

ConvNet as fixed feature extractor Take a ConvNet pretrained on ImageNet, remove the
last fully-connected layer (this layer's outputs are the 1000 class scores for a different task like
ImageNet), then treat the rest of the ConvNet as a fixed feature extractor for the new dataset.
In an AlexNet, this would compute a 4096-D vector for every image that contains the activations
of the hidden layer immediately before the classifier. We call these features CNN codes.
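A sketch of the fixed-feature-extractor setup, assuming torchvision's pretrained AlexNet (the weights enum used below is available in recent torchvision versions):

import torch
from torchvision import models

# Load a pretrained AlexNet and drop its final 1000-way classification layer,
# keeping the 4096-D activations ("CNN codes") just before the classifier.
alexnet = models.alexnet(weights=models.AlexNet_Weights.IMAGENET1K_V1)
alexnet.classifier = torch.nn.Sequential(*list(alexnet.classifier.children())[:-1])
alexnet.eval()

for p in alexnet.parameters():
    p.requires_grad = False          # fixed extractor: no gradients flow through it

with torch.no_grad():
    x = torch.randn(1, 3, 224, 224)  # stand-in for a preprocessed image batch
    codes = alexnet(x)               # shape [1, 4096]; train any classifier on these codes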

Fine-tuning the ConvNet The second strategy is not only to replace and retrain the classi-
fier on top of the ConvNet on the new dataset, but to also fine-tune the weights of the pretrained
network by continuing the backpropagation. It is possible to fine-tune all the layers of the Con-
vNet, or it's possible to keep some of the earlier layers fixed (due to overfitting concerns) and
only fine-tune some higher-level portion of the network. This is motivated by the observation
that the earlier features of a ConvNet contain more generic features (e.g. edge detectors or color
blob detectors) that should be useful for many tasks, but later layers of the ConvNet become
progressively more specific to the details of the classes contained in the original dataset. In the case
of ImageNet for example, which contains many dog breeds, a significant portion of the repre-
sentational power of the ConvNet may be devoted to features that are specific to differentiating
between dog breeds.
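A possible sketch of this strategy with a torchvision ResNet-18, freezing the generic early layers and fine-tuning only the last residual stage plus a new classifier (the layer choice and class count are illustrative assumptions):

import torch.nn as nn
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)

# Keep the generic early layers fixed; allow the last stage to adapt.
for name, p in model.named_parameters():
    p.requires_grad = name.startswith(("layer4", "fc"))

# Replace the 1000-way ImageNet classifier with a new head for the target task.
num_new_classes = 10   # assumed size of the new label set
model.fc = nn.Linear(model.fc.in_features, num_new_classes)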

Checkpoints of pretrained models Since modern ConvNets take 2-3 weeks to train across
multiple GPUs on ImageNet, it is common to see people release their final ConvNet checkpoints
for the benefit of others who can use the networks for fine-tuning. For example, the Caffe library
has a Model Zoo where people share their network weights.

When and how to fine-tune?

IMPORTANT: when doing transfer learning, if you change both the newly added layers (green in Figure 1.4) and part of the pretrained network (orange), do the following:

• Train the green (new) layers and freeze everything else

• Once the green layers start to converge, unfreeze the orange (pretrained) part that you want to modify

Figure 1.4: Transfer learning: newly added layers (green) on top of the pretrained layers to be fine-tuned (orange).

This staged schedule is necessary because the green layers start from random values, so early on they can produce very large gradients that would destroy the pretrained weights below them.
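A minimal sketch of this two-stage schedule in PyTorch (an assumption; the layer names and learning rates are illustrative):

import torch
import torch.nn as nn
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 10)   # "green": new, randomly initialized head

# Stage 1: train only the green part, freeze everything else.
for name, p in model.named_parameters():
    p.requires_grad = name.startswith("fc")
optimizer = torch.optim.SGD((p for p in model.parameters() if p.requires_grad), lr=1e-2)
# ... train until the new head starts to converge ...

# Stage 2: unfreeze the orange part (here: the last residual stage) and keep training.
for name, p in model.named_parameters():
    if name.startswith("layer4"):
        p.requires_grad = True
optimizer = torch.optim.SGD((p for p in model.parameters() if p.requires_grad), lr=1e-3)
# ... continue training; the head no longer emits huge random-initialization gradients ...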

Practical advice

There are a few additional things to keep in mind when performing Transfer Learning:

Constraints from pretrained models Note that if you wish to use a pretrained network, you may be slightly constrained in terms of the architecture you can use for your new dataset. For example, you can't arbitrarily take out Conv layers from the pretrained network. However, some changes are straightforward: due to parameter sharing, you can easily run a pretrained network on images of different spatial size. This is clearly evident in the case of Conv/Pool layers because their forward function is independent of the input volume's spatial size (as long as the strides fit). In the case of FC layers, this still holds true because FC layers can be converted to a Convolutional Layer: for example, in an AlexNet, the final pooling volume before the first FC layer is of size [6x6x256]. Therefore, the FC layer looking at this volume is equivalent to a Convolutional Layer that has receptive field size 6x6 and is applied with padding of 0.
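As a sketch of the FC-to-Conv conversion (PyTorch is an assumption here; the point is only that a 6x6 convolution with padding 0 reproduces the FC layer exactly):

import torch
import torch.nn as nn

# The FC layer that looks at the [6x6x256] pooling volume ...
fc = nn.Linear(256 * 6 * 6, 4096)
# ... is equivalent to a conv layer with receptive field 6x6 and padding 0.
conv = nn.Conv2d(256, 4096, kernel_size=6, padding=0)

with torch.no_grad():
    conv.weight.copy_(fc.weight.view(4096, 256, 6, 6))   # same weights, reshaped into kernels
    conv.bias.copy_(fc.bias)

x = torch.randn(1, 256, 6, 6)
assert torch.allclose(fc(x.flatten(1)), conv(x).flatten(1), atol=1e-4)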

Learning rates It's common to use a smaller learning rate for the ConvNet weights that are being fine-tuned, in comparison to the (randomly initialized) weights of the new linear classifier that computes the class scores for your new dataset. This is because we expect that the ConvNet weights are already relatively good, so we don't wish to distort them too quickly or too much (especially while the new linear classifier above them is being trained from random initialization).
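A sketch of this in PyTorch, using per-parameter-group learning rates (the concrete values are only placeholders):

import torch
import torch.nn as nn
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 10)   # new, randomly initialized classifier

backbone_params = [p for n, p in model.named_parameters() if not n.startswith("fc")]
optimizer = torch.optim.SGD(
    [
        {"params": backbone_params, "lr": 1e-4},        # small LR: pretrained weights
        {"params": model.fc.parameters(), "lr": 1e-2},  # larger LR: fresh classifier
    ],
    momentum=0.9,
)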
