SELF SUPERVISED
LEARNING
Prof. Biplab Banerjee
GNR 650
Slides Overview
• Motivation
• Introduction to self-supervision
• Strategies of self-supervision
• Non-Contrastive Strategies
• Contrastive Strategies
• Use cases
Motivation
• Supervised learning has shown great promise in deep
learning.
• Deep learning models are data hungry in nature.
• Manual labelling of data is a costly and time-consuming
affair.
• E.g., ImageNet dataset curation started in 2006 and
continued till 2010 (~5 years of collecting and
labelling).
• The Internet contains a vast amount of unlabeled data
that has not yet been effectively utilised.
Motivation
• Can we learn a label-agnostic feature representation from
large unlabeled data that generalises to multiple
tasks?
• Unsupervised learning is hard, so we instead carry out
self-supervision to learn from data without labels.
Unlabeled Data → Neural Network → Representation
What is Self-Supervision?
• Generate labelled data from unlabeled data by
some means of automation, without much
human supervision.
• Train a neural network to predict the "generated"
labels to learn a better representation.
• Finally, fine-tune on the downstream
task with few samples.
What is Self-Supervision?
• We train a neural network by "generating" labels from
unlabeled data and then fine-tune on the downstream
task of interest.
Strategy for Self-Supervision
• From unlabeled data, generate tasks known as pretext
tasks for learning a better representation of the data via
a neural network encoder.
• Then transfer the learnt encoder to the downstream
task by replacing the head and training it.
• Two primary ways to do it:
o Non-contrastive method
o Contrastive method
How to create pretext task?
• Given a data input, we
would like to generate the
pretext task in such a way
that the model tries to predict or
reconstruct some part of the
data (or the entire data) itself.
• E.g., using one part of an
image to generate some
other part of the image.
• Then train a neural network in a
contrastive or non-contrastive
way for self-supervision.
• Hope that the learnt encoder
"generalizes" to the downstream
task.
SOURCE: Yann LeCun @EPFL - "Self-supervised learning: could machines learn like humans?" ([Link])
Non-Contrastive way
• Involves taking an input image, distorting it in some
way, and then trying to predict the distortion or the
missing content.
• Creates a supervised learning task by automatically
generating labels.
• Some common tasks are:
o Rotation
o Inpainting
o Colorization
o Jigsaw puzzle
o Counting objects
o Relative patch position
Rotation
• Rotate images and try to
predict the rotation applied.
• Idea: a model can predict the
rotation only if it has visual
common sense of how the object
looks without distortion.
• Rotate each image into 4 directions
(0°, 90°, 180°, 270°) and
try to predict the rotation.
• Treated as a classification problem.
Source: Unsupervised Representation
Learning by Predicting Image Rotations by
Gidaris et al., 2018
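The rotation pretext task is simple to sketch in PyTorch. This is a minimal illustration, not the paper's code; the helper name `make_rotation_batch` is mine. Training would pair the returned batch with a 4-way cross-entropy classification head.

```python
import torch

def make_rotation_batch(images):
    """Given a batch of images (B, C, H, W), return the 4 rotated copies
    (0°, 90°, 180°, 270°) and the rotation class labels 0..3."""
    rotated = torch.cat([torch.rot90(images, k, dims=(2, 3)) for k in range(4)])
    # labels: B zeros, then B ones, etc., matching the concatenation order
    labels = torch.arange(4).repeat_interleave(images.size(0))
    return rotated, labels
```

Any classifier trained on `(rotated, labels)` with cross-entropy implements the pretext task; the labels come for free from the rotation itself.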
Relative patch position
• Given two patches, predict their relative position to each
other.
• The model tries to learn how different parts of images are
relatively placed, and thus how objects are arranged in the
real world.
• Relative to a single patch, we take the
surrounding 8 patches, number them,
and try to predict the index.
• Treated as a classification problem.
Source: Unsupervised Visual Representation Learning by Context
Prediction by Doersch et al., 2015
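The patch-pair sampling can be sketched as follows; this is an illustrative simplification (function name mine) that carves a 3×3 grid and labels one random neighbour of the center patch with its position index 0..7.

```python
import torch

def relative_patch_pair(image, patch=8):
    """Sample the center patch and one random neighbour from a 3x3 grid.
    image: (C, 3*patch, 3*patch). Returns (center, neighbour, label),
    where label in 0..7 indexes the neighbour's relative position."""
    grid = [(r, c) for r in range(3) for c in range(3)]   # index 4 = center
    center = image[:, patch:2*patch, patch:2*patch]
    label = torch.randint(8, (1,)).item()                 # which neighbour
    r, c = [g for i, g in enumerate(grid) if i != 4][label]
    neighbour = image[:, r*patch:(r+1)*patch, c*patch:(c+1)*patch]
    return center, neighbour, label
```

A Siamese encoder embeds both patches, and a classifier on the concatenated embeddings predicts the 8-way label. (The paper also jitters the patch locations to prevent trivial boundary-matching shortcuts, which is omitted here.)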
Jigsaw Puzzle
• Given 9 patches, jumble them up and then try to
predict which permutation was applied.
• Treated as a classification problem by indexing into a fixed
set of permutations.
Source: Unsupervised Learning of Visual Representations by Solving
Jigsaw Puzzles by Noroozi & Favaro, 2016
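A minimal sketch of the jigsaw labelling (names mine): shuffle the 9 patches by a permutation drawn from a fixed set, and use the permutation's index as the class label. Note the paper selects a subset of maximally distant permutations; here the first 64 lexicographic ones merely stand in for that set.

```python
import itertools
import random
import torch

# Stand-in for the paper's curated permutation set (selection heuristic omitted)
PERMUTATIONS = list(itertools.islice(itertools.permutations(range(9)), 64))

def jigsaw_example(patches):
    """patches: (9, C, h, w). Shuffle them by a random permutation and
    return the shuffled patches plus the permutation index to predict."""
    idx = random.randrange(len(PERMUTATIONS))
    perm = torch.tensor(PERMUTATIONS[idx])
    return patches[perm], idx
```

Each shuffled patch is encoded separately, and a head over the 9 concatenated embeddings classifies the permutation index.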
Inpainting
• Fill in a missing patch via inpainting.
• Can be treated as a regression problem for reconstructing
the missing pixels.
• Can also be treated as a
generative problem to
generate the missing pixels.
• So, it utilizes both
reconstruction and
adversarial losses for good results.
Source: Context Encoders: Feature Learning by Inpainting by
Pathak et al., 2016
Inpainting (Losses)
• The reconstruction loss tries to reconstruct the missing pixels.
• The adversarial loss tries to tell whether the image passed
is real or an inpainted one.
Source: Context Encoders: Feature Learning by Inpainting by Pathak
et al., 2016
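The joint objective can be sketched as a weighted sum of the two losses, as in Pathak et al. (who weight the reconstruction term at 0.999). The function name and the discriminator-logit interface are my simplifications of the generator-side loss.

```python
import torch
import torch.nn.functional as F

def context_encoder_loss(pred, target, disc_out_fake, lambda_rec=0.999):
    """Generator-side loss sketch: L2 reconstruction on the missing
    region plus an adversarial term that asks the discriminator to
    call the inpainted region real. lambda_rec weights the two."""
    rec = F.mse_loss(pred, target)
    adv = F.binary_cross_entropy_with_logits(
        disc_out_fake, torch.ones_like(disc_out_fake))
    return lambda_rec * rec + (1 - lambda_rec) * adv
```

The discriminator itself is trained with the usual real-vs-fake objective; only the generator sees this combined loss.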
Inpainting (comparison of loss
results)
Source: Context Encoders: Feature Learning by Inpainting by Pathak
et al., 2016
Masking possibilities
Results on different tasks
Colorization
• Tries to predict the colors from a black-and-white image.
• This is done in the Lab color space as opposed to the
RGB color space.
• L denotes perceptual
lightness; a and b denote the
color-opponent dimensions
(green-red and blue-yellow).
• Can be treated as a
reconstruction or a
generative problem.
Source: Colorful Image Colorization by Zhang et al.
2016a
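At its simplest, colorization is a network mapping the L channel to the a,b channels; the tiny model below (my naming, not the paper's architecture) shows the input/output shapes. Zhang et al. instead classify over quantized ab bins, which avoids the desaturated averages that plain regression tends to produce.

```python
import torch
import torch.nn as nn

class ColorizerSketch(nn.Module):
    """Minimal sketch: regress the a,b color channels from the L channel."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 2, 3, padding=1),   # predict a and b channels
        )

    def forward(self, L):                      # L: (B, 1, H, W)
        return self.net(L)                     # ab: (B, 2, H, W)
```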
Colorization using split brain
autoencoder
• Divide the input into different channels, then use one
channel to predict another via separate encoders.
• One common way of splitting: separate color from
lightness, and use color to predict lightness and vice
versa.
• Another way is to split the image into depth and color
and use either of them to predict the other.
Source: Split-Brain Autoencoders: Unsupervised Learning by
Cross-Channel Prediction by Zhang et al. 2016b
Colorization using split brain
autoencoder
• Colorization and depth prediction examples are shown.
Source: Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction by
Zhang et al. 2016b
Alternate aggregation technique
Results
• Pre-trained using ImageNet labels and fine-tuned on
PASCAL data.
• Methods compared: inpainting, relative position,
colorization, jigsaw solver, split-brain autoencoder,
and image rotation.
• Baselines: fully supervised pre-training on ImageNet,
and no pre-training (with weight rescaling).
Source: Unsupervised Representation Learning by Predicting
Image Rotations by Gidaris et. Al. 2018
Contrastive self supervised learning
multi-modal contrastive learning
SimCLR
Data augmentations
Results
Effects of the projection head
Effects on the batch size
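SimCLR's NT-Xent objective can be sketched as below (function name mine): two augmented views of each image are projected, each view's positive is its counterpart, and all other 2B − 2 in-batch samples serve as negatives, which is why batch size matters so much.

```python
import torch
import torch.nn.functional as F

def nt_xent(z1, z2, tau=0.5):
    """NT-Xent loss sketch. z1, z2: (B, d) projections of two augmented
    views of the same B images. tau is the temperature."""
    z = F.normalize(torch.cat([z1, z2]), dim=1)    # (2B, d) unit vectors
    sim = z @ z.t() / tau                          # cosine similarities
    sim.fill_diagonal_(float('-inf'))              # exclude self-pairs
    B = z1.size(0)
    # positive for row i is row i+B (and vice versa)
    target = torch.cat([torch.arange(B) + B, torch.arange(B)])
    return F.cross_entropy(sim, target)
```

Treating each row of the similarity matrix as logits over the other 2B − 1 samples reduces the contrastive objective to a standard cross-entropy.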
Momentum contrast
• MoCo decouples batch size from
the number of negatives.
• Uses a queue of past key embeddings as negative samples.
• Tackles SimCLR's need for very large batches.
Gradient update in MoCo
Results
Barlow twins
Advantages of BT
• Redundancy reduction in the features
• Avoids collapse in the embedding space
• It does not require large batches, or momentum
encoder
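The Barlow Twins objective from the bullets above can be sketched directly (function name mine): push the cross-correlation matrix of the two views' batch-normalised embeddings towards the identity, where the on-diagonal term enforces invariance and the off-diagonal term reduces redundancy.

```python
import torch

def barlow_twins_loss(z1, z2, lam=5e-3):
    """Barlow Twins loss sketch. z1, z2: (B, d) embeddings of two views."""
    B = z1.size(0)
    # standardise each feature dimension over the batch
    z1 = (z1 - z1.mean(0)) / (z1.std(0) + 1e-5)
    z2 = (z2 - z2.mean(0)) / (z2.std(0) + 1e-5)
    c = z1.t() @ z2 / B                       # (d, d) cross-correlation
    on_diag = (torch.diagonal(c) - 1).pow(2).sum()    # invariance term
    off_diag = (c - torch.diag(torch.diagonal(c))).pow(2).sum()  # redundancy
    return on_diag + lam * off_diag
```

Since the loss is computed per feature dimension rather than per sample pair, no negatives, large batches, or momentum encoder are needed, matching the advantages listed above.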
Results
DINO
Some visualization
Segmentation task using DINO
backbone
Pretext invariant SSL
PIRL with memory bank
Results
References
• lecture_12.pdf ([Link])
• Self-Supervised Representation Learning | Lil'Log ([Link])
• Week 10 · Deep Learning ([Link])
• Yann LeCun @EPFL - "Self-supervised learning: could machines learn like humans?" ([Link])