learning with few data
[Link]/2023-nldl-tutorial
Marcus Liwicki, Machine Learning
Luleå University of Technology
are you working on your PhD or finished recently?
did you ever
feel insignificant
doubt your skills
or
feel unchallenged?
You are not alone!
Marcus Liwicki, Machine Learning
Luleå University of Technology
[Link]/2023-nldl-tutorial
ELLIS member, WASP member
IEEE senior member, IAPR award winner, …
agenda
motivation
prior
approaches
end to end learning
transfer learning
clustering
representation learning
auto-encoding
contrastive learning
comparative summary
remarks on contrastive learning
and some spices in-between:
what I have learned during my life as a presenter
machine learning needs data
machine learning (ideal): Data, Labels, Priors
reality: Data, Priors, Labels (in practice, labels are the limiting factor)
goal: minimize human supervision with Data and Priors, relying on fewer Labels
how?
1. adding more unlabeled data or synthetic data
2. incorporating more prior (knowledge)
there are so many priors hidden in structure
including priors: 92.15 % (SotA: 88.2 %), better than Google
prior
experience (from earlier experiments)
proven architectures, meta parameters, …
knowledge (human reasoning)
correlating the given input details and identifying discriminative features
data (intrinsic or human induced)
sequential correlation, local correlation
filenames, folder structures, taxonomies
[Link]
[Link]
time to learn something about presentations ;)
should we use a dark background?
or white?
ok, enough of the torture
but why did so many of you torture each other?
Contrast is important
equity in the machine learning group
Marcus, Gustav, Pedro, Konstantina, Fotini, Christian, Kanjar, Vibha, Fredrik, Priyamvada, Saleha, György, Rajkumar, Oluwatosin, Homam, Mattias, Nosheen, Sana, Ali, András, Richa, Karl, Carl, Prakash, Lama, Elisa
Notice something? Almost 40 % are women.
machine learning for the welfare of society
thanks to previous and current PhDs
Michele Alberti, Vinay Pondenkandath, Gustav G. Pihlgren, Prakash Ch. Chhipa
overview of approaches
end to end learning
• transfer learning (A Survey on Deep Transfer Learning - 2018)
• Utilizes pretrained models, fine-tuned on application-specific data
• Requires less data for fine-tuning than training from scratch
• clustering – (Deep Clustering for Unsupervised Learning of Visual Features - 2018)
• Labelled data not required
representation learning
• auto-encoding – (Variational Autoencoder for Deep Learning of Images, Labels and Captions, 2016)
• Questionable if this is a good way to go – (A Pitfall of Unsupervised Pre-Training, 2017)
• contrastive learning (SimCLR - July 2020, SwAV – October 2020)
• Pre-training mechanism that utilizes application-specific unlabeled data
• Also compute-intensive, but can be scaled down
transfer learning
Sources: [Link], [Link]
remarks
• successful, but only the initial layers with low-level features are common and useful across applications
• cannot make use of unlabeled data
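A minimal sketch of this fine-tuning recipe, assuming PyTorch/torchvision (the ResNet-18 backbone, the 5-class head, and the optimizer settings are illustrative choices, not from the slides):

```python
import torch
import torch.nn as nn
from torchvision import models

# Load an ImageNet-pretrained backbone (torchvision >= 0.13 weights API).
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)

# Freeze the pretrained layers that carry generic low-level features.
for param in model.parameters():
    param.requires_grad = False

# Replace the classification head for the application-specific task
# (an assumed 5-class problem).
num_classes = 5
model.fc = nn.Linear(model.fc.in_features, num_classes)

# Only the new head is trained, so far less labeled data is needed
# than when training from scratch.
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

def fine_tune_step(images, labels):
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```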
ImageNet pretraining works outside of natural images
footsteps for person identification
(88 % for 13 persons, previous SotA 77 %)
MS Singh, V Pondenkandath, B Zhou, P Lukowicz, M Liwicki
Transforming sensor data to the image domain for deep learning—An application to footstep detection, IJCNN 2017
ImageNet pre-training works often well
Linda Studer, Michele Alberti, Vinaychandran Pondenkandath, Pinar Goktepe, Thomas Kolonko, Andreas Fischer, Marcus Liwicki, Rolf Ingold:
A Comprehensive Study of ImageNet Pre-Training for Historical Document Image Analysis, ICDAR, 2019
shortcomings – ImageNet transfer learning
ImageNet-trained CNNs are biased towards texture
– Strongly biased towards recognizing textures rather than shapes
Geirhos, R., Rubisch, P., Michaelis, C., Bethge, M., Wichmann, F. A., & Brendel, W. (2018, September). ImageNet-trained CNNs are biased towards texture; increasing shape bias
improves accuracy and robustness. In International Conference on Learning Representations.
ImageNet transfer learning in medical images
(figure: ImageNet → transfer learning → medical image domain, Retina DR and CheXpert datasets)
ImageNet transfer learning does not significantly affect performance on medical imaging tasks
– Ref: Raghu, M., Zhang, C., Kleinberg, J., & Bengio, S. (2019). Transfusion: Understanding transfer learning for medical imaging. Advances in Neural Information Processing Systems, 32.
– Task-specific learning: only the initial layers with low-level features are useful
Adapted from [Link]
ImageNet transfer learning in histopathology
Sharma, Y., Ehsan, L., Syed, S., & Brown, D. E. (2021). HistoTransfer: Understanding Transfer Learning for Histopathology. In 2021 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI) (pp. 1-4). IEEE.
Gastrointestinal, breast cancer
ImageNet vs. SSL
Why is ImageNet supervised transfer learning sub-optimal?
Possibly, the ImageNet-trained model is overfitted to natural scenes and optimized for dataset-specific characteristics
clustering
group features with k-means and update the weights to optimize for these assignments
Source: [Link]
remarks
• Compute-intensive when applied to images
• Non-robust feature representations when features are extracted with pretrained models
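A rough sketch of the deep-clustering loop described above, assuming PyTorch and scikit-learn; `backbone`, `head`, `unlabeled_loader`, `optimizer`, and `k` are placeholders, and keeping all batches in memory is a simplification:

```python
import torch
import torch.nn as nn
from sklearn.cluster import KMeans

def deep_cluster_epoch(backbone, head, unlabeled_loader, k, optimizer):
    criterion = nn.CrossEntropyLoss()

    # 1) extract features for all unlabeled images with the current backbone
    backbone.eval()
    with torch.no_grad():
        batches = [images for images in unlabeled_loader]
        features = torch.cat([backbone(images) for images in batches])

    # 2) group the features with k-means; cluster ids become pseudo-labels
    pseudo_labels = torch.as_tensor(
        KMeans(n_clusters=k, n_init=10).fit_predict(features.numpy())
    )

    # 3) update the weights so the network predicts its own cluster assignments
    backbone.train()
    offset = 0
    for images in batches:
        targets = pseudo_labels[offset:offset + len(images)]
        offset += len(images)
        loss = criterion(head(backbone(images)), targets)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```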
agenda
motivation
prior
approaches
end to end learning
transfer learning
clustering
representation learning
auto-encoding – and alternatives
contrastive learning
comparative summary
remarks on contrastive learning
and some spices in-between:
what I have learned during my life as a presenter
Auto-Encoding – pre-training
INPUT → ENCODER → FEATURES → DECODER → OUTPUT
Auto-Encoding – classification
INPUT → ENCODER → FEATURES → CLASSIFIER → OUTPUT
“cat”
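A minimal sketch of the two stages above; the layer sizes, the 10-class head, and the MNIST-like 784-dimensional input are illustrative:

```python
import torch.nn as nn

# Stage 1: pre-train encoder + decoder on reconstruction (no labels needed).
encoder = nn.Sequential(nn.Flatten(), nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 32))
decoder = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 784))

autoencoder = nn.Sequential(encoder, decoder)
reconstruction_loss = nn.MSELoss()   # minimize || decoder(encoder(x)) - x ||

# Stage 2: keep the pre-trained encoder and attach a classifier on its features,
# fine-tuned on the few labeled examples.
classifier = nn.Sequential(encoder, nn.Linear(32, 10))
classification_loss = nn.CrossEntropyLoss()
```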
a pitfall of unsupervised pre-training, 2017
a good auto-encoder (low reconstruction error) does not necessarily lead to better accuracy
alternative: use PCA or LDA for initialization
Will they converge? No! Better local minima?
Michele Alberti, Mathias Seuret, Vinaychandran Pondenkandath, Rolf Ingold, Marcus Liwicki
Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks. ICDAR 2017
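A small sketch of the PCA-initialization alternative, assuming scikit-learn: the principal components of (unlabeled) input data become the initial weights of the first layer instead of random values. Layer sizes are illustrative, and the random array stands in for real inputs:

```python
import numpy as np
import torch
import torch.nn as nn
from sklearn.decomposition import PCA

def pca_init_linear(layer: nn.Linear, data: np.ndarray) -> None:
    # data has shape (num_samples, in_features); one component per output unit
    pca = PCA(n_components=layer.out_features)
    pca.fit(data)
    with torch.no_grad():
        # components_ has shape (out_features, in_features), matching the weight
        layer.weight.copy_(torch.from_numpy(pca.components_).float())
        layer.bias.zero_()

layer = nn.Linear(784, 64)
pca_init_linear(layer, np.random.rand(1000, 784))  # replace with real unlabeled inputs
```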
auto-encoding limitation
what we want vs. what we might get
variational auto-encoders
X → Encoder → N(μ, σ²) → z → Decoder → X′
Kingma, Diederik P., and Max Welling. "Auto-Encoding Variational Bayes." 2013
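A compact sketch of the VAE forward pass and loss with the reparameterization trick (Kingma & Welling, 2013); the encoder/decoder architectures and dimensions are placeholders:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    def __init__(self, in_dim=784, latent_dim=32):
        super().__init__()
        self.encoder = nn.Linear(in_dim, 256)
        self.mu = nn.Linear(256, latent_dim)
        self.log_var = nn.Linear(256, latent_dim)
        self.decoder = nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(), nn.Linear(256, in_dim))

    def forward(self, x):
        h = F.relu(self.encoder(x))
        mu, log_var = self.mu(h), self.log_var(h)
        # reparameterization trick: z = mu + sigma * eps, with eps ~ N(0, I)
        z = mu + torch.exp(0.5 * log_var) * torch.randn_like(mu)
        return self.decoder(z), mu, log_var

def vae_loss(x, x_recon, mu, log_var):
    recon = F.mse_loss(x_recon, x, reduction="sum")
    # KL divergence between N(mu, sigma^2) and the prior N(0, I)
    kl = -0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp())
    return recon + kl
```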
perceptual loss
X → Encoder → z → Decoder → X′ → another neural network → y′
X → another neural network → y
(the loss compares y′ with y)
Thorough investigation: Improving Image Autoencoder Embeddings with Perceptual Loss, 2020
And Oskar Sjögren (yesterday)
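A sketch of a perceptual loss as in the figure: the reconstruction is compared with the input in the feature space of another, frozen network rather than pixel-wise. The VGG-16 feature extractor is an illustrative choice, not the specific network from the cited work:

```python
import torch
import torch.nn.functional as F
from torchvision import models

# Frozen, pretrained feature extractor ("another neural network" in the figure).
perceptual_net = models.vgg16(weights=models.VGG16_Weights.IMAGENET1K_V1).features[:16].eval()
for p in perceptual_net.parameters():
    p.requires_grad = False

def perceptual_loss(x, x_reconstructed):
    y = perceptual_net(x)                       # features of the original image X
    y_prime = perceptual_net(x_reconstructed)   # features of the reconstruction X'
    return F.mse_loss(y_prime, y)
```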
try it out …
[Link]/2023-nldl-tutorial
[Link]
[Link]
[Link]
Contrastive Learning (CL)
Self-Supervised Method: allows the model to learn generic representations on unlabeled data
Method:
learn similarity between augmented representations of the same image
learn dissimilarity otherwise
Source: [Link]
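A minimal sketch of an NT-Xent-style contrastive loss (the form used in SimCLR); `z1` and `z2` are assumed to be embeddings of two augmented views of the same batch of images:

```python
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.5):
    z = F.normalize(torch.cat([z1, z2]), dim=1)   # (2N, d) unit-norm embeddings
    sim = z @ z.t() / temperature                 # pairwise cosine similarities
    n = z1.shape[0]
    # exclude self-similarity so it never counts as a positive or a negative
    sim.fill_diagonal_(float("-inf"))
    # positives: the i-th view in z1 matches the i-th view in z2, and vice versa
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)])
    return F.cross_entropy(sim, targets)
```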
(not so) recent work in Contrastive Learning
Simple Framework for Contrastive Learning (SimCLR)
A Simple Framework for Contrastive Learning of Visual Representations (SimCLR v1), ICML 2020
Big Self-Supervised Models are Strong Semi-Supervised Learners (SimCLR v2), NeurIPS 2020
Momentum Contrast Learning (MoCo)
Momentum Contrast for Unsupervised Visual Representation Learning (MoCo v1), CVPR 2020
Improved Baselines with Momentum Contrastive Learning (MoCo v2), arXiv 2020
Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning (BYOL), NeurIPS 2020
Contrastive Learning with Clustering
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments (SwAV), arXiv 2020
Comparative Summary on SOTA
Contrastive Learning
Clustering + Self-supervised
Self-Labelling
Source (IARAI): [Link]
• Remarks
• Priors (the augmentation mechanism) are more important than the learning method
• Obtains performance approximately equal to supervised methods with 10 % labelled data
it’s easy on natural images
distorted (augmented) views of the input visual
Human prior for visuals → Relevant augmentation
Size → Resize
Shape → Crop, Flip
Foreground-Background → Blur, Noise, Color schemes, Filtering
Angle → Flip, Rotation
Color spectrum → Contrast, Saturation
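The human priors in the table above roughly translate into a standard augmentation pipeline; a sketch with torchvision, where the parameter values are illustrative:

```python
from torchvision import transforms

augment = transforms.Compose([
    transforms.RandomResizedCrop(224),              # size / shape priors
    transforms.RandomHorizontalFlip(),              # shape / angle priors
    transforms.ColorJitter(0.4, 0.4, 0.4, 0.1),     # color spectrum prior
    transforms.RandomGrayscale(p=0.2),
    transforms.GaussianBlur(kernel_size=23),        # foreground-background prior
    transforms.ToTensor(),
])

# Two independently augmented views of the same image form a positive pair:
# view1, view2 = augment(image), augment(image)
```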
but does not work in other domains
(the same human-prior augmentations as above)
medical images, remote sensing imagery, non-obvious visual concepts
insufficiency of the human prior for distorted views
use two views of the same patient
Azizi, S., Mustafa, B., Ryan, F., Beaver, Z., Freyberg, J., Deaton, J., ... & Norouzi, M. (2021). Big self-supervised models advance medical image classification. In
Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 3478-3488).
but wait … did we just use labels?
our approach: shifting focus from human prior to data prior
(figure: three setups compared)
supervised approach: Data, Labels, Human Priors (minimize human supervision)
self-supervised approach (on natural visual concepts): Data, Human Priors (augmentation), Labels
adapting the self-supervised approach to a specialized domain: reduce the human prior (augmentation) and incorporate the data prior (Data, Data Priors, Labels)
let us use the data prior
data prior: magnification levels (in the BreakHis data) are utilized to generate both views for the SSL input
the only human prior used is in the magnification sampling
achieves state-of-the-art classification results with only 20 % of the labels
Chhipa, P. C., Upadhyay, R., Pihlgren, G. G., Saini, R., Uchida, S., & Liwicki, M. (2022). Magnification Prior: A Self-Supervised Method for Learning Representations on
Breast Cancer Histopathological Images. arXiv preprint arXiv:2203.07707.
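A sketch of this view-pairing idea: the data prior (magnification levels of the same region) replaces the human augmentation prior. `images_by_magnification` is an assumed structure, not the paper's actual code:

```python
import random

# BreakHis provides the same tissue region at four magnifications.
MAGNIFICATIONS = [40, 100, 200, 400]

def magnification_pair(images_by_magnification):
    # the only human prior: how to sample the two magnification levels
    m1, m2 = random.sample(MAGNIFICATIONS, 2)
    # two magnifications of the same sample form the positive pair for SSL
    return images_by_magnification[m1], images_by_magnification[m2]
```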
ideas for data prior
temporal proximity
spatial proximity
sequential co-occurrence (BERT)
different modalities
more?
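One of the ideas listed above, temporal proximity, as a tiny sketch; the `frames` list and the gap parameter are illustrative:

```python
import random

def temporal_positive_pair(frames, max_gap=5):
    # frames: list of video frames ordered by time, len(frames) > max_gap;
    # two frames that are close in time are treated as two views of the same
    # content, i.e. a positive pair for contrastive learning.
    i = random.randrange(len(frames) - max_gap)
    j = i + random.randint(1, max_gap)
    return frames[i], frames[j]
```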
curious what more we can learn about presentation techniques?
btw., should we use slide numbers?
typical issues I observe at scientific conferences:
unconfident posture
filler sounds
angle and interaction
agenda
motivation
prior
approaches
end to end learning
transfer learning
clustering
representation learning
auto-encoding
contrastive learning
comparative summary
remarks on contrastive learning
and some spices in-between:
what I have learned during my life as a presenter
97’123’452
summary
end to end learning
• transfer learning
• clustering
representation learning
• auto-encoding
• PCA, LDA
• perceptual loss
• contrastive learning
meta learning (not covered today)
remarks on contrastive learning
SimCLR v1.0
• Key factor: K1 (similarity learning for positive pairs) + K2 (dissimilarity learning for negative pairs)
• Contribution: established benchmark performance in unsupervised contrastive learning
• Limitations: (1) large batch size due to positive + negative pairs; (2) mass gradient computation and backprop issue due to all (+ve and -ve) pairs

SimCLR v2.0
• Key factor: K1 + K2 on a task-agnostic big network, which is used in distillation for a task-specific small network
• Contribution: added enablement of semi-supervised learning through distillation
• Limitations: same as SimCLR v1.0, plus the usage of bigger networks

MoCo v1.0
• Key factor: K1 + K2 over a momentum encoder, where CL works as a dynamic dictionary lookup
• Contribution: showed unsupervised contrastive learning with a smaller batch size and less backpropagation of gradients
• Limitations: (1) mass gradient computation and backprop issue due to all (+ve and -ve) pairs (same as SimCLR, because the q-encoder backpropagates); (2) overhead of the dynamic dictionary queue

MoCo v2.0
• Key factor: MoCo v1.0 + a 2-layer MLP projection head
• Contribution: stronger baseline; outperformed SimCLR and MoCo v1.0
• Limitations: (1) mass gradient computation and backprop issue due to all (+ve and -ve) pairs (same as SimCLR, because the q-encoder backpropagates); (2) overhead of the dynamic dictionary queue

BYOL
• Key factor: K1 + momentum encoding + two separate networks (online and target)
• Contribution: achieves self-supervised CL without negative pairs; establishes benchmarks in the semi-supervised approach; robust to smaller batch sizes
• Limitation: complex pipeline with a large number of components, which makes the concept challenging to utilize

SwAV
• Key factor: K1 + "swapped" prediction mechanism + cluster assignment
• Contribution: achieves self-supervised CL without negative pairs; claims state of the art in unsupervised image clustering
• Limitations: (1) relatively complex loss computation due to the swapped prediction; (2) additional online cluster-assignment swapping

DINO
• Key factor: distillation with transformers
• Contribution: self-attention without supervision; moderate computation power
• Limitations: (1) more research required; (2) authors are not self-critical

Barlow Twins
• Key factor: redundancy reduction
• Contribution: minimize covariance across embedding dimensions; maximize invariance across samples
remarks on contrastive learning
CL is leading self-supervision and is a potential push for semi-supervised learning
CL in its current state is compute-intensive
batch size is huge
SimCLR: performance increases with a batch size of 2048
reason: a large number of negative pairs
requires an array of GPUs and sophisticated parallel processing
knowledge-distillation methods (BYOL 2020, SimSiam 2020) do not use negative pairs
batch size 512
however, embedding output size in the range of 4096
for non-natural images, a smaller batch size (128) is already good
reason: not RGB images, but simpler
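For reference, a sketch of the negative-free objective used by the SimSiam-style methods mentioned above: the predictor output for one view is pulled towards the stop-gradient projection of the other view, so no large batch of negative pairs is needed. `p1`, `p2` are assumed predictor outputs and `z1`, `z2` projector outputs:

```python
import torch.nn.functional as F

def simsiam_loss(p1, p2, z1, z2):
    # detach() implements the stop-gradient on the target branch
    def d(p, z):
        return -F.cosine_similarity(p, z.detach(), dim=-1).mean()
    # symmetric loss over the two view assignments
    return 0.5 * d(p1, z2) + 0.5 * d(p2, z1)
```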
Remarks on Contrastive Learning
CL is leading self-supervision and is a potential push for semi-supervised learning.
CL in its current state is compute-intensive (batch size, negative pairs, and gradients), which makes direct (as-is) application challenging; it needs to be tailored (research potential) to custom and small-scale application requirements.
Contrastive methods are sensitive to the choice of image/data augmentation.
CL leverages application-specific but unlabeled data.
CL can be a benchmarking framework (different methods for different applications) for semi-supervised and even supervised tasks.
thanks to my colleagues
there is so much more I could share: [Link]/2023-nldl-tutorial
[Link]