
VISUAL RECOGNITION – PART 2

Lecture 5: Transformers
Image Captioning

Encoder Representation
Attention in LSTMs for Image Captioning
Sequence Modeling: Issues with the RNN Architecture

• A single hidden state from the encoder is fed into the decoder
• Every step in the decoder depends only on the previous hidden state and prediction
• Dependencies in a sequence have to be modeled ‘sequentially’
• Remedy: learn to attend to relevant regions in the ‘source’ at every step

Language Translation
http://jalammar.github.io/illustrated-transformer/
https://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf
Self Attention – Motivation

Self-attention: a method to improve word embeddings by relating each word to the other words in the sequence.

Example: “The animal didn’t cross the street because it was too tired” (which noun does ‘it’ refer to?)

RNN-style Modeling vs. Modeling with Self-Attention


http://jalammar.github.io/illustrated-transformer/
https://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf
Self Attention

Each input embedding (dimension d_model) is projected into three vectors, e.g. for the words “The animal”:

• Queries, of dimension d_k
• Keys, of dimension d_k
• Values, of dimension d_v

Attention(Q, K, V) = softmax(QKᵀ / √d_k) V

The scores are divided by √d_k to control the input values to the softmax function.
http://jalammar.github.io/illustrated-transformer/

Matrix Calculation of Self Attention


http://jalammar.github.io/illustrated-transformer/
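As a concrete illustration of the matrix calculation above, here is a minimal NumPy sketch of self-attention: the input embeddings X are projected into Q, K, and V, the scores QKᵀ are scaled by √d_k, passed through a softmax, and used to weight the values. The projection matrices and the toy dimensions are illustrative assumptions, not values from the lecture.

    import numpy as np

    def softmax(x, axis=-1):
        x = x - x.max(axis=axis, keepdims=True)   # subtract max for numerical stability
        e = np.exp(x)
        return e / e.sum(axis=axis, keepdims=True)

    def self_attention(X, W_q, W_k, W_v):
        """X: (seq_len, d_model) token embeddings; W_q/W_k/W_v: projection matrices."""
        Q = X @ W_q                                # (seq_len, d_k) queries
        K = X @ W_k                                # (seq_len, d_k) keys
        V = X @ W_v                                # (seq_len, d_v) values
        d_k = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d_k)            # scale to control the softmax inputs
        weights = softmax(scores, axis=-1)         # (seq_len, seq_len) attention weights
        return weights @ V                         # (seq_len, d_v) output Z

    # Toy usage (illustrative sizes): 4 tokens, d_model = 8, d_k = d_v = 4
    rng = np.random.default_rng(0)
    X = rng.standard_normal((4, 8))
    W_q, W_k, W_v = [rng.standard_normal((8, 4)) for _ in range(3)]
    Z = self_attention(X, W_q, W_k, W_v)
    print(Z.shape)                                 # (4, 4)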

Multiple Attention Heads

1. Concatenate the outputs of all attention heads.
2. Multiply with a weight matrix W_O that is trained jointly with the model.
3. The resulting matrix Z captures attention from all heads.
http://jalammar.github.io/illustrated-transformer/

Multiple Attention Heads

Attention is calculated in parallel by 8 heads, each using its own Q_i, K_i, V_i. The results are concatenated and multiplied with the final output weight matrix W_O.
https://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf
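A hedged NumPy sketch of the multi-head computation described above: each head attends with its own Q_i, K_i, V_i, the head outputs are concatenated, and the result is multiplied with the output weight matrix W_O. All weights and sizes here are illustrative placeholders, and the code reuses the self_attention helper from the earlier sketch.

    import numpy as np

    def multi_head_attention(X, W_q, W_k, W_v, W_o):
        """W_q/W_k/W_v: lists with one projection matrix per head; W_o: (h*d_v, d_model)."""
        heads = []
        for Wq_i, Wk_i, Wv_i in zip(W_q, W_k, W_v):
            heads.append(self_attention(X, Wq_i, Wk_i, Wv_i))   # (seq_len, d_v) per head
        concat = np.concatenate(heads, axis=-1)                  # (seq_len, h * d_v)
        return concat @ W_o                                      # (seq_len, d_model) final Z

    # Toy usage: 8 heads, d_model = 8, d_k = d_v = 4 (illustrative sizes)
    rng = np.random.default_rng(1)
    X = rng.standard_normal((4, 8))
    W_q = [rng.standard_normal((8, 4)) for _ in range(8)]
    W_k = [rng.standard_normal((8, 4)) for _ in range(8)]
    W_v = [rng.standard_normal((8, 4)) for _ in range(8)]
    W_o = rng.standard_normal((8 * 4, 8))
    Z = multi_head_attention(X, W_q, W_k, W_v, W_o)
    print(Z.shape)                                               # (4, 8)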

Multiple Attention Heads – Illustration

Advantages:
• Parallelizable computations
• Long-range dependency modeling
http://jalammar.github.io/illustrated-transformer/

Self Attention – Illustration

Focus of a single attention head vs. focus of multiple attention heads


https://kazemnejad.com/blog/transformer_architecture_positional_encoding/

Positional Encoding – Illustration

Intuition for position representation: moving from red to orange bits, the frequency of bit reversal decreases (as in the binary representation of integers, where lower bits flip faster than higher bits).

Positional encoding uses float values instead of bits: it contains pairs of sines and cosines at different frequencies.
https://kazemnejad.com/blog/transformer_architecture_positional_encoding/
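A minimal NumPy sketch of the sinusoidal encoding described above, following the formulas from the “Attention Is All You Need” paper: PE(pos, 2i) = sin(pos / 10000^(2i/d_model)) and PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model)).

    import numpy as np

    def positional_encoding(max_len, d_model):
        """Returns a (max_len, d_model) matrix of sines and cosines at decreasing frequencies."""
        pos = np.arange(max_len)[:, None]                 # positions 0 .. max_len-1
        i = np.arange(0, d_model, 2)[None, :]             # even dimension indices
        angles = pos / np.power(10000.0, i / d_model)     # frequency decreases as i grows
        pe = np.zeros((max_len, d_model))
        pe[:, 0::2] = np.sin(angles)                      # even dimensions: sine
        pe[:, 1::2] = np.cos(angles)                      # odd dimensions: cosine
        return pe

    # The encoding is added to the word embeddings before the first encoder layer.
    pe = positional_encoding(max_len=50, d_model=512)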

Learnable Position Embeddings


https://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf
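For contrast, a hedged sketch of learnable position embeddings: instead of fixed sinusoids, each position indexes into a trainable lookup table. Here a random NumPy matrix stands in for parameters that would be trained jointly with the model.

    import numpy as np

    max_len, d_model = 50, 512
    position_table = 0.02 * np.random.randn(max_len, d_model)   # stand-in for learned parameters

    def add_position_embeddings(word_embeddings):
        """word_embeddings: (seq_len, d_model); adds the learned position vector per index."""
        seq_len = word_embeddings.shape[0]
        return word_embeddings + position_table[:seq_len]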

Machine Translation: Transformer Architecture

The encoder and the decoder each consist of a stack of N = 6 identical layers.

Encoder self-attention:
Input = word embedding + position embedding

Decoder self-attention:
Input = word embedding + position embedding, with masking of future positions

Encoder-decoder attention:
Queries = output of the previous decoder layer
Keys, Values = encoder output

This allows every position in the decoder to attend over all positions in the input sequence.
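A sketch of the masking mentioned for decoder self-attention (an illustrative assumption, not the lecture's code): scores for future positions are set to negative infinity before the softmax, so each decoder position can only attend to itself and earlier positions.

    import numpy as np

    def masked_self_attention(Q, K, V):
        """Causal (decoder-side) self-attention: block attention to future positions."""
        d_k = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d_k)
        future = np.triu(np.ones(scores.shape, dtype=bool), k=1)    # entries above the diagonal
        scores[future] = -np.inf                                     # hide future tokens
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        return weights @ V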
