Self-Attention Mechanism in
Deep Learning
Definition, Architecture, Applications,
and How It Works
What is Self-Attention?
• Self-attention is a mechanism that allows a
model to weigh the importance of different
words in an input sequence relative to each
other.
• - Computes relationships between elements of
a sequence.
• - Produces context-aware representations: each element's output reflects the whole sequence.
• - Fundamental to Transformer models.
Architecture of Self-Attention
• Key components:
• - Input Embeddings
• - Query, Key, and Value Vectors (Q, K, V)
• - Scaled Dot-Product Attention
• - Output Weighted Sum
• Typically used in a multi-head configuration so that different heads capture
different types of relationships (see the projection sketch below).
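A minimal PyTorch sketch of these components (the shapes, variable names, and linear-projection setup are illustrative assumptions, not taken from the slides), showing how Q, K, and V are projected from the input embeddings and split into heads:

import torch
import torch.nn as nn

batch, seq_len, d_model, n_heads = 2, 5, 64, 8
d_k = d_model // n_heads                       # dimension per head

x = torch.randn(batch, seq_len, d_model)       # input embeddings (plus positional information)

W_q = nn.Linear(d_model, d_model, bias=False)  # learned projection for queries
W_k = nn.Linear(d_model, d_model, bias=False)  # learned projection for keys
W_v = nn.Linear(d_model, d_model, bias=False)  # learned projection for values

def split_heads(t):
    # Reshape so each head attends in its own d_k-dimensional subspace.
    return t.view(batch, seq_len, n_heads, d_k).transpose(1, 2)  # (batch, heads, seq, d_k)

Q, K, V = split_heads(W_q(x)), split_heads(W_k(x)), split_heads(W_v(x))
print(Q.shape)  # torch.Size([2, 8, 5, 8])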
How Does Self-Attention Work?
• 1. Generate Q, K, V vectors from input.
• 2. Compute the dot products of Q and K, scaled by √d_k, to obtain raw
attention scores.
• 3. Apply softmax to obtain attention weights.
• 4. Multiply the weights by V to produce the output as a weighted sum of the
value vectors.
• Formula: Attention(Q, K, V) = Softmax(QKᵀ /
√d_k) V
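A minimal PyTorch sketch of the formula above, following steps 2–4 directly (variable names and shapes are illustrative assumptions, not from the slides):

import math
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(Q, K, V):
    d_k = Q.size(-1)
    scores = Q @ K.transpose(-2, -1) / math.sqrt(d_k)  # step 2: scaled dot-product scores
    weights = F.softmax(scores, dim=-1)                # step 3: attention weights
    return weights @ V, weights                        # step 4: weighted sum of values

Q = torch.randn(2, 8, 5, 8)   # (batch, heads, seq_len, d_k), e.g. from the projections above
K = torch.randn(2, 8, 5, 8)
V = torch.randn(2, 8, 5, 8)
output, attn = scaled_dot_product_attention(Q, K, V)
print(output.shape, attn.shape)  # torch.Size([2, 8, 5, 8]) torch.Size([2, 8, 5, 5])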
Applications of Self-Attention
• - Natural Language Processing (NLP): BERT,
GPT
• - Machine Translation and Summarization
• - Vision Transformers (ViT) in Computer Vision
• - Speech Recognition and Audio Processing
• - Protein Structure Prediction (e.g., AlphaFold)
Thank You
• Questions and Discussions Welcome!