
Attention Mechanisms in Convolutional Neural Networks (CNNs)

Attention mechanisms improve CNNs by helping them focus on the most important parts of an image, similar to how humans concentrate on specific parts of a scene. This makes CNNs more effective at tasks like image classification, object detection, and segmentation.

A. What is Attention in CNNs?

Attention in CNNs mimics the human ability to focus on key areas in an image. For example, when identifying a person, the model pays more attention to the face than to the background.

B. Why Use Attention in CNNs?

Selective Focus: Helps the model prioritize important parts of the image.
Noise Handling: Reduces the impact of irrelevant or noisy image areas.
Global Context: Goes beyond local details to understand the full image.
Interpretability: Highlights the areas that influenced the model's predictions.

C. Types of Attention Mechanisms

Channel Attention: Focuses on important feature channels (e.g., the Squeeze-and-Excitation block).
Spatial Attention: Focuses on key image regions (e.g., PSANet).
Hybrid Attention: Combines channel and spatial attention (e.g., CBAM).

D. How Attention Works in CNNs

Extract Features: The CNN processes the image to generate feature maps.
Generate Attention Weights: The attention module identifies important regions or channels.
Recalibrate Features: The feature maps are adjusted using these weights, as sketched below.
Predict: The refined features are used for the final task (e.g., classification).
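All of the modules described below share this recalibration pattern: attention weights in the range [0, 1] are multiplied element-wise with the feature maps. A minimal PyTorch sketch of that step (the function name and shapes are illustrative, not taken from the source):

import torch

def recalibrate(features: torch.Tensor, attention: torch.Tensor) -> torch.Tensor:
    # features: B x C x H x W feature maps produced by the CNN backbone
    # attention: weights in [0, 1] that broadcast over `features`, e.g.
    #   B x C x 1 x 1 for channel attention or B x 1 x H x W for spatial attention
    return features * attention  # important features are kept, others are suppressed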
1. Squeeze-and-Excitation (SE) Block
The SE block introduces channel-wise attention by recalibrating channel
importance. Here's a step-by-step breakdown:

Global Average Pooling (GAP): For each channel in the feature map, GAP
calculates the average value across all spatial positions.
This reduces the feature map size from 𝐶 × 𝐻 × 𝑊 to 𝐶, summarizing spatial
information for each channel.

Fully Connected (FC) Layer with Reduction: The GAP output (a 𝐶-dimensional vector) is fed into an FC layer that reduces the number of channels by a reduction ratio 𝑟 (e.g., 𝑟 = 16). This step compresses the information, forcing the model to focus on essential features.

ReLU Activation: Adds non-linearity to help the model learn complex channel
dependencies.
FC Layer to Restore Dimensions: A second FC layer restores the reduced
channel count back to the original size 𝐶.

Sigmoid Activation: Converts the output into attention weights between 0 and
1 for each channel.

Rescaling: The original feature map is multiplied by these weights channel-wise, enhancing important channels and suppressing irrelevant ones.
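Putting these steps together, a minimal PyTorch sketch of an SE block might look as follows (the class name and the default reduction ratio of 16 are illustrative choices):

import torch
import torch.nn as nn

class SEBlock(nn.Module):
    """Squeeze-and-Excitation: channel-wise attention via GAP + two FC layers."""

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)              # squeeze: C x H x W -> C x 1 x 1
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),  # reduce by ratio r
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),  # restore to C
            nn.Sigmoid(),                                # attention weights in [0, 1]
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w                                     # channel-wise rescaling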

2. Efficient Channel Attention (ECA-Net)

ECA simplifies channel attention by replacing the FC layers with a single 1D convolution, making it lightweight and efficient.

Global Average Pooling (GAP): Summarizes spatial information for each channel, similar to SE.

Adaptive Kernel Size: ECA avoids FC layers by applying a 1D convolution along the channel dimension. The kernel size 𝑘 of this convolution is determined adaptively from the number of channels 𝐶: 𝑘 = 𝜓(𝐶), where 𝜓 is a function (e.g., logarithmic in 𝐶) that ensures scalability.
1D Convolution: This lightweight operation captures channel dependencies
efficiently without increasing model complexity.

Sigmoid Activation and Rescaling: The output of the 1D convolution is passed through a sigmoid function, generating channel-wise attention weights. These weights are applied to the input feature map for channel refinement.
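A minimal PyTorch sketch of ECA-style channel attention is shown below; the concrete form of 𝜓(𝐶) used here (a log2-based rule with constants gamma = 2 and b = 1) is a common choice and should be read as an assumption rather than the only option:

import math
import torch
import torch.nn as nn

class ECABlock(nn.Module):
    """Efficient Channel Attention: GAP + a single 1D convolution across channels."""

    def __init__(self, channels: int, gamma: int = 2, b: int = 1):
        super().__init__()
        # adaptive kernel size k = psi(C); the constants are an assumption here
        k = int(abs(math.log2(channels) / gamma + b / gamma))
        k = k if k % 2 == 1 else k + 1                   # keep the kernel size odd
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.conv = nn.Conv1d(1, 1, kernel_size=k, padding=k // 2, bias=False)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b_, c, _, _ = x.shape
        y = self.pool(x).view(b_, 1, c)                  # B x 1 x C: channels as a 1D sequence
        y = self.sigmoid(self.conv(y)).view(b_, c, 1, 1)
        return x * y                                     # channel-wise rescaling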

3. Point-wise Spatial Attention (PSANet)

PSANet emphasizes spatial attention by considering the relationships between all points in the feature map.

Feature Map Reduction: The input feature map (𝐶×𝐻×𝑊) is reduced to a smaller size (𝐶′×𝐻×𝑊) using a convolutional layer. This makes subsequent computations more efficient.

Collect and Distribute Attention: The reduced feature map is split into two
streams:
● Collect Attention: Determines how much attention each pixel collects
from the entire image.
● Distribute Attention: Determines how much attention each pixel
distributes to other pixels.

Attention Map Generation: Both streams generate over-complete attention maps of size 𝐻 × 𝑊 × (2𝐻−1) × (2𝑊−1) using convolutions and non-linear activations.

Feature Refinement: The attention maps are applied to the feature maps, enhancing pixels based on their global and local importance.
Concatenation and Projection: The refined feature maps are combined and projected back to match the original input dimensions. A simplified sketch of this module follows.
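PSANet's bi-directional point-wise attention is fairly involved; the sketch below is a deliberately simplified PyTorch approximation that predicts a full 𝐻·𝑊-dimensional attention vector per position directly, skipping the over-complete-map cropping of the original method, so the class name, shapes, and projection details are illustrative assumptions:

import torch
import torch.nn as nn

class SimplifiedPointwiseSpatialAttention(nn.Module):
    """Simplified collect/distribute spatial attention (assumes a fixed H x W)."""

    def __init__(self, in_channels: int, reduced_channels: int, height: int, width: int):
        super().__init__()
        n = height * width
        self.reduce = nn.Conv2d(in_channels, reduced_channels, kernel_size=1)
        # each position predicts one weight per position in the map (H*W weights)
        self.collect = nn.Conv2d(reduced_channels, n, kernel_size=1)
        self.distribute = nn.Conv2d(reduced_channels, n, kernel_size=1)
        self.project = nn.Conv2d(2 * reduced_channels, in_channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, _, h, w = x.shape
        r = self.reduce(x)                        # B x C' x H x W
        feat = r.flatten(2)                       # B x C' x N, with N = H*W

        # collect: each position i gathers features from every position j
        a_c = self.collect(r).flatten(2)          # B x N x N, weights indexed [j, i]
        collected = torch.bmm(feat, a_c)          # B x C' x N

        # distribute: each position j spreads its feature to every position i
        a_d = self.distribute(r).flatten(2)       # B x N x N, weights indexed [i, j]
        distributed = torch.bmm(feat, a_d.transpose(1, 2))

        out = torch.cat([collected, distributed], dim=1).view(b, -1, h, w)
        return self.project(out)                  # back to the original channel count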

4. Convolutional Block Attention Module (CBAM)

The Convolutional Block Attention Module (CBAM) enhances the learning capability of deep neural networks by integrating attention mechanisms with convolutional layers. The primary goal of CBAM is to allow the network to focus on the most relevant features in the input data, such as important spatial regions or specific channels in a feature map, while ignoring less important details.

CBAM starts with an input feature map 𝑋 ∈ ℝ^(𝐶×𝐻×𝑊), where 𝐶 is the number of channels, and 𝐻 and 𝑊 are the spatial dimensions (height and width). The module applies two attention mechanisms in sequence to determine which features are most important:

The Channel Attention Module (CAM) focuses on identifying the most important channels. The input feature map is processed using global pooling operations, such as average pooling and max pooling, to summarize the information in each channel. This summarized information is passed through small fully connected layers with activation functions to calculate the importance (or attention) of each channel. The computed attention values are used to scale the original feature map, enhancing significant channels and reducing less important ones. By focusing on the most important channels, CAM improves the representation of key features in the data. A sketch of this module follows.
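A minimal PyTorch sketch of such a channel attention module in the CBAM style, combining average- and max-pooled descriptors through a shared MLP (the class name and default reduction ratio are illustrative):

import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    """CBAM-style channel attention: shared MLP over avg- and max-pooled descriptors."""

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.avg_pool = nn.AdaptiveAvgPool2d(1)
        self.max_pool = nn.AdaptiveMaxPool2d(1)
        self.mlp = nn.Sequential(                        # shared for both descriptors
            nn.Conv2d(channels, channels // reduction, kernel_size=1, bias=False),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1, bias=False),
        )
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        attn = self.sigmoid(self.mlp(self.avg_pool(x)) + self.mlp(self.max_pool(x)))
        return x * attn                                  # B x C x 1 x 1 weights, broadcast over H x W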

The Spatial Attention Module (SAM) emphasizes the significance of specific spatial regions within the feature map, such as areas corresponding to objects or patterns in an image. SAM highlights the critical spatial locations where features are most relevant.
The input feature map is pooled along the channel dimension using average pooling and max pooling to create two spatial maps that summarize information across all channels. These spatial maps are concatenated and processed through a convolutional layer to generate an attention map. The generated attention map is applied to the input feature map, emphasizing important spatial regions and downplaying less relevant ones. SAM ensures that the network focuses on the most crucial areas within an image, improving spatial feature extraction. A sketch of this module, together with the full CBAM sequence, follows.
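A minimal PyTorch sketch of the spatial attention module, together with a small wrapper that chains channel and spatial attention in sequence as CBAM does (the 7×7 convolution kernel is a commonly used choice here, not something specified in the text above; ChannelAttention is the module sketched earlier):

import torch
import torch.nn as nn

class SpatialAttention(nn.Module):
    """CBAM-style spatial attention: channel-wise avg/max pooling + a convolution."""

    def __init__(self, kernel_size: int = 7):
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2, bias=False)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        avg_map = torch.mean(x, dim=1, keepdim=True)     # B x 1 x H x W
        max_map, _ = torch.max(x, dim=1, keepdim=True)   # B x 1 x H x W
        attn = self.sigmoid(self.conv(torch.cat([avg_map, max_map], dim=1)))
        return x * attn                                  # B x 1 x H x W weights, broadcast over C

class CBAM(nn.Module):
    """Channel attention followed by spatial attention, as in CBAM."""

    def __init__(self, channels: int, reduction: int = 16, kernel_size: int = 7):
        super().__init__()
        self.channel_attention = ChannelAttention(channels, reduction)  # sketched above
        self.spatial_attention = SpatialAttention(kernel_size)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.spatial_attention(self.channel_attention(x))

In practice, such a block would typically be inserted after a convolutional stage, e.g. features = CBAM(channels=256)(features), leaving the tensor shape unchanged.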
