ResNet_CBAM Source Code
### ResNet with CBAM Source Code Implementation
The Convolutional Block Attention Module (CBAM) can be integrated into a ResNet model to enhance its performance through attention mechanisms. Below is the Python code implementing this combination using PyTorch:
```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import models

class BasicConv(nn.Module):
    """Conv2d followed by optional BatchNorm and optional ReLU."""
    def __init__(self, in_planes, out_planes, kernel_size, stride=1, padding=0,
                 dilation=1, groups=1, relu=True, bn=True, bias=False):
        super(BasicConv, self).__init__()
        self.out_channels = out_planes
        self.conv = nn.Conv2d(in_planes, out_planes, kernel_size=kernel_size, stride=stride,
                              padding=padding, dilation=dilation, groups=groups, bias=bias)
        self.bn = nn.BatchNorm2d(out_planes, eps=1e-5, momentum=0.01, affine=True) if bn else None
        self.relu = nn.ReLU() if relu else None

    def forward(self, x):
        x = self.conv(x)
        if self.bn is not None:
            x = self.bn(x)
        if self.relu is not None:
            x = self.relu(x)
        return x

class ChannelGate(nn.Module):
    """Channel attention: a shared MLP over globally pooled channel descriptors."""
    def __init__(self, gate_channels, reduction_ratio=16, pool_types=['avg', 'max']):
        super(ChannelGate, self).__init__()
        self.gate_channels = gate_channels
        self.mlp = nn.Sequential(
            nn.Flatten(),  # flatten the pooled (N, C, 1, 1) descriptor to (N, C)
            nn.Linear(gate_channels, gate_channels // reduction_ratio),
            nn.ReLU(),
            nn.Linear(gate_channels // reduction_ratio, gate_channels)
        )
        self.pool_types = pool_types

    def forward(self, x):
        channel_att_sum = None
        for pool_type in self.pool_types:
            if pool_type == 'avg':
                # Global average pooling over the full spatial extent.
                avg_pool = F.avg_pool2d(x, (x.size(2), x.size(3)), stride=(x.size(2), x.size(3)))
                channel_att_raw = self.mlp(avg_pool)
            elif pool_type == 'max':
                # Global max pooling over the full spatial extent.
                max_pool = F.max_pool2d(x, (x.size(2), x.size(3)), stride=(x.size(2), x.size(3)))
                channel_att_raw = self.mlp(max_pool)
            if channel_att_sum is None:
                channel_att_sum = channel_att_raw
            else:
                channel_att_sum = channel_att_sum + channel_att_raw
        # Sigmoid yields per-channel weights, broadcast back to the input shape.
        scale = torch.sigmoid(channel_att_sum).unsqueeze(2).unsqueeze(3).expand_as(x)
        return x * scale

def logsumexp_2d(tensor):
    # Numerically stable log-sum-exp over the spatial dimensions;
    # backs the optional 'lse' pool type in the original CBAM code.
    tensor_flatten = tensor.view(tensor.size(0), tensor.size(1), -1)
    s, _ = torch.max(tensor_flatten, dim=2, keepdim=True)
    outputs = s + (tensor_flatten - s).exp().sum(dim=2, keepdim=True).log()
    return outputs

class ChannelPool(nn.Module):
    """Stacks channel-wise max and mean maps into a 2-channel tensor."""
    def forward(self, x):
        return torch.cat((torch.max(x, 1)[0].unsqueeze(1), torch.mean(x, 1).unsqueeze(1)), dim=1)


class SpatialGate(nn.Module):
    """Spatial attention: a 7x7 convolution over the pooled 2-channel map."""
    def __init__(self):
        super(SpatialGate, self).__init__()
        kernel_size = 7
        self.compress = ChannelPool()
        self.spatial = BasicConv(2, 1, kernel_size, stride=1, padding=(kernel_size - 1) // 2, relu=False)

    def forward(self, x):
        x_compress = self.compress(x)
        x_out = self.spatial(x_compress)
        scale = torch.sigmoid(x_out)  # broadcasts over the channel dimension
        return x * scale

class CBAM(nn.Module):
    """CBAM: channel attention followed by optional spatial attention."""
    def __init__(self, gate_channels, reduction_ratio=16, pool_types=['avg', 'max'], no_spatial=False):
        super(CBAM, self).__init__()
        self.ChannelGate = ChannelGate(gate_channels, reduction_ratio, pool_types)
        self.no_spatial = no_spatial
        if not no_spatial:
            self.SpatialGate = SpatialGate()

    def forward(self, x):
        x_out = self.ChannelGate(x)
        if not self.no_spatial:
            x_out = self.SpatialGate(x_out)
        return x_out

class ResNetWithCBAM(nn.Module):
    def __init__(self, pretrained=True):
        super().__init__()
        resnet = models.resnet50(pretrained=pretrained)
        modules = list(resnet.children())[:-2]  # drop the average-pooling and FC layers
        self.backbone = nn.Sequential(*modules)
        # One CBAM block after each residual stage; channel counts match ResNet-50.
        self.cbam1 = CBAM(256)
        self.cbam2 = CBAM(512)
        self.cbam3 = CBAM(1024)
        self.cbam4 = CBAM(2048)

    def forward(self, inputs):
        x = self.backbone[:5](inputs)  # conv1, bn1, relu, maxpool, layer1
        x = self.cbam1(x)
        x = self.backbone[5:6](x)      # layer2
        x = self.cbam2(x)
        x = self.backbone[6:7](x)      # layer3
        x = self.cbam3(x)
        x = self.backbone[7:](x)       # layer4
        x = self.cbam4(x)
        return x
```
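As a quick sanity check, the model can be run on a random batch. This is a minimal sketch, assuming a standard 224×224 input; `pretrained=False` is used here only to avoid a weight download:
```python
# Smoke test (illustrative usage): for a 224x224 input, the backbone
# output is a 2048-channel feature map of spatial size 7x7.
model = ResNetWithCBAM(pretrained=False)
model.eval()
with torch.no_grad():
    features = model(torch.randn(2, 3, 224, 224))
print(features.shape)  # torch.Size([2, 2048, 7, 7])
```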
This implementation inserts a CBAM block after each stage of the ResNet-50 backbone. The `ChannelGate` reweights feature maps according to per-channel importance, while the `SpatialGate` reweights them according to per-location importance.
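Because the backbone drops ResNet's average-pooling and fully connected layers, `ResNetWithCBAM` returns raw feature maps rather than logits. For classification, a head can be attached on top; the wrapper below is a hypothetical sketch (the name `CBAMClassifier` and the `num_classes` parameter are illustrative assumptions, not part of the original code):
```python
class CBAMClassifier(nn.Module):
    """Hypothetical classification head on top of ResNetWithCBAM;
    the class name and num_classes default are illustrative assumptions."""
    def __init__(self, num_classes=10, pretrained=True):
        super().__init__()
        self.features = ResNetWithCBAM(pretrained=pretrained)
        self.pool = nn.AdaptiveAvgPool2d(1)     # global average pooling
        self.fc = nn.Linear(2048, num_classes)  # 2048 = ResNet-50 stage-4 channels

    def forward(self, x):
        x = self.pool(self.features(x))         # (N, 2048, 1, 1)
        return self.fc(torch.flatten(x, 1))     # (N, num_classes)
```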