RuntimeError: Error(s) in loading state_dict for ArSSR: Unexpected key(s) in state_dict:

### 解决 PyTorch 加载 ArSSR 模型时遇到的 `Unexpected key(s) in state_dict` 错误当加载预训练模型权重时，如果出现 `Unexpected key(s) in state_dict` 的错误提示，通常意味着保存的模型状态字典中的键名与当前模型架构的状态字典中的键名不匹配。 #### 原因分析 1. **多 GPU 训练的影响**：如果模型是在多个 GPU 上使用 `DataParallel` 或者 `DistributedDataParallel` 进行训练并保存，则会带有前缀 `"module."`[^1]。 2. **额外元数据的存在**：有时保存的不仅仅是模型参数，还可能包含了优化器状态、学习率调度器等其他信息[^4]。 3. **不同版本间的差异**：不同的框架版本可能导致保存格式有所变化，进而引发键名不符的情况[^5]。 #### 解决策略 ##### 方法一：处理模块化命名空间问题对于由 `DataParallel` 导致的键名冲突，可以通过移除或添加 `"module."` 来修正： ```python from collections import OrderedDict def fix_key_names(checkpoint, remove_module_prefix=True): new_state_dict = OrderedDict() for k, v in checkpoint.items(): name = k.replace('module.', '') if remove_module_prefix else f'module.{k}' new_state_dict[name] = v return new_state_dict ``` ##### 方法二：提取特定部分作为新状态字典针对包含多余条目的情况，可以只保留所需的参数子集来构建新的状态字典对象： ```python import torch # 只取 params_ema 中的内容作为实际要加载的部分 state_dict = torch.load(model_path)['params_ema'] model.load_state_dict(state_dict, strict=True) ``` ##### 方法三：忽略缺失项和意外项设置 `strict=False` 参数允许跳过那些不存在于目标网络结构里的层及其对应的权值初始化操作： ```python try: model.load_state_dict(torch.load(path), strict=False) except RuntimeError as e: print(f"Some keys were not matched due to architecture differences:\n{str(e)}") ``` 通过上述方法之一应该能够有效解决大多数情况下由于键名不一致所引起的加载失败现象。具体应用哪种方式取决于实际情况以及个人需求偏好。

阅读全文

RuntimeError: Error(s) in loading state_dict for ArSSR: Unexpected key(s) in state_dict:

相关推荐

Python RuntimeError: thread.__init__() not called解决方法

RuntimeError: DataLoader worker (pid(s) 9528, 8320) exited unexpectedly

RuntimeError: Cannot run the event loop while another loop is running(目前没有解决)

RuntimeError: Error(s) in loading state_dict for BertNer: Unexpected key(s) in state_dict: "bert.embeddings.position_ids".

RuntimeError: Error(s) in loading state_dict for BertNer: Unexpected key(s) in state_dict: "bert.embeddings.position_ids".

RuntimeError: Error(s) in loading state_dict for UNet: Unexpected key(s) in state_dict:

RuntimeError: Error(s) in loading state_dict for YoloBody: Unexpected key(s) in state_dict:

RuntimeError: Exception: Error(s) in loading state_dict for FAST_LCF_ATEPC: Unexpected key(s) in state_dict:

RuntimeError: Error(s) in loading state_dict for MobileNetV2: Missing key(s) in state_dict: "classifier.1.weight", "classifier.1.bias". Unexpected key(s) in state_dict: "classifier.weight", "classifier.bias".

net.load_state_dict(torch.load(PATH))为什么报错RuntimeError: Error(s) in loading state_dict for Net: Unexpected key(s) in state_dict: "conv1.weight", "conv1.bias", "conv2.weight", "conv2.bias", "fc1.weight", "fc1.bias", "fc2.weight", "fc2.bias".

为什么RuntimeError: Error(s) in loading state_dict for Net: Unexpected key(s) in state_dict: "conv1.weight", "conv1.bias", "conv2.weight", "conv2.bias", "fc1.weight", "fc1.bias", "fc2.weight", "fc2.bias".

大家在看

NAND FLASH 控制器源码（verilog）

实体消歧系列文章.rar

matlab飞行轨迹代码-msa-toolkit:这是在MATLAB中开发的用于模拟火箭6自由度动力学的代码

qt打包程序(自定义打包界面及功能)

易语言WinSock模块应用

最新推荐

试谈商业电子商务师创业计划书撰写要求.doc

ASP.NET新闻管理系统：用户管理与内容发布功能

【实战派量化投资秘籍】：Pair Trading策略全方位解析

fpga中保持时间建立时间时序约束

Notepad2: 高效替代XP系统记事本的多功能文本编辑器

【mPower1203驱动故障全攻略】：排除新手疑难杂症，提升部署效率

keil5打不开

远程进程注入技术详解：DLL注入的实现步骤

【驱动安装背后的故事】：mPower1203机制深度剖析及优化技巧

tensorflow2.5.0 linux-aarch64.whl

Python RuntimeError: thread.init() not called解决方法