执行model.load_state_dict报错map_location=torch.device(‘cpu‘)以及Unexpected key(s) in state_dict

作者：知新_RL | 2024-07-19 12:34:06

踩

本蒟蒻原先是在有GPU的服务器上训练得到了模型，现在想在没有GPU的服务器部署网页，需要加载该模型，当运行了下面代码时

model.load_state_dict(torch.load(output_dir))

报错信息如下

RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False. If you are running on a CPU-only machine, please use torch.load with map_location=torch.device('cpu') to map your storages to the CPU.

是因为没有将state_dict转移到CPU上，在PyTorch中，可以通过指定map_location参数为torch.device('cpu')来实现。

model.load_state_dict(torch.load(output_dir, map_location=torch.device('cpu')))

修改后再次运行到这一句，发现又有错误，报错信息如下：

model.load_state_dict(torch.load(output_dir, map_location=torch.device('cpu')))
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 2041, in load_state_dict
    raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for Model:
        Unexpected key(s) in state_dict: "encoder.embeddings.position_ids".

我打印出了两台服务器中state_dict的内容

for k, v in torch.load(output_dir, map_location=torch.device('cpu').items():
    print(k)

打印的内容都是一样的，都是下面的内容

encoder.embeddings.position_ids
encoder.embeddings.word_embeddings.weight
encoder.embeddings.position_embeddings.weight
encoder.embeddings.token_type_embeddings.weight
encoder.embeddings.LayerNorm.weight
encoder.embeddings.LayerNorm.bias
encoder.encoder.layer.0.attention.self.query.weight
encoder.encoder.layer.0.attention.self.query.bias
encoder.encoder.layer.0.attention.self.key.weight
encoder.encoder.layer.0.attention.self.key.bias
encoder.encoder.layer.0.attention.self.value.weight
encoder.encoder.layer.0.attention.self.value.bias
encoder.encoder.layer.0.attention.output.dense.weight
encoder.encoder.layer.0.attention.output.dense.bias
...
内容太多不展示了

仔细查看提示信息，发现是有一个未知的Key，我把它删掉竟然就跑通了

new_state_dict = torch.load(output_dir, map_location=torch.device('cpu'))
del new_state_dict['encoder.embeddings.position_ids']
model.load_state_dict(new_state_dict)

声明：本文内容由网友自发贡献，不代表【wpsshop博客】立场，版权归原作者所有，本站不承担相应法律责任。如您发现有侵权的内容，请联系我们。转载请注明出处：https://www.wpsshop.cn/w/知新_RL/article/detail/851511