如何设置在原有的pytorch_model.bin基础上进行微调训练？ #204

hdjlcbz · 2024-10-08T09:02:59Z

怎么在原有的模型基础上进行微调训练？

foreverhell · 2024-10-18T08:39:58Z

训练代码这个部分

把权重导入
weights = torch.load(unet_path)
unet.load_state_dict(weights)

hnsywangxin · 2024-10-22T09:07:07Z

训练代码这个部分把权重导入 weights = torch.load(unet_path) unet.load_state_dict(weights)

按照该代码加载了权重，光unet显存占用都将近30G,然后bs=1都会报OOM(我每个卡40G显存)，请问你也是消耗这么多显存嘛

foreverhell · 2024-10-22T09:14:25Z

训练代码这个部分把权重导入 weights = torch.load(unet_path) unet.load_state_dict(weights)

按照该代码加载了权重，光unet显存占用都将近30G,然后bs=1都会报OOM(我每个卡40G显存)，请问你也是消耗这么多显存嘛

是，weights可以先放在CPU上，保证unet在GPU就行

hnsywangxin · 2024-10-22T12:17:11Z

训练代码这个部分把权重导入 weights = torch.load(unet_path) unet.load_state_dict(weights)

按照该代码加载了权重，光unet显存占用都将近30G,然后bs=1都会报OOM(我每个卡40G显存)，请问你也是消耗这么多显存嘛

是，weights可以先放在CPU上，保证unet在GPU就行

已解决，谢谢

sswax000643 · 2025-01-13T02:29:05Z

训练代码这个部分把权重导入 weights = torch.load(unet_path) unet.load_state_dict(weights)

按照该代码加载了权重，光unet显存占用都将近30G,然后bs=1都会报OOM(我每个卡40G显存)，请问你也是消耗这么多显存嘛

是，weights可以先放在CPU上，保证unet在GPU就行

已解决，谢谢

请问是如何解决的呢？可以分享下吗，我现在也是加载权重就爆显存

hnsywangxin · 2025-01-14T07:45:10Z

@sswax000643 问下gpt，或者谷歌查一下，有个关键字参数，这个不难

aidenyzhang closed this as completed Nov 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

如何设置在原有的pytorch_model.bin基础上进行微调训练？ #204

如何设置在原有的pytorch_model.bin基础上进行微调训练？ #204

hdjlcbz commented Oct 8, 2024

foreverhell commented Oct 18, 2024

hnsywangxin commented Oct 22, 2024

foreverhell commented Oct 22, 2024

hnsywangxin commented Oct 22, 2024

sswax000643 commented Jan 13, 2025

hnsywangxin commented Jan 14, 2025

如何设置在原有的pytorch_model.bin基础上进行微调训练？ #204

如何设置在原有的pytorch_model.bin基础上进行微调训练？ #204

Comments

hdjlcbz commented Oct 8, 2024

foreverhell commented Oct 18, 2024

hnsywangxin commented Oct 22, 2024

foreverhell commented Oct 22, 2024

hnsywangxin commented Oct 22, 2024

sswax000643 commented Jan 13, 2025

hnsywangxin commented Jan 14, 2025