Which file is the lora weight? #108

hxx-who · 2024-01-17T10:28:06Z

Hi, Great work!
I was wondering what is the difference between different model files, like bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt, mp_rank_00_model_states.pt, pytorch_model.bin which is obtained after the zero_to_fp32.py, and the file obtained after merge_lora_weights_and_save_hf_model.py.
Which of them is the lora weight? or they are just different formats of lora weight?
Thanks!

The text was updated successfully, but these errors were encountered:

hxx-who · 2024-01-17T10:28:59Z

and which of them is the full model weight?

GaoXiaoshan · 2024-05-07T07:59:56Z

+1

GaoXiaoshan · 2024-05-08T09:29:02Z

I have met the same problem.
If you train with deepspeed, save the model and run python zero_to_fp32.py, you will get the pytorch_model.bin. The pytorch_model.bin is the full model weight which means there is no need to merge.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Which file is the lora weight? #108

Which file is the lora weight? #108

hxx-who commented Jan 17, 2024

hxx-who commented Jan 17, 2024

GaoXiaoshan commented May 7, 2024

GaoXiaoshan commented May 8, 2024

Which file is the lora weight? #108

Which file is the lora weight? #108

Comments

hxx-who commented Jan 17, 2024

hxx-who commented Jan 17, 2024

GaoXiaoshan commented May 7, 2024

GaoXiaoshan commented May 8, 2024