Which file is the lora weight? #108

Open
hxx-who opened this issue Jan 17, 2024 · 3 comments

Comments

@hxx-who

hxx-who commented Jan 17, 2024

Hi, great work!
I was wondering what the difference is between the various model files: bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt, mp_rank_00_model_states.pt, the pytorch_model.bin produced by zero_to_fp32.py, and the file produced by merge_lora_weights_and_save_hf_model.py.
Which of them contains the LoRA weights? Or are they all just different formats of the LoRA weights?
Thanks!

@hxx-who

hxx-who commented Jan 17, 2024

And which of them is the full model weight?

@GaoXiaoshan

+1

@GaoXiaoshan

I ran into the same problem.
If you train with DeepSpeed, save the model, and run python zero_to_fp32.py, you get pytorch_model.bin. That pytorch_model.bin is the full model weight, so there is no need to merge.
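
One way to check for yourself which weights a given file holds is to look at the parameter names in its state dict. The sketch below is only an illustration, not something from this repo: it assumes PEFT-style naming, where adapter parameters have "lora_" in their names (e.g. lora_A, lora_B), and the example key names are made up. The helper classify_keys is hypothetical.

```python
from collections import Counter


def classify_keys(keys):
    """Count parameter names that look like LoRA adapters vs. everything else.

    Assumption: adapter parameters follow PEFT-style naming and contain
    the substring "lora_" (e.g. "lora_A", "lora_B").
    """
    counts = Counter()
    for name in keys:
        kind = "lora" if "lora_" in name else "base"
        counts[kind] += 1
    return counts


# Made-up key names, for illustration only.
example_keys = [
    "base_model.model.layers.0.self_attn.q_proj.weight",
    "base_model.model.layers.0.self_attn.q_proj.lora_A.default.weight",
    "base_model.model.layers.0.self_attn.q_proj.lora_B.default.weight",
]

counts = classify_keys(example_keys)
print(dict(counts))
```

With a real checkpoint you would probably load it with torch.load("pytorch_model.bin", map_location="cpu") and pass state_dict.keys() to the same function. If the "lora" count is zero, the file likely holds only plain (base or already merged) weights; if it is non-zero, the adapters are still stored separately from the base weights.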
