
how to use stage2 ckpt fine-tuning stage3? #102

Open
linqinguang opened this issue Apr 30, 2024 · 1 comment

Comments

@linqinguang

First, I modified scripts/llama/train/stage_1_2_full_v7b_336_hr_768.sh, changing the parameter "--model_name_or_path" to the stage-2 checkpoint "MGM-7B", and obtained a LoRA model path.
Afterwards, I used scripts/merge_lora_weights.py to merge the base model and the LoRA weights, but found that it does not work. Compared to LLaVA, the mgm.model.builder.load_pretrained_model method seems to be missing several components: it does not load the PeftModel.
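For reference, this is roughly what I expected the merge step to do. It is only a sketch: the paths are placeholders, and MGM may need its own model class from mgm.model instead of AutoModelForCausalLM.

```python
# Minimal sketch of manually folding a PEFT LoRA adapter into the stage-2
# base weights, assuming the stage-3 run saved a standard PEFT adapter.
# "MGM-7B", "./checkpoints/stage3-lora" and the output path are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_path = "MGM-7B"                          # stage-2 full checkpoint
lora_path = "./checkpoints/stage3-lora"       # LoRA output of stage 3
save_path = "./checkpoints/MGM-7B-stage3-merged"

# Load the stage-2 base weights (MGM's own model class may be required here).
base = AutoModelForCausalLM.from_pretrained(base_path, torch_dtype=torch.float16)

# Attach the LoRA adapter, then merge its weights back into the base model.
model = PeftModel.from_pretrained(base, lora_path)
model = model.merge_and_unload()

# Save a plain merged checkpoint that load_pretrained_model can read
# without any PEFT support.
model.save_pretrained(save_path)
AutoTokenizer.from_pretrained(base_path).save_pretrained(save_path)
```

Note that LLaVA-style LoRA checkpoints usually also save non-LoRA trainables (e.g. the projector) separately, which a merge script would presumably need to load back into the base model as well.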

@yanwei-li
Member

Hi, please refer to issue #49 for continuous fine-tuning. All our models are fully fine-tuned; I did not actually try LoRA. This could be checked and supported soon.
