Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

--freeze_backbone False? #164

Open
fyting opened this issue May 11, 2024 · 3 comments
Open

--freeze_backbone False? #164

fyting opened this issue May 11, 2024 · 3 comments

Comments

@fyting
Copy link

fyting commented May 11, 2024

Why does the file internvl_chat_v1_2_hermes2_yi34b_448_finetune.sh include --freeze_backbone False? Isn't the visual encoder supposed to be frozen during the pre-training phase?

@czczup
Copy link
Member

czczup commented May 16, 2024

Hello, this is the fine-tuning script. When we fine-tune, we open the entire model and train it.

@daiqing98
Copy link

daiqing98 commented May 25, 2024

Hello, this is the fine-tuning script. When we fine-tune, we open the entire model and train it.

May I ask if you do any ablation on this? Does open the vision module bring any notable benefits?

@czczup
Copy link
Member

czczup commented May 30, 2024

Yes, in my experiments, turning on the vision encoder was significantly better than freezing it, so in all recent experiments, I have turned on the vision encoder during the finetune phase.

For these hyperparameters, which modules are trained and which modules are frozen at each stage, you can find out on our blog: https://internvl.github.io/blog/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants