Epoch counting problem when resuming training from a checkpoint (resume) #420

Open
Taichipeace opened this issue Apr 25, 2024 · 1 comment

Comments

@Taichipeace

First of all, thanks to everyone for providing such a useful tool.

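The confusion I have run into is this:
suppose I resume training from epoch 16.
After the resume, the epoch counter starts again from epoch 1
and runs until max_train_epochs,
so in effect 16 + max_train_epochs epochs are trained in total.
Along the way, this also overwrites the safetensors files that were saved before the resume,
which feels a bit awkward.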

Could this be changed so that:
when resuming from epoch 16,
the counter also starts from 16
and runs until max_train_epochs.
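
To make the request concrete, here is a minimal sketch of the counting I have in mind. It is not this project's actual code: `epochs_to_run`, `checkpoint_name`, the `start_epoch` parameter, and the file naming pattern are all hypothetical, and only `max_train_epochs` and the safetensors extension come from the description above.

```python
def epochs_to_run(max_train_epochs: int, start_epoch: int = 1) -> range:
    """Return the 1-based epoch numbers that still need to be trained.

    start_epoch is a hypothetical parameter: the epoch number at which
    counting restarts after a resume (1 means a fresh run).
    """
    if start_epoch < 1:
        raise ValueError("start_epoch must be >= 1")
    # The range is empty when start_epoch is already past max_train_epochs.
    return range(start_epoch, max_train_epochs + 1)


def checkpoint_name(output_name: str, epoch: int) -> str:
    """Build a per-epoch file name (the naming pattern is an assumption)."""
    return f"{output_name}-{epoch:06d}.safetensors"


if __name__ == "__main__":
    # Resuming at epoch 16 with max_train_epochs=20 trains epochs 16..20,
    # so files written for epochs 1..15 are never touched again.
    for epoch in epochs_to_run(max_train_epochs=20, start_epoch=16):
        print(epoch, checkpoint_name("model", epoch))
```

If the saved state was written after epoch 16 had already finished, the same sketch would be called with start_epoch=17 instead, so that the file saved for epoch 16 is left alone as well.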

Thanks again.

@Pevernow

Good question.
