Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

SFT data and pretrain data problem #126

Open
Emperorizzis opened this issue Dec 11, 2023 · 0 comments
Open

SFT data and pretrain data problem #126

Emperorizzis opened this issue Dec 11, 2023 · 0 comments

Comments

@Emperorizzis
Copy link

The processed data size is 55G.

Are you sure about that size?
Or can you provide processed SFT data link and pre-training data link separately?

Thanks for open source. 馃檹馃徎

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant