Wrong tokens / second #852

Closed
EugeoSynthesisThirtyTwo opened this issue May 18, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@EugeoSynthesisThirtyTwo

It says

Processing Prompt [BLAS] (1676 / 1676 tokens)
Generating (78 / 387 tokens)
(EOS token triggered!)
(Special Stop Token Triggered! ID:128009)
CtxLimit: 1754/8192, Process:25.05s (14.9ms/T = 66.89T/s), Generate:59.05s (152.6ms/T = 6.55T/s), Total:84.11s (4.60T/s)

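But 6.55 T/s is the speed that would have been achieved if the model had generated all 387 tokens. The model actually generated only 78 tokens before the EOS token stopped it, so the real generation speed is 78 / 59.05 ≈ 1.32 tokens/s. The Total figure has the same problem: 4.60 T/s is 387 / 84.11, not 78 / 84.11.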
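
The arithmetic can be checked with a short Python sketch. This is not KoboldCpp's actual code: the helper name `tokens_per_second` is hypothetical, and the numbers are copied from the log above. It only shows the difference between dividing by the requested token count and dividing by the count actually generated.

```python
def tokens_per_second(token_count: int, elapsed_seconds: float) -> float:
    """Return throughput in tokens per second (hypothetical helper)."""
    if elapsed_seconds <= 0:
        return 0.0  # avoid dividing by zero on an empty or instant run
    return token_count / elapsed_seconds


# Figures taken from the log line in this report.
requested_tokens = 387    # generation limit shown in "Generating (78 / 387 tokens)"
generated_tokens = 78     # tokens produced before EOS was triggered
generate_seconds = 59.05  # "Generate:59.05s"
total_seconds = 84.11     # "Total:84.11s"

# What the log reports: the rate computed from the requested limit.
reported = tokens_per_second(requested_tokens, generate_seconds)

# What it should report: the rate computed from tokens actually generated.
actual = tokens_per_second(generated_tokens, generate_seconds)
actual_total = tokens_per_second(generated_tokens, total_seconds)

print(f"reported generate speed: {reported:.2f} T/s")
print(f"actual generate speed:   {actual:.2f} T/s")
print(f"actual total speed:      {actual_total:.2f} T/s")
```

With these inputs the reported figure matches the 6.55 T/s in the log, while the rate based on the 78 generated tokens comes out at about 1.32 T/s.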

@LostRuins
Owner

Will try to fix

@LostRuins added the bug label May 19, 2024
@LostRuins
Owner

Can you see if the latest version solves this issue?

@EugeoSynthesisThirtyTwo
Author

> Can you see if the latest version solves this issue?

It's good, thank you.
[image]

However, as you can see, if I abort the generation, there is a new log line, "Generating (301 / 300 tokens)", which is wrong because the count exceeds the limit. I don't know if it's related. Let me know if I should open a new issue for this.

@LostRuins
Owner

Don't worry about that, probably just a minor thing.
