Wrong tokens / second #852

EugeoSynthesisThirtyTwo · 2024-05-18T19:07:19Z

It says

Processing Prompt [BLAS] (1676 / 1676 tokens)
Generating (78 / 387 tokens)
(EOS token triggered!)
(Special Stop Token Triggered! ID:128009)
CtxLimit: 1754/8192, Process:25.05s (14.9ms/T = 66.89T/s), Generate:59.05s (152.6ms/T = 6.55T/s), Total:84.11s (4.60T/s)

But 6.55T/s is the speed that would have been achieved if the model generated 387 tokens. The model actually generated only 78 tokens, so the real generation speed is 78 / 59.05 = 1.32 tokens / s

The text was updated successfully, but these errors were encountered:

LostRuins · 2024-05-19T02:30:43Z

Will try to fix

LostRuins · 2024-05-24T10:36:36Z

Can you see if the latest version solves this issue?

EugeoSynthesisThirtyTwo · 2024-05-28T08:43:01Z

Can you see if the latest version solves this issue?

It's good thank you

However, as you can see, if I abort the generation, there is a new log "Generating (301 / 300 tokens)" which is wrong. I don't know if it's related. Let me know if I should open a new issue for this.

LostRuins · 2024-05-28T10:40:10Z

Don't worry about that, probably just a minor thing.

LostRuins added the bug Something isn't working label May 19, 2024

EugeoSynthesisThirtyTwo closed this as completed May 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wrong tokens / second #852

Wrong tokens / second #852

EugeoSynthesisThirtyTwo commented May 18, 2024

LostRuins commented May 19, 2024

LostRuins commented May 24, 2024

EugeoSynthesisThirtyTwo commented May 28, 2024

LostRuins commented May 28, 2024

Wrong tokens / second #852

Wrong tokens / second #852

Comments

EugeoSynthesisThirtyTwo commented May 18, 2024

LostRuins commented May 19, 2024

LostRuins commented May 24, 2024

EugeoSynthesisThirtyTwo commented May 28, 2024

LostRuins commented May 28, 2024