Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

InternVL−Chat−V1.5-Int8的耗时是InternVL−Chat−V1.5的三倍吗? #157

Closed
wtl0207 opened this issue May 9, 2024 · 2 comments
Closed

Comments

@wtl0207
Copy link

wtl0207 commented May 9, 2024

InternVL−Chat−V1.5-Int8的耗时是InternVL−Chat−V1.5的三倍吗?我在A100上进行测试,同样的数据,InternVL−Chat−V1.5耗时550秒,InternVL−Chat−V1.5-Int8耗时1810秒

@czczup
Copy link
Member

czczup commented May 9, 2024

Int8虽然省显存了,但是推理会变慢

@czczup czczup closed this as completed May 19, 2024
@xylovezxy
Copy link

请问模型效果会变差吗

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants