Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vllm 支持流式的batch推理吗? #5

Open
yungangwu opened this issue Nov 14, 2023 · 0 comments
Open

vllm 支持流式的batch推理吗? #5

yungangwu opened this issue Nov 14, 2023 · 0 comments

Comments

@yungangwu
Copy link

我看源码好像vllm还是一条一条的推理的,并不是一次计算所有的输入的。也没有看到文档说能否支持stream的batching推理。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant