速度和sglang相比哪个快？ #8

njhouse365 · 2024-03-06T03:21:59Z

No description provided.

depenglee1707 · 2024-03-07T02:48:54Z

Just for the speed of inference, I guess sglang will be better, since it direct adopt RadixAttention and the same thing still in our roadmap.

Our project is more focus on deploying models in production level, since we introduced the scale, serverless and model deployment template .etc and other ops things to help people maintain the inferences easier.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

速度和sglang相比哪个快？ #8

速度和sglang相比哪个快？ #8

njhouse365 commented Mar 6, 2024

depenglee1707 commented Mar 7, 2024

速度和sglang相比哪个快？ #8

速度和sglang相比哪个快？ #8

Comments

njhouse365 commented Mar 6, 2024

depenglee1707 commented Mar 7, 2024