remove chat template config in turbomind engine #1161

irexyc · 2024-02-20T03:54:51Z

Motivation

Align to torch engine, remove chat_template_mdoel in turbomind engine.

BC-breaking
When using turbomind engine directly, user needs to explicitly specify EngineGenerationConfig and set stop_words

lmdeploy/serve/async_engine.py

AllentDan

Tesed with

lmdeploy chat turbomind internlm2-chat-7b w/wo --chat-template
pipeline of internlm2-chat-7b w/wo --chat-template
api_server with chat-template

lvhan028 · 2024-03-26T11:51:59Z

lmdeploy/turbomind/chat.py

+        chat_template (str): user defined chat template
+        kwargs (dict): unused args
+    """ # noqa: E 501
+    print('unused kwargs', kwargs, sep='')


有必要打印么？

lvhan028 · 2024-03-26T11:52:15Z

lmdeploy/turbomind/chat.py

         cap: str = 'chat',
         tp: int = 1,
+         max_batch_size: int = 128,


max_batch_size 感觉不用作为参数

lvhan028 · 2024-03-26T11:52:25Z

lmdeploy/turbomind/chat.py

-    tm_model = tm.TurboMind.from_pretrained(
-        model_path,
+    engine_cfg = TurbomindEngineConfig(
+        max_batch_size=max_batch_size,


max_batch_size=1

irexyc added 2 commits February 19, 2024 12:03

remove .model of turbomind engine

01436be

update async engine init

0bab82f

irexyc added the BC-breaking label Feb 20, 2024

lvhan028 changed the title ~~Rm tm model~~ remove chat template config in turbomind engine Mar 19, 2024

lvhan028 requested review from AllentDan and lvhan028 March 21, 2024 06:23

irexyc added 2 commits March 21, 2024 09:52

update

3b0115c

remove unused

6ebb3c8

AllentDan reviewed Mar 22, 2024

View reviewed changes

lmdeploy/serve/async_engine.py Show resolved Hide resolved

AllentDan approved these changes Mar 22, 2024

View reviewed changes

irexyc added 4 commits March 22, 2024 02:31

remove chat_template_config

f26ee1b

remove chat_template_config

7e60b9c

Merge branch 'main' into rm-tm-model

a4abc2b

add max_batch_size for lmdeploy.chat cli

726cc4c

lvhan028 reviewed Mar 26, 2024

View reviewed changes

lmdeploy/turbomind/chat.py

cap: str = 'chat',

tp: int = 1,

max_batch_size: int = 128,

Copy link

Collaborator

lvhan028 Mar 26, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

max_batch_size 感觉不用作为参数

lvhan028 reviewed Mar 26, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove chat template config in turbomind engine #1161

remove chat template config in turbomind engine #1161

irexyc commented Feb 20, 2024

AllentDan left a comment

lvhan028 Mar 26, 2024

lvhan028 Mar 26, 2024

lvhan028 Mar 26, 2024

remove chat template config in turbomind engine #1161

Are you sure you want to change the base?

remove chat template config in turbomind engine #1161

Conversation

irexyc commented Feb 20, 2024

Motivation

AllentDan left a comment

Choose a reason for hiding this comment

lvhan028 Mar 26, 2024

Choose a reason for hiding this comment

lvhan028 Mar 26, 2024

Choose a reason for hiding this comment

lvhan028 Mar 26, 2024

Choose a reason for hiding this comment