Visualize layer activations and weights to simplify the quantization process. #607

HIT-cwh · 2023-10-24T13:45:14Z

Usage

Draw the first type of plot which shows the absmax/absmean/max/mean/min value of a linear layer at different layers.

For instance, the code below presents the visualized results of computing the absolute maximum value across multiple tokens specifically for the input of all 'q_proj' linear layers within DecoderLayers 0 and 1

lmdeploy quant_visualization draw \
    --modes 1  \
    --pretrained_model_name_or_path internlm/internlm-chat-7b \
    --work_dir work_dir \
    --use_input \
    --key absmax \
    --linear_name q_proj \
    --layers 0,1

Output:

Draw the second type of plot which shows the relationship between activations and weights.

For instance, the code below presents the visualized results of the relationship between the input of all 'q_proj' linear layers within DecoderLayers 0 and 1, and their corresponding weights.

lmdeploy quant_visualization draw \
    --modes 2  \
    --pretrained_model_name_or_path internlm/internlm-chat-7b \
    --work_dir work_dir \
    --key absmax \
    --linear_name q_proj \
    --layers 0,1

Draw the third type of plot which is a boxplot showing the absmax/absmean/max/mean/min value of the input or output of a linear layer at different layers.

For instance, the code below displays a boxplot showcasing the absolute maximum values computed across multiple tokens, specifically for the input of all 'q_proj' linear layers.

lmdeploy quant_visualization draw \
    --modes 3  \
    --pretrained_model_name_or_path internlm/internlm-chat-7b \
    --work_dir work_dir \
    --key absmax \
    --linear_name q_proj \
    --use_input

Draw the fourth type of plot which shows the relationship between maximum and minimum values of activations.

For instance, the code below illustrates the computed maximum and minimum values across multiple tokens, specifically for the input of all 'q_proj' linear layers within DecoderLayers 0 and 1.

lmdeploy quant_visualization draw \
    --modes 4  \
    --pretrained_model_name_or_path internlm/internlm-chat-7b \
    --work_dir work_dir \
    --linear_name q_proj \
    --use_input
    --layers 0,1

lmdeploy/lite/apis/auto_awq.py

LZHgrla · 2023-10-25T09:16:51Z

Hi, @HIT-cwh
Do we support the visualization of the weight values?

HIT-cwh · 2023-10-27T08:01:27Z

Hi, @HIT-cwh Do we support the visualization of the weight values?

Support for this feature is currently in development and will be progressively enhanced in the forthcoming iterations.

lvhan028 · 2023-11-15T06:43:24Z

Can we use lmdeploy lite view to visualize the activation and weights? It's simpler than lmdeploy quant_visualization draw

lvhan028 · 2023-11-15T06:46:14Z

May add user guide about the usage of this great tool.

HIT-cwh · 2023-11-15T07:49:03Z

May add user guide about the usage of this great tool.

The commit that fixes the load ckpt bug has been split out. Please refer to pr690

HIT-cwh added 2 commits October 24, 2023 21:14

support drawing different types of plots for quantization

35cc4c7

fix load_checkpoint_in_model bug

2ab506b

pppppM self-requested a review October 24, 2023 14:03

pppppM reviewed Oct 24, 2023

View reviewed changes

lmdeploy/lite/apis/auto_awq.py Outdated Show resolved Hide resolved

add a utility for loading hf model from pretrained

133aee3

HIT-cwh added 5 commits October 25, 2023 18:39

fix bugs

00bde9f

fix savedir format

27a6235

fix savedir

4e3d7b2

Merge branch 'main' of github.com:InternLM/lmdeploy into draw

d22a8bc

support cli

4a2040e

support cli

eb356fe

lvhan028 self-requested a review November 15, 2023 06:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Visualize layer activations and weights to simplify the quantization process. #607

Visualize layer activations and weights to simplify the quantization process. #607

HIT-cwh commented Oct 24, 2023 •

edited

LZHgrla commented Oct 25, 2023

HIT-cwh commented Oct 27, 2023

lvhan028 commented Nov 15, 2023

lvhan028 commented Nov 15, 2023

HIT-cwh commented Nov 15, 2023

Visualize layer activations and weights to simplify the quantization process. #607

Are you sure you want to change the base?

Visualize layer activations and weights to simplify the quantization process. #607

Conversation

HIT-cwh commented Oct 24, 2023 • edited

Usage

LZHgrla commented Oct 25, 2023

HIT-cwh commented Oct 27, 2023

lvhan028 commented Nov 15, 2023

lvhan028 commented Nov 15, 2023

HIT-cwh commented Nov 15, 2023

HIT-cwh commented Oct 24, 2023 •

edited