Expert Choice Visualization for Mixtral

This project visualizes which expert the Mixtral router selects for each token during text generation.

(A re-implementation of Figure 8 from the Mixtral of Experts paper; see Reference below.)

In the visualization, each token in a text sample is colored according to its first expert choice, i.e. the expert the router ranks highest for that token.

The code is kept simple so that it is easy to customize.
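The coloring rule described above can be sketched in a few lines. This is an illustrative sketch, not the code in main.py or viz.py: the names `EXPERT_COLORS`, `first_expert_choice`, and `colorize` are hypothetical, the logits are made-up numbers, and it assumes the per-token router logits for one layer have already been collected (the Mixtral model classes in transformers can return them when called with `output_router_logits=True`).

```python
import html

# Hypothetical palette: one color per expert (Mixtral-8x7B has 8 experts per layer).
EXPERT_COLORS = [
    "#e6194b", "#3cb44b", "#ffe119", "#4363d8",
    "#f58231", "#911eb4", "#46f0f0", "#f032e6",
]


def first_expert_choice(router_logits):
    """Return, for each token, the index of the expert with the highest logit."""
    return [max(range(len(row)), key=row.__getitem__) for row in router_logits]


def colorize(tokens, experts, colors=EXPERT_COLORS):
    """Wrap each token in a <span> whose background color encodes its expert."""
    spans = []
    for token, expert in zip(tokens, experts):
        spans.append(
            f'<span style="background-color: {colors[expert]}" '
            f'title="expert {expert}">{html.escape(token)}</span>'
        )
    return "".join(spans)


# Made-up example: 4 tokens, 8 router logits each (one layer).
tokens = ["Up", ",", " up", "</s>"]
router_logits = [
    [0.1, 2.0, -1.0, 0.3, 0.0, 0.0, 0.0, 0.0],
    [1.5, 0.2, 0.1, 0.0, 0.0, 0.0, 0.0, 0.9],
    [0.0, 0.0, 0.0, 0.0, 0.0, 3.1, 0.0, 0.0],
    [-0.5, -0.2, -0.1, -0.3, -0.9, -0.8, -0.7, -0.05],
]

experts = first_expert_choice(router_logits)
html_out = colorize(tokens, experts)
print(experts)
print(html_out)
```

Tokens are HTML-escaped before being wrapped, so special tokens such as `</s>` show up as text instead of being interpreted as markup.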

Requirements

# Python Library
transformers 4.39.3

Example

Prompt

# prompt (instruction + response)
<s> [INST] Act as Superman and give me a greeting [/INST] Up, up, and away! Greetings, citizen! It's a bird, it's a plane, no, it's Superman here to bring some super smiles to your day! How can I assist you today?</s>

Output

Running main.py produces one HTML file per layer (n_layers files; Mixtral-8x7B has 32 layers, each with 8 experts). Each file shows the colorized text spans for that layer.

images/router_choice_layer_0.html
images/router_choice_layer_1.html
...
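The one-file-per-layer output can be sketched as below. This is a hedged illustration and not the repository's main.py: the helper name `write_layer_files` and the minimal HTML wrapper are assumptions, and only the file naming pattern is taken from the listing above. The demo writes to a temporary directory so that it does not touch an existing images folder.

```python
import tempfile
from pathlib import Path


def write_layer_files(layer_html, out_dir="images"):
    """Write one HTML file per layer and return the paths that were written.

    layer_html: list of HTML fragments, one per layer, in layer order.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for layer_idx, body in enumerate(layer_html):
        path = out_dir / f"router_choice_layer_{layer_idx}.html"
        # Minimal wrapper page; the real files may be structured differently.
        path.write_text(
            f"<html><body><p>{body}</p></body></html>", encoding="utf-8"
        )
        paths.append(path)
    return paths


# Demo with two placeholder fragments standing in for two layers.
with tempfile.TemporaryDirectory() as tmp:
    paths = write_layer_files(["<span>a</span>", "<span>b</span>"], out_dir=tmp)
    written = [p.name for p in paths]
    first_page = paths[0].read_text(encoding="utf-8")

print(written)
```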

For the first 2 layers of Mixtral-8x7B-Instruct-v0.1:

[Images: router choice visualizations for Layer 0 and Layer 1]

Reference

Inspired by Figure 8 of the Mixtral of Experts paper.
