Script to convert Grok-1 weights from raw JAX pickle files. #7058

Open

heiner wants to merge 13 commits into ggerganov:master from heiner:master

+467 −0

Commits on May 25, 2024

Script to convert Grok-1 weights from raw JAX pickle files.

heiner committed May 25, 2024
Commit 6ddf93b

Don't split MoE weights.
```
As per ggerganov#7058 (comment).
This helps avoid a memcopy when running.
```
heiner committed May 25, 2024
Commit 3c57743

Use only one list of weight names, with values from the gguf module.
```
This saves weights in the order in which they are in the Grok-1 files.
Since we operate weight-by-weight now, we no longer need caches and
name2key translations.

Per reviewer request, I also moved to using keys in gguf.TENSOR_NAMES.
```
heiner committed May 25, 2024
Commit 0842763

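To illustrate the single name list described in the commit above, here is a minimal sketch of expanding per-layer name templates in layer order. The `TENSOR_NAMES` dictionary below is a hypothetical stand-in that mimics the `blk.{bid}.…` template style of `gguf.TENSOR_NAMES` (the real table is keyed by an enum rather than by strings), and `tensor_names` is an illustrative helper, not a function from the PR.

```python
# Hypothetical stand-in for a few entries of gguf.TENSOR_NAMES.
# Assumption: templates use "{bid}" as the placeholder for the block index.
TENSOR_NAMES = {
    "TOKEN_EMBD": "token_embd",
    "ATTN_Q": "blk.{bid}.attn_q",
    "FFN_GATE_EXP": "blk.{bid}.ffn_gate_exps",
}


def tensor_names(n_layers: int):
    """Yield GGUF-style tensor names, one layer after another."""
    yield TENSOR_NAMES["TOKEN_EMBD"] + ".weight"
    for bid in range(n_layers):
        for key in ("ATTN_Q", "FFN_GATE_EXP"):
            # Fill in the block index for this layer.
            yield TENSOR_NAMES[key].format(bid=bid) + ".weight"


if __name__ == "__main__":
    for name in tensor_names(2):
        print(name)
```

Because names are generated one at a time, a converter built this way can process and write each weight as it is encountered, without a cache or a separate name-to-key lookup.
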
Update convert_grok.py to use logging module

mofosyne authored and heiner committed May 25, 2024
Commit 5bc4f10

Move print to logging: Fixes.

heiner committed May 25, 2024
Commit d894497

Address review comments by foldl.

heiner committed May 25, 2024
Commit ef671c6

Don't multiply embeddings with embedding_multiplier_scale as it happens in llama.cpp.
heiner committed May 25, 2024
Commit 9a0629d

Fix layer order.

heiner committed May 25, 2024
Commit f177b65

Use Q8_0 quantization from gguf module.
```
This makes tensors exactly as in https://huggingface.co/Arki05/Grok-1-GGUF/tree/main/Q8_0
```
heiner committed May 25, 2024
Commit e2f13a3

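For readers unfamiliar with Q8_0, the following is a minimal pure-Python sketch of the block layout: 32 weights share one float16 scale `d`, followed by 32 signed 8-bit values, for 34 bytes per block. The function names are illustrative and not taken from the PR or from the gguf module, and the arithmetic here uses Python doubles rather than float32, so results may differ from `gguf` in rare rounding edge cases.

```python
import math
import struct

QK8_0 = 32  # number of weights per Q8_0 block


def round_half_away(v: float) -> int:
    """Round to the nearest integer, with ties away from zero."""
    return int(math.copysign(math.floor(abs(v) + 0.5), v))


def quantize_block_q8_0(values: list[float]) -> bytes:
    """Pack 32 floats into one 34-byte block: fp16 scale + 32 int8 values."""
    assert len(values) == QK8_0
    d = max(abs(v) for v in values) / 127.0  # scale maps the largest value to +-127
    inv_d = 1.0 / d if d != 0.0 else 0.0     # an all-zero block keeps scale 0
    qs = [round_half_away(v * inv_d) for v in values]
    # "<" = little-endian, no padding; "e" = float16; "32b" = 32 signed bytes.
    return struct.pack("<e32b", d, *qs)


def dequantize_block_q8_0(block: bytes) -> list[float]:
    """Recover approximate floats from one 34-byte block."""
    d, *qs = struct.unpack("<e32b", block)
    return [d * q for q in qs]
```

Dequantizing a block reproduces the inputs only approximately, since both the scale (float16) and the values (int8) are rounded.
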
More constants from gguf.

heiner committed May 25, 2024
Commit 60b29ea

Write tensors in layer order.

heiner committed May 25, 2024
Commit 0a1ef11

Move noqa comment to where the latest flake8 likes it.

heiner committed May 25, 2024
Commit abc958b

Implement Q8_0 quantization fully in PyTorch.
```
This is equivalent to gguf.quantize_q8_0 but doesn't round-trip to
Numpy.
```
heiner committed May 25, 2024
Commit 739648f

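A rough sketch of what a NumPy-free Q8_0 quantizer might look like in PyTorch is given below. This is not the code from the PR: the function name and the output layout (one row of 34 bytes per block) are assumptions made for illustration, and exact bit-for-bit agreement with `gguf.quantize_q8_0` is not verified here.

```python
import torch

QK8_0 = 32  # number of weights per Q8_0 block


def quantize_q8_0(weights: torch.Tensor) -> torch.Tensor:
    """Return Q8_0 blocks as a uint8 tensor of shape (n_blocks, 34).

    Assumes the number of elements is a multiple of 32.
    """
    blocks = weights.to(torch.float32).reshape(-1, QK8_0)
    # One scale per block, chosen so the largest magnitude maps to 127.
    d = blocks.abs().amax(dim=1, keepdim=True) / 127.0
    inv_d = torch.where(d == 0, torch.zeros_like(d), 1.0 / d)
    scaled = blocks * inv_d
    # torch.round rounds ties to even; emulate ties-away-from-zero instead.
    qs = torch.sign(scaled) * torch.floor(scaled.abs() + 0.5)
    d_bytes = d.to(torch.float16).view(torch.uint8)  # (n_blocks, 2)
    q_bytes = qs.to(torch.int8).view(torch.uint8)    # (n_blocks, 32)
    return torch.cat([d_bytes, q_bytes], dim=1)
```

Keeping every step as a tensor operation means the data never has to be converted to a NumPy array and back, which is the point the commit message makes.
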
