-
Notifications
You must be signed in to change notification settings - Fork 8.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Script to convert Grok-1 weights from raw JAX pickle files. #7058
Open
heiner
wants to merge
13
commits into
ggerganov:master
Choose a base branch
from
heiner:master
base: master
Could not load branches
Branch not found: {{ refName }}
Could not load tags
Nothing to show
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+467
−0
Commits on May 25, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 6ddf93b - Browse repository at this point
Copy the full SHA 6ddf93bView commit details -
As per ggerganov#7058 (comment). This helps avoid a memcopy when running.
Configuration menu - View commit details
-
Copy full SHA for 3c57743 - Browse repository at this point
Copy the full SHA 3c57743View commit details -
Use only one list of weight names, with values from the gguf module.
This saves weights in the order in which they are in the Grok-1 files. Since we operate weight-by-weight now, we no longer need caches and name2key translations. Per reviewer request, I also moved to using keys in gguf.TENSOR_NAMES.
Configuration menu - View commit details
-
Copy full SHA for 0842763 - Browse repository at this point
Copy the full SHA 0842763View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5bc4f10 - Browse repository at this point
Copy the full SHA 5bc4f10View commit details -
Configuration menu - View commit details
-
Copy full SHA for d894497 - Browse repository at this point
Copy the full SHA d894497View commit details -
Configuration menu - View commit details
-
Copy full SHA for ef671c6 - Browse repository at this point
Copy the full SHA ef671c6View commit details -
Don't multiply embeddings with embedding_multiplier_scale as it happe…
…ns in llama.cpp.
Configuration menu - View commit details
-
Copy full SHA for 9a0629d - Browse repository at this point
Copy the full SHA 9a0629dView commit details -
Configuration menu - View commit details
-
Copy full SHA for f177b65 - Browse repository at this point
Copy the full SHA f177b65View commit details -
Use Q8_0 quantization from gguf module.
This makes tensors exactly as in https://huggingface.co/Arki05/Grok-1-GGUF/tree/main/Q8_0
Configuration menu - View commit details
-
Copy full SHA for e2f13a3 - Browse repository at this point
Copy the full SHA e2f13a3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 60b29ea - Browse repository at this point
Copy the full SHA 60b29eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0a1ef11 - Browse repository at this point
Copy the full SHA 0a1ef11View commit details -
Configuration menu - View commit details
-
Copy full SHA for abc958b - Browse repository at this point
Copy the full SHA abc958bView commit details -
Implement Q8_0 quantization fully in PyTorch.
This is equivalent to gguf.quantize_q8_0 but doesn't round-trip to Numpy.
Configuration menu - View commit details
-
Copy full SHA for 739648f - Browse repository at this point
Copy the full SHA 739648fView commit details
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.