GGUF Conversion - codegemma 2b and vocab for FIM / infill #7205
-
Hello, I am to fine-tuneing codegemma by Google. When I convert to GGUF, I believe that the tokens or configuration for FIM / Infill are going missing. When I run: When I run: I am using the following to perform conversion and quantization:
How can I convert to GGUF and ensure that /infill is supported? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
Update: I ran this:
This stops it segfaulting, But I get a garbage response. (words of random letters) |
Beta Was this translation helpful? Give feedback.
I found the problem. Using your docker image with
--convert
, it detects codegemma as 'llama' architecture.If I override the entrypoint to
/app/convert-hf-to-gguf.py
on the same docker image, it gets the architecture correct.I also set the working directory to that of my image.
When I did this, I got a perfect conversion and did not have to set metadata myself.