Pull requests: ggerganov/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add Note to Readme that LLaMA 3 is Not Supported for convert.py
#7065
opened May 3, 2024 by
lyledean1
Loading…
Script to convert Grok-1 weights from raw JAX pickle files.
#7058
opened May 3, 2024 by
heiner
Loading…
BPE pretokenizer - add support for command-r-plus and command-r models
#7041
opened May 2, 2024 by
sealad886
Loading…
Bug fix for server crash if first token is the stop word and asking for logprobs
#7038
opened May 2, 2024 by
maor-ps
Loading…
tests : add test-tokenizer-0.sh
high priority
Very important issue
#7036
opened May 2, 2024 by
ggerganov
Loading…
convert-hf : reduce repeated boilerplate from write_tensors
need feedback
Testing and feedback with results are needed
refactoring
Refactoring
#7031
opened May 1, 2024 by
compilade
Loading…
3 of 18 tasks
convert.py: When --vocab-only is passed, generate false but valid params
#7027
opened May 1, 2024 by
20kdc
Loading…
docs: Fix typo and update description for --embeddings flag
#7026
opened May 1, 2024 by
louixs
Loading…
Update Server's README with undocumented options for RoPE, YaRN, and KV cache quantization
#7013
opened Apr 30, 2024 by
K-Mistele
Loading…
new tokenizer-verifier tool to check gguf tokenizer parameters
#6988
opened Apr 29, 2024 by
anisse
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.