Issues: hiyouga/LLaMA-Factory

Issues list

Output difference between LLaMA-Factory and llama.cpp pending This problem is yet to be addressed.
#3563 opened May 3, 2024 by anidh
1 task done
DPO format - Expected a string, got {}".format(value), got None pending This problem is yet to be addressed.
#3555 opened May 3, 2024 by Katehuuh
1 task done
FSDP QDoRa pending This problem is yet to be addressed.
#3550 opened May 2, 2024 by etemiz
1 task done
How to convert Dolphin-2.9 to LLaMA factory? solved This problem has been already solved.
#3535 opened May 1, 2024 by YixinSong-e
1 task done
Multi-node SFT keeps hanging here when fine-tuning llama3 8b pending This problem is yet to be addressed.
#3534 opened May 1, 2024 by gongye19
1 task done
DBRX using more gpu memory than mixtral 8x22B for fsdp+qlora pending This problem is yet to be addressed.
#3521 opened Apr 30, 2024 by mces89
1 task done
Got error when exporting model with quantization pending This problem is yet to be addressed.
#3516 opened Apr 29, 2024 by dickens88
1 task done
model.safetensor size changes in according to different finetuning methods pending This problem is yet to be addressed.
#3515 opened Apr 29, 2024 by hunt-47
CUDA out of memory for fsdp training pending This problem is yet to be addressed.
#3494 opened Apr 28, 2024 by v-yunbin
cannot use pure_bf16 with zero3 cpu offload pending This problem is yet to be addressed.
#3476 opened Apr 27, 2024 by mces89
1 task done
[Feature Request] Do we need a more flexible saving strategy? pending This problem is yet to be addressed.
#3472 opened Apr 26, 2024 by marko1616
fsdp-qlora yi-34B-chat throw error " ValueError: Cannot flatten integer dtype tensors" pending This problem is yet to be addressed.
#3470 opened Apr 26, 2024 by hellostronger
1 task done
deepspeed bug pending This problem is yet to be addressed.
#3461 opened Apr 26, 2024 by bravelyi
1 task done
Could you please share some tips with your rich experience? pending This problem is yet to be addressed.
#3452 opened Apr 25, 2024 by xiaochengsky
1 task done
SFT loss is inconsistent between zero2 and zero3 pending This problem is yet to be addressed.
#3442 opened Apr 25, 2024 by wsdmanonymous
1 task done
Quantized GPTQ model raises an error when called after being deployed as an OpenAI-style API pending This problem is yet to be addressed.
#3408 opened Apr 24, 2024 by ccp123456789
How exactly do you do DPO? pending This problem is yet to be addressed.
#3395 opened Apr 23, 2024 by XuanRen4470
1 task done
Issues of LLaMA3 SFT on multi-nodes pending This problem is yet to be addressed.
#3381 opened Apr 22, 2024 by Liusifei
1 task done
After training for a while, saving files fails with a folder "Access denied" message pending This problem is yet to be addressed.
#3359 opened Apr 20, 2024 by kynow2
1 task done