cooper12121

Follow

Neo LLama cooper12121

Follow

🎓 Postgrad at WHU 🚀 Unleashing the power of LLM & supercharging MoE efficiency 🎯 Graduating in 2025 💡 Seeking PhD opportunities.

6 followers · 23 following

WuHan University
WuHan
https://scholar.google.com/citations?user=eoUnS60AAAAJ&hl=en&authuser=1
@gaoqiang_nlp

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Block or Report

Block or report cooper12121

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

llama3-Chinese llama3-Chinese Public

对llama3进行中文全参预训练，区别于其他使用lora预训练的项目。

Python 11
llama3-8x8b-MoE llama3-8x8b-MoE Public

Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b MoE model based on llama3.

Python 15 2