Skip to content
View cooper12121's full-sized avatar
Block or Report

Block or report cooper12121

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. llama3-Chinese llama3-Chinese Public

    ๅฏนllama3่ฟ›่กŒไธญๆ–‡ๅ…จๅ‚้ข„่ฎญ็ปƒ๏ผŒๅŒบๅˆซไบŽๅ…ถไป–ไฝฟ็”จlora้ข„่ฎญ็ปƒ็š„้กน็›ฎใ€‚

    Python 11

  2. llama3-8x8b-MoE llama3-8x8b-MoE Public

    Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b MoE model based on llama3.

    Python 15 2