Unify Efficient Fine-Tuning of 100+ LLMs
An unofficial https://bgm.tv app client for Android and iOS, built with React Native. An ad-free, hobby-driven, non-commercial, ACG-focused anime-tracking app in the spirit of Douban, serving as a third-party bgm.tv client. Redesigned for mobile, with many enhanced features that are hard to implement on the website, plus extensive customization options. Currently supports iOS / Android / WSA, phone / basic tablet layouts, light / dark themes, and the mobile web.
Official LISTEN.moe Android app
Mixture-of-Experts for Large Vision-Language Models
⭐ Moe-Counter Compatible Website Hit Counter Written in Gleam
Tutel MoE: An Optimized Mixture-of-Experts Implementation
This is the repo for the MixKABRN Neural Network (Mixture of Kolmogorov-Arnold Bit Retentive Networks), an attempt to first adapt it for training on text and later adjust it for other modalities.
japReader is an app for breaking down Japanese sentences and tracking vocabulary progress
Chinese Mixtral mixture-of-experts large language models (Chinese Mixtral MoE LLMs)
Community implementation of the paper "Multi-Head Mixture-of-Experts" in PyTorch
Node JS enka.network API wrapper written on TypeScript which provides localization, caching and convenience.
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
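To make the gating idea concrete, here is a minimal PyTorch sketch of the noisy top-k gate described in the Shazeer et al. paper (arXiv:1701.06538); the class and variable names are illustrative and are not the API of the repository above.

import torch
import torch.nn as nn
import torch.nn.functional as F

class NoisyTopKGate(nn.Module):
    """Noisy top-k gating in the spirit of arXiv:1701.06538 (illustrative sketch)."""
    def __init__(self, d_model: int, num_experts: int, k: int = 2):
        super().__init__()
        self.k = k
        self.w_gate = nn.Linear(d_model, num_experts, bias=False)
        self.w_noise = nn.Linear(d_model, num_experts, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        clean = self.w_gate(x)                    # (tokens, num_experts)
        noise_std = F.softplus(self.w_noise(x))   # learned per-expert noise scale
        logits = clean + torch.randn_like(clean) * noise_std
        # Keep only the top-k logits per token; mask the rest to -inf
        # so the softmax assigns them exactly zero weight.
        topk_vals, topk_idx = logits.topk(self.k, dim=-1)
        masked = torch.full_like(logits, float("-inf"))
        masked.scatter_(-1, topk_idx, topk_vals)
        return F.softmax(masked, dim=-1)          # sparse gate weights

gate = NoisyTopKGate(d_model=16, num_experts=8, k=2)
weights = gate(torch.randn(4, 16))  # each of the 4 tokens gets 2 nonzero weights

Because the softmax runs over masked logits, each token puts nonzero weight on at most k experts, which is what makes the layer sparsely gated; at inference time the noise term is typically dropped.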
A simple project that helps visualize expert router choices for text generation
Official repository for the paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments" (ICRA 2024), finalist for the Best Paper Award on Human-Robot Interaction
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
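As a rough illustration of the feedforward-expert half of such an architecture (the stick-breaking attention experts are omitted), here is a hedged PyTorch sketch of combining a few expert MLPs with router-supplied gate weights; all names here are hypothetical, and this is not ModuleFormer's actual code.

import torch
import torch.nn as nn

class FeedForwardExperts(nn.Module):
    """A bank of expert MLPs mixed by per-token gate weights (illustrative sketch)."""
    def __init__(self, d_model: int, d_hidden: int, num_experts: int):
        super().__init__()
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor, gates: torch.Tensor) -> torch.Tensor:
        # gates: (tokens, num_experts), sparse weights from a router.
        # Dense loop for clarity; real implementations dispatch only
        # the tokens each expert actually receives.
        out = torch.zeros_like(x)
        for i, expert in enumerate(self.experts):
            out += gates[:, i:i + 1] * expert(x)
        return out

Gate weights like those produced by the top-k sketch above can be fed straight in; an efficient implementation would route tokens to their selected experts instead of evaluating every expert on every token.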