gemma
Here are 90 public repositories matching this topic...
🤖 The free, open-source OpenAI alternative. Self-hosted, community-driven, and local-first. A drop-in replacement for OpenAI that runs on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers, and many more model architectures. Generates text, audio, video, and images, with voice-cloning capabilities.
Updated May 18, 2024 - C++
Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory
Updated May 18, 2024 - Python
Documentation for Google's Gen AI site - including the Gemini API and Gemma
Updated May 18, 2024 - Jupyter Notebook
Firefly: a large language model training tool that supports training Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Updated May 9, 2024 - Python
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you can run inference with any open-source language model, speech-recognition model, or multimodal model, whether in the cloud, on-premises, or even on your laptop.
Updated May 17, 2024 - Python
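The "single line of code" claim above refers to the common pattern of pointing an OpenAI-style client at a local, OpenAI-compatible server. A minimal offline sketch, assuming a hypothetical local endpoint (`http://localhost:9997` is an illustrative placeholder, not a documented default):

```python
# Sketch of the "change one line" idea: an OpenAI-compatible server exposes
# the same /v1/chat/completions endpoint, so only the base URL changes.
import json
from urllib import request


def chat(base_url: str, model: str, prompt: str) -> dict:
    """Build an OpenAI-style chat-completions request for base_url.

    This offline sketch constructs the request but does not send it,
    since it assumes no server is running.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    req = request.Request(  # built but intentionally not sent here
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    return payload


# The base URL is the single line you would change:
#   chat("https://api.openai.com", "gpt-4o", "Hello")
#   chat("http://localhost:9997", "gemma", "Hello")   # hypothetical local server
```

The request body shape follows the OpenAI chat-completions schema, which is what makes such servers drop-in replacements.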
Fully-featured, beautiful web interface for Ollama LLMs - built with NextJS. Deploy with a single click.
Updated Apr 30, 2024 - TypeScript
🚀🚀🚀A collection of some awesome public projects about Large Language Model, Vision Foundation Model and AI Generated Content.
Updated May 9, 2024
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
Updated May 14, 2024 - Python
Collecting data for Building Lucknow's first LLM
Updated May 9, 2024 - Jupyter Notebook
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Updated Mar 15, 2024 - C++
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
Updated Apr 23, 2024 - Python
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
Updated May 17, 2024 - Go
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future -- PRs welcome).
Updated May 18, 2024 - Python
Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B
Updated Feb 25, 2024 - Jupyter Notebook
Train Gemma on TPU/GPU! (Codebase for training the Gemma-Ko series)
Updated Mar 2, 2024 - Python
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
Updated May 18, 2024 - Python