Here are 15 public repositories matching the exllama topic:
Updated
Jun 2, 2024
Python
An OpenAI-like LLaMA inference API
Updated
Sep 17, 2023
Python
ExLlamaV2 nodes for ComfyUI.
Updated
Jun 5, 2024
Python
Booster - open platform for serving LLM models
A Python script designed to streamline the process of quantizing models to exllamav2 format
Updated
May 17, 2024
Python
A simple web server for generating text with exllamav2
Updated
Dec 18, 2023
Python
A Qt GUI for large language models
Updated
Nov 17, 2023
Python
A Qt GUI for large language models
Updated
Dec 27, 2023
Python
A.L.I.C.E (Artificial Labile Intelligence Cybernated Existence). A REST API for an AI companion, for building more complex systems
Updated
Dec 3, 2023
Python
This is a playground to explore the ExLlama project in a Windows environment.
Updated
Jul 20, 2023
PowerShell
A constrained generation filter for local LLMs that makes them quote properly from a source document
Updated
May 14, 2024
Python
Run GGUF LLM models in the latest version of TextGen-webui
Updated
Jun 3, 2024
Jupyter Notebook
Simple LLM inference server
Updated
Feb 20, 2024
Python
JavaScript WebSocket API for ExLlamaV2
Updated
May 13, 2024
JavaScript
A lightweight, fast, parallel inference server for Llama
Updated
Jun 2, 2024
Python