Minimalist web-searching app with an AI assistant that runs directly from your browser. Uses Web-LLM, Ratchet-ML, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Project Summa is an LLM-based text processing system built with Python and Django. It provides a modular and scalable solution for various text processing tasks.
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
Carbon Limiting Auto Tuning for Kubernetes
The open-source serverless GPU container runtime.
Local web UI for large language models. Supports the GGUF format. Runs LLM inference with STT/TTS support and function calling.
Empower Your Productivity with Local AI Assistants
Library to supercharge your use of large language models
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
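For orientation, here is a minimal sketch of OpenVINO's Python inference flow: read an IR model, compile it for a target device, and run inference. The model path and input shape below are placeholders, not part of any real model.

```python
import numpy as np
import openvino as ov

core = ov.Core()
model = core.read_model("model.xml")                # placeholder IR model path
compiled = core.compile_model(model, device_name="CPU")

input_data = np.zeros((1, 3, 224, 224), dtype=np.float32)  # placeholder shape
results = compiled([input_data])                    # dict-like, keyed by output port
```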
Corpus2GPT: A project that lets users train their own GPT models on diverse datasets, including local languages and various corpus types, using Keras with TensorFlow, PyTorch, or JAX backends, and then store or share the trained models.
Leverage tensor parallelism techniques to run large language models in the CPU memory of edge devices.
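To illustrate the idea behind tensor parallelism (a generic NumPy sketch, not this project's API): a weight matrix is split column-wise so each device holds, and multiplies against, only a fraction of the weights; concatenating the partial outputs recovers the full result.

```python
import numpy as np

def column_parallel_matmul(x, weight_shards):
    # Each "device" multiplies against only its shard of the weight matrix,
    # so no single device needs the full matrix in memory.
    return np.concatenate([x @ shard for shard in weight_shards], axis=-1)

rng = np.random.default_rng(0)
x = rng.normal(size=(1, 512))        # activations
w = rng.normal(size=(512, 2048))     # full weight matrix, for comparison only
shards = np.split(w, 4, axis=1)      # 4 devices, each holding a 512x512 slice

assert np.allclose(column_parallel_matmul(x, shards), x @ w)
```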
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.
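The sketch below shows, in generic terms (hypothetical `Node` class, not this project's protocol), how layer-wise sharding divides RAM: each node keeps only its slice of the layers, and activations hop from node to node.

```python
import numpy as np

class Node:
    """Holds only its assigned layers, so per-node RAM drops proportionally."""
    def __init__(self, layers):
        self.layers = layers

    def forward(self, x):
        for w in self.layers:
            x = np.tanh(x @ w)       # stand-in for a transformer block
        return x

rng = np.random.default_rng(0)
all_layers = [0.1 * rng.normal(size=(64, 64)) for _ in range(8)]
nodes = [Node(all_layers[:4]), Node(all_layers[4:])]  # two devices, half the weights each

x = rng.normal(size=(1, 64))
for node in nodes:                   # weights stay put; activations travel
    x = node.forward(x)
```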
Template for RAG Applications
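For readers new to the pattern, a self-contained RAG sketch follows; the toy `embed` function and the `llm` callable are stand-ins for real embedding and generation models.

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    # Toy hashing embedder; a real template would call an embedding model.
    vec = np.zeros(128)
    for token in text.lower().split():
        vec[hash(token) % 128] += 1.0
    norm = np.linalg.norm(vec)
    return vec / norm if norm else vec

documents = [
    "GGUF is a single-file format for quantized language models.",
    "SearXNG is a privacy-respecting metasearch engine.",
]
doc_vectors = np.stack([embed(d) for d in documents])

def retrieve(query: str, k: int = 1) -> list[str]:
    scores = doc_vectors @ embed(query)          # cosine similarity of unit vectors
    return [documents[i] for i in np.argsort(scores)[::-1][:k]]

def answer(query: str, llm=lambda prompt: prompt) -> str:
    # `llm` is a placeholder; swap in any text-completion callable.
    context = "\n".join(retrieve(query))
    return llm(f"Context:\n{context}\n\nQuestion: {query}\nAnswer:")

print(answer("What is GGUF?"))
```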
Miscellaneous code and writings on MLOps
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.