Skip to content
@Trainy-ai

Trainy

Tools to make distributed training easy.

Popular repositories

  1. llm-atc llm-atc Public archive

    Fine-tuning and serving LLMs on any cloud

    Python 82 2

  2. nodify nodify Public

    Profiling tools for distributed training

    HTML 37 3

  3. trainy trainy Public

    A simple Pure Python/PyTorch performance daemon for training workloads

    Python 12 1

  4. dynolog dynolog Public

    Forked from facebookincubator/dynolog

    Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also …

    C++ 1

  5. FastChat FastChat Public

    Forked from lm-sys/FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

    Python

  6. RWKV-LM RWKV-LM Public

    Forked from BlinkDL/RWKV-LM

    RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

    Python

Repositories

Showing 10 of 11 repositories
  • konduktor Public

    cluster/scheduler health monitoring for GPU jobs on k8s

    Python 0 Apache-2.0 0 0 0 Updated May 18, 2024
  • llm-atc Public archive

    Fine-tuning and serving LLMs on any cloud

    Python 82 Apache-2.0 2 1 0 Updated Dec 2, 2023
  • training Public Forked from mlcommons/training

    Reference implementations of MLPerf™ training benchmarks

    Python 0 Apache-2.0 543 0 1 Updated Nov 21, 2023
  • HTML 0 0 0 0 Updated Nov 18, 2023
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 0 Apache-2.0 2,701 0 1 Updated Nov 15, 2023
  • airoboros Public Forked from jondurbin/airoboros

    Customizable implementation of the self-instruct paper.

    Python 0 Apache-2.0 64 0 1 Updated Nov 15, 2023
  • FastChat Public Forked from lm-sys/FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

    Python 0 Apache-2.0 4,392 0 1 Updated Nov 14, 2023
  • RWKV-LM Public Forked from BlinkDL/RWKV-LM

    RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

    Python 0 Apache-2.0 821 0 0 Updated Nov 2, 2023
  • nodify Public

    Profiling tools for distributed training

    HTML 37 3 1 0 Updated Oct 31, 2023
  • trainy Public

    A simple Pure Python/PyTorch performance daemon for training workloads

    Python 12 1 0 0 Updated Aug 2, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…