An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Updated May 17, 2024 - Python
Various LMs/LLMs below 3B parameters (for now), trained with SFT (supervised fine-tuning) for several downstream tasks
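The core mechanic these SFT repos share is computing the language-modeling loss only on the response tokens, not the prompt. A minimal sketch in PyTorch (the `-100` ignore index is the convention used by HuggingFace-style cross-entropy; the token IDs here are made up for illustration):

```python
import torch

def build_sft_labels(input_ids: torch.Tensor, prompt_len: int,
                     ignore_index: int = -100) -> torch.Tensor:
    """Copy input_ids as labels, masking prompt positions so the
    cross-entropy loss is computed only on the response span."""
    labels = input_ids.clone()
    labels[:prompt_len] = ignore_index  # prompt tokens contribute no loss
    return labels

# Toy sequence: 4 prompt tokens followed by 3 response tokens.
ids = torch.tensor([101, 7592, 2088, 102, 3437, 2003, 102])
labels = build_sft_labels(ids, prompt_len=4)
print(labels.tolist())  # [-100, -100, -100, -100, 3437, 2003, 102]
```

Trainers such as HuggingFace `Trainer` then pass these labels straight to the model, which skips the `-100` positions when averaging the loss.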
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
Binary classification of pathological heartbeats from ECG signals using 1D CNNs in PyTorch
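A 1D CNN for beat-level ECG classification typically stacks temporal convolutions over the raw signal and pools down to a two-way classifier. A minimal sketch, with layer sizes and the 187-sample window chosen for illustration (not taken from the repo):

```python
import torch
import torch.nn as nn

class ECG1DCNN(nn.Module):
    """Minimal 1D CNN for binary heartbeat classification.
    Channel counts and kernel sizes are illustrative assumptions."""
    def __init__(self, num_classes: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=7, padding=3),
            nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(16, 32, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),  # collapse the time axis
        )
        self.classifier = nn.Linear(32, num_classes)

    def forward(self, x):
        # x: (batch, 1, signal_length)
        h = self.features(x).squeeze(-1)  # (batch, 32)
        return self.classifier(h)

model = ECG1DCNN()
logits = model(torch.randn(8, 1, 187))  # e.g. fixed-length beat windows
print(logits.shape)  # torch.Size([8, 2])
```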
Knowledge Verification to Nip Hallucination in the Bud
Finetuning Google's Gemma Model for Translating Natural Language into SQL
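Fine-tuning for text-to-SQL mostly comes down to serializing (question, SQL) pairs into a fixed prompt template before tokenization. A sketch of that step; the tag format below is a common convention, not the repo's actual template:

```python
def format_sql_example(question: str, sql: str) -> str:
    """Serialize one NL-to-SQL pair into a single training string.
    The section tags are illustrative assumptions."""
    return (f"### Question:\n{question}\n"
            f"### SQL:\n{sql}")

sample = format_sql_example(
    "How many users signed up in 2023?",
    "SELECT COUNT(*) FROM users WHERE YEAR(signup_date) = 2023;",
)
print(sample)
```

Every training example is pushed through this template, and at inference time the model is prompted with the same format up to `### SQL:` so it completes the query.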
Fine-tune Mistral 7B v1.0 on a custom dataset
Fine-tune a large language model on a mathematics dataset
An LLM challenge to (i) fine-tune a pre-trained HuggingFace transformer model to build a code-generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
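The RAG half of that challenge reduces to: embed the query, rank documents by similarity, and splice the top hit into the LLM prompt. A dependency-free toy sketch using bag-of-words cosine similarity in place of the dense embedding model a real LangChain app would use:

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding'; real RAG pipelines use a dense
    embedding model instead (an assumption simplified away here)."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

docs = [
    "def add(a, b): return a + b  # code generation sample",
    "LangChain chains a retriever with an LLM prompt",
]
context = retrieve("how does LangChain retrieval work", docs)
# The retrieved context is spliced into the generation prompt:
prompt = f"Answer using this context:\n{context[0]}\n\nQ: ..."
print(context[0])
```

In LangChain the same three steps are handled by an embedding model, a vector store retriever, and a chain that injects retrieved context into the prompt.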
This project streamlines the fine-tuning process, enabling you to leverage Llama-2's capabilities for your own projects.
The official implementation of InstructERC
Aligning Large Language Models with Human: A Survey
🦙 Llama 2 7B fine-tuned to revive Rick
A curated collection of open-source SFT datasets, continuously updated