An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
A curated collection of open-source SFT datasets, continuously updated
Aligning Large Language Models with Human: A Survey
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
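The survey above splits knowledge distillation into eliciting knowledge from a teacher and transferring it to a student. As an illustration only (not code from that repository), the core white-box objective is a KL divergence between the teacher's and student's token distributions; a minimal pure-Python sketch:

```python
import math

def softmax(logits, temperature=1.0):
    """Convert raw logits to a probability distribution."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """Forward KL divergence KL(teacher || student) at one token position,
    the standard soft-target objective in white-box distillation. The
    temperature softens both distributions so smaller teacher preferences
    still carry signal."""
    p = softmax(teacher_logits, temperature)  # teacher soft targets
    q = softmax(student_logits, temperature)  # student predictions
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# A student that matches the teacher incurs zero loss; a mismatched one does not.
print(distillation_loss([2.0, 1.0, 0.1], [2.0, 1.0, 0.1]))       # 0.0
print(distillation_loss([2.0, 1.0, 0.1], [0.1, 1.0, 2.0]) > 0)   # True
```

In practice this is summed over all vocabulary positions in a sequence and combined with the usual cross-entropy loss on ground-truth tokens.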
The official implementation of InstructERC
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
Finetuning Google's Gemma Model for Translating Natural Language into SQL
Knowledge Verification to Nip Hallucination in the Bud
Various LMs/LLMs below 3B parameters (for now) trained with SFT (supervised fine-tuning) for several downstream tasks
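Several entries above use supervised fine-tuning. As a technique-level illustration (not taken from any listed repository), the SFT objective is the mean negative log-likelihood of the target tokens, with prompt tokens masked so only the response is penalized:

```python
def sft_loss(token_logprobs, loss_mask):
    """Supervised fine-tuning objective: mean negative log-likelihood of the
    ground-truth next tokens. `token_logprobs` holds the model's log-probability
    for each target token; `loss_mask` is 1 for response tokens and 0 for
    prompt tokens, so gradients flow only through the response."""
    masked = [-lp for lp, m in zip(token_logprobs, loss_mask) if m == 1]
    return sum(masked) / len(masked)

# Toy sequence: 2 prompt tokens (masked out) followed by 3 response tokens.
logprobs = [-0.1, -0.2, -0.5, -1.0, -0.3]
mask = [0, 0, 1, 1, 1]
print(round(sft_loss(logprobs, mask), 4))  # 0.6
```

Frameworks such as the ones listed here compute exactly this quantity in batched tensor form; the masking step is what distinguishes instruction tuning from plain causal-LM pretraining.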
This project streamlines the fine-tuning process, enabling you to leverage Llama-2's capabilities for your own projects.
Binary classification of pathological heartbeats from ECG signals using 1D CNNs in PyTorch
🦙 Llama 2 7B fine-tuned to revive Rick
Fine-tune a large language model on a mathematics dataset
Fine-tune Mistral 7B v1.0 on a custom dataset
An LLM challenge to (i) fine-tune a pre-trained Hugging Face transformer model to build a code-generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
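The RAG step above hinges on retrieval: scoring documents against a query and passing the best matches to the model. A minimal dependency-free sketch (illustrative only; the names `cosine` and `retrieve` are hypothetical, and real pipelines use dense embeddings rather than bag-of-words counts):

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    q = Counter(query.lower().split())
    scored = sorted(documents,
                    key=lambda d: cosine(q, Counter(d.lower().split())),
                    reverse=True)
    return scored[:k]

docs = [
    "Python is a popular language for machine learning",
    "SQL queries retrieve rows from relational tables",
    "LangChain chains prompts, retrievers, and models together",
]
print(retrieve("how do I retrieve rows with SQL", docs))  # SQL document ranks first
```

The retrieved passages are then interpolated into the prompt, which is what lets the generator answer from a corpus it was never fine-tuned on.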