A curated collection of open-source SFT datasets, updated on an ongoing basis.
🦙 Llama 2 7B fine-tuned to revive Rick
Aligning Large Language Models with Human: A Survey
The official implementation of InstructERC.
This project streamlines the fine-tuning process, enabling you to leverage Llama-2's capabilities for your own projects.
An LLM challenge to (i) fine-tune a pre-trained HuggingFace transformer model to build a code-generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain.
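For readers unfamiliar with the second half of that challenge, here is a minimal sketch of the RAG pattern, shown without LangChain so the mechanics are visible. The corpus, query, and embedding model are illustrative assumptions, not anything from the repository above.

```python
# Minimal RAG retrieval step: embed documents, embed the query,
# pick the closest document, and splice it into the generation prompt.
import numpy as np
from sentence_transformers import SentenceTransformer

corpus = [
    "Mistral 7B is a 7-billion-parameter decoder-only language model.",
    "RAG retrieves supporting documents and prepends them to the prompt.",
    "Supervised fine-tuning trains a model on labelled demonstrations.",
]
query = "How does retrieval-augmented generation work?"

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative model choice
doc_vecs = embedder.encode(corpus, normalize_embeddings=True)   # (n_docs, dim)
q_vec = embedder.encode([query], normalize_embeddings=True)[0]  # (dim,)

# Cosine similarity reduces to a dot product on normalised vectors.
scores = doc_vecs @ q_vec
top_doc = corpus[int(np.argmax(scores))]

# The retrieved context is handed to the generator as part of the prompt.
prompt = f"Context:\n{top_doc}\n\nQuestion: {query}\nAnswer:"
print(prompt)
```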
Fine-tune a large language model on a mathematics dataset.
Fine-tune Mistral 7B v1.0 on a custom dataset.
Fine-tuning Google's Gemma model to translate natural language into SQL.
Knowledge Verification to Nip Hallucination in the Bud
Binary classification of pathological heartbeats from ECG signals using 1D CNNs in PyTorch
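As a concrete illustration of that approach, the sketch below shows a 1D CNN binary classifier over fixed-length heartbeats in PyTorch. The 187-sample beat length and the layer sizes are assumptions for illustration, not the repository's actual architecture.

```python
import torch
import torch.nn as nn

class ECGClassifier(nn.Module):
    """1D CNN emitting a single logit per beat (binary classification)."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=7, padding=3),
            nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(16, 32, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.MaxPool1d(2),
        )
        self.head = nn.Sequential(
            nn.AdaptiveAvgPool1d(1),  # pool out the time axis
            nn.Flatten(),
            nn.Linear(32, 1),         # one logit: pathological vs. normal
        )

    def forward(self, x):  # x: (batch, 1, n_samples)
        return self.head(self.features(x))

model = ECGClassifier()
x = torch.randn(8, 1, 187)  # 187 samples/beat is an assumed segment length
logits = model(x)           # (8, 1)
loss = nn.BCEWithLogitsLoss()(logits, torch.randint(0, 2, (8, 1)).float())
```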
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
Various LMs/LLMs below 3B parameters (for now) trained with SFT (supervised fine-tuning) for several downstream tasks.
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
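Toolkit specifics aside, the core of supervised fine-tuning across all of these projects is a plain causal-LM training loop. Below is a minimal sketch using the HuggingFace Trainer; the tiny checkpoint and the prompt template are illustrative assumptions, not XTuner's interface or any repository's actual recipe.

```python
# Minimal SFT: tokenize instruction/response strings and train a causal LM.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_name = "sshleifer/tiny-gpt2"  # tiny model so the sketch runs anywhere
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Toy instruction/response pairs rendered into single training strings;
# the template is a common convention, not a fixed standard.
pairs = [("What is SFT?", "Supervised fine-tuning on labelled demonstrations.")]
texts = [f"### Instruction:\n{q}\n### Response:\n{a}" for q, a in pairs]
ds = Dataset.from_dict({"text": texts}).map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=256),
    remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="sft-out", num_train_epochs=1,
                           per_device_train_batch_size=1, report_to="none"),
    train_dataset=ds,
    # mlm=False makes the collator build next-token-prediction labels.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```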