
Fine Tuning on Custom Data ipynb #87

Closed
samarthsarin opened this issue Mar 30, 2023 · 17 comments
Labels: enhancement, training, gpt4all-training

Comments

@samarthsarin

Can you please provide an ipynb notebook that shows the steps for fine-tuning this model on custom data?


@daleevans

The entire process is just:

- make your data in the same JSONL format you get when you download the standard data
- edit configs/train/finetune_lora.yaml to point at your new data file and set up your wandb/Hugging Face account info
- possibly edit configs/deepspeed/ds_config.json depending on your local GPU/CPU/memory (batch sizes, and maybe set stage3_gather_16bit_weights_on_model_save and CPU offload)
- run train.py

If you don't have wandb or Hugging Face set up, you may need to comment out some lines in train.py. Rough sketches of each step follow.
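For the data, each line of the JSONL file is one training record. A minimal sketch, assuming the prompt/response field names used in the published gpt4all training data (verify against your downloaded copy; the source field may be optional):

```jsonl
{"prompt": "What is the capital of France?", "response": "The capital of France is Paris.", "source": "custom"}
{"prompt": "Summarize: The quick brown fox jumps over the lazy dog.", "response": "A fox jumps over a dog.", "source": "custom"}
```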
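The yaml edit is just value changes. A sketch with illustrative key names only (dataset_path, wandb_entity, and save_name here are my assumptions, not confirmed names; keep whatever keys your copy of configs/train/finetune_lora.yaml actually has and just change the values):

```yaml
# Illustrative only -- match these to the keys already present in finetune_lora.yaml.
dataset_path: "data/my_custom_data.jsonl"    # point at your new data file
wandb: true                                  # set false if you skip wandb
wandb_entity: "your-wandb-entity"
wandb_project_name: "gpt4all-custom-finetune"
save_name: "your-hf-username/gpt4all-custom" # where the finetuned model gets saved/pushed
```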
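On the DeepSpeed side, the knobs mentioned above are standard DeepSpeed config options. A sketch of just those fields, to be merged into the existing configs/deepspeed/ds_config.json rather than replacing it; the values are placeholders to tune against your hardware:

```json
{
  "train_micro_batch_size_per_gpu": 1,
  "gradient_accumulation_steps": 8,
  "zero_optimization": {
    "stage": 3,
    "offload_optimizer": { "device": "cpu" },
    "offload_param": { "device": "cpu" },
    "stage3_gather_16bit_weights_on_model_save": true
  }
}
```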
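Finally, launching the run. Whether train.py takes a --config flag is an assumption on my part (check its argument parsing before running), but with an accelerate + DeepSpeed setup the invocation would look something like:

```sh
# Sketch only: the --config flag on train.py is an assumption, verify it first.
accelerate launch --use_deepspeed \
  --deepspeed_config_file=configs/deepspeed/ds_config.json \
  train.py --config configs/train/finetune_lora.yaml
```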

@samarthsarin
Author

I got your point. Still, for better documentation, I would be really grateful if a Jupyter notebook could be provided, as the majority of the audience here is looking for exactly this kind of fine-tuning code for gpt4all.

Thank you

@magedhelmy1
Copy link

Hi @zanussbaum, any advice on how to move forward with this?


@Dineshk011287

> I got your point. Still, for better documentation, I would be really grateful if a Jupyter notebook could be provided, as the majority of the audience here is looking for exactly this kind of fine-tuning code for gpt4all.

+1
I am also looking for this; any documentation or Jupyter notebook would definitely help.

@niansa added the enhancement, training, and gpt4all-training labels on Aug 10, 2023
@cebtenzzre
Member

Closing this issue as stale. A lot has changed since Nomic last trained a text completion model.

@cebtenzzre closed this as not planned on May 9, 2024