
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 394.00 MiB #23

Open
shifu-learner opened this issue Jan 16, 2024 · 3 comments


@shifu-learner

Hello,
I am trying to finetune GPT-J-6B. I followed the instructions provided in the documentation, but I get this error.

I tried setting batch_size=1 and gradient_accumulation_steps=4.

Any idea how I can solve this?
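
(For readers hitting the same error: a minimal sketch of memory-saving settings, assuming the finetuning script is built on the Hugging Face Trainer; the output path is illustrative. Beyond a small batch size and gradient accumulation, gradient checkpointing and fp16 cut memory use substantially.)

```python
# Minimal sketch, assuming a Hugging Face Trainer-based finetuning script.
# output_dir is a hypothetical path.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="gptj-finetuned",
    per_device_train_batch_size=1,   # smallest possible micro-batch
    gradient_accumulation_steps=4,   # effective batch size = 1 * 4
    gradient_checkpointing=True,     # recompute activations to save memory
    fp16=True,                       # half-precision activations/gradients
)
```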

@mallorbc (Owner)

Need more information. How are you running it? What is your hardware?

@shifu-learner (Author)

Sure.
I am trying to run it in a VM with an NVIDIA Tesla GPU and the NVIDIA driver installed.
I have followed the documentation to finetune GPT-J-6B.

@shavingtonpitsos

It sounds like your GPU is not large enough. How much GPU RAM is available to you when you run your script? You likely have too little GPU RAM, probably less than 16 GB.
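
(A quick way to check from Python, using standard PyTorch calls; `nvidia-smi` on the command line reports the same numbers:)

```python
# Report total and currently free memory on GPU 0 (standard PyTorch API).
import torch

props = torch.cuda.get_device_properties(0)
free, total = torch.cuda.mem_get_info(0)  # both values in bytes
print(f"{props.name}: {free / 1e9:.1f} GB free of {total / 1e9:.1f} GB total")
```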
