
Error when running Gemma inference on GPU #47

Open
LarryHawkingYoung opened this issue Mar 18, 2024 · 1 comment
Labels
stat:awaiting response (Status - Awaiting response from author), type:support (Support issues)

Comments

@LarryHawkingYoung

When I run

docker run -t --rm \
    --gpus all \
    -v ${CKPT_PATH}:/tmp/ckpt \
    ${DOCKER_URI} \
    python scripts/run.py \
    --device=cuda \
    --ckpt=/tmp/ckpt \
    --variant="${VARIANT}" \
    --prompt="${PROMPT}"

It returns the error:

docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]].
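For what it's worth, this particular Docker error usually indicates that the NVIDIA container runtime is not installed or not registered with the Docker daemon, rather than anything in the Gemma script itself. A common sanity check (this is my own suggestion, not from this repo; it assumes an NVIDIA GPU, and the CUDA image tag is just an example):

```shell
# If this fails with the same "could not select device driver" error,
# install/configure the NVIDIA Container Toolkit and restart the Docker
# daemon before retrying the Gemma command.
docker run --rm --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi
```

If `nvidia-smi` prints the GPU table here, the runtime is fine and the problem lies elsewhere.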

But if I run on CPU with this command:

docker run -t --rm \
    -v ${CKPT_PATH}:/tmp/ckpt \
    ${DOCKER_URI} \
    python scripts/run.py \
    --ckpt=/tmp/ckpt \
    --variant="${VARIANT}" \
    --prompt="${PROMPT}"

it works fine.

@pengchongjin
Collaborator

Which model variant and which GPU did you use?

One guess is that you may be running out of GPU memory if you try to run the 7B un-quantized model on a 16 GB GPU. Try the 7B quantized model or a 2B model instead; either should work.
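A rough back-of-envelope estimate of the weight memory shows why the 7B un-quantized model is tight on a 16 GB card (this sketch is mine, not from the Gemma repo; it assumes bfloat16 weights at 2 bytes per parameter and ignores activations and the KV cache):

```python
def weight_memory_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory needed to hold the model weights alone, in GiB.

    Defaults to 2 bytes/param (bfloat16/float16); real usage is higher
    because of activations, the KV cache, and framework overhead.
    """
    return num_params * bytes_per_param / (1024 ** 3)

print(f"7B bf16: {weight_memory_gib(7e9):.1f} GiB")    # ~13.0 GiB, tight on 16 GB
print(f"7B int8: {weight_memory_gib(7e9, 1):.1f} GiB") # ~6.5 GiB
print(f"2B bf16: {weight_memory_gib(2e9):.1f} GiB")    # ~3.7 GiB
```

So weights alone for un-quantized 7B already consume most of a 16 GB GPU, which is consistent with the out-of-memory guess above.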

@tilakrayal added the type:support and stat:awaiting response labels (and removed question) on Apr 24, 2024
3 participants