
Revised run command for CUDA testing #29

Open
DaveSprague opened this issue Mar 27, 2024 · 2 comments

Comments

@DaveSprague

Under the Run on other devices section, you have the command as:

python run_benchmark.py --include_mps=False --include_mlx_gpu=False --include_mlx_cpu=False --include_cuda=True --include_cpu=True

but it also needs to turn off the --include_mlx_gpu_compile option, so the correct command would be:

python run_benchmark.py --include_mps=False --include_mlx_gpu=False --include_mlx_gpu_compile=False --include_mlx_cpu=False --include_cuda=True --include_cpu=True
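
As a quick sanity check before running the benchmark on a CUDA machine, a one-liner like the following (just an illustration, not part of the repo) confirms that PyTorch can actually see the GPU:

python -c "import torch; print(torch.cuda.is_available())"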

@DaveSprague (Author) commented Mar 27, 2024

Also, in run_benchmark.py around line 126, you need to replace:

if backend in ["mps", "cuda"]:
        torch.mps.empty_cache()
        torch.cuda.empty_cache()

with

if backend == "cuda":
        torch.cuda.empty_cache()

if backend == "mps":
        torch.mps.empty_cache()

On a CUDA machine, the current cache-clearing code fails because torch.mps.empty_cache() is not available in the CUDA build of PyTorch.
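
An alternative sketch of the same idea (an illustration, not code from the repo, and assuming a PyTorch version where torch.mps.empty_cache() and torch.backends.mps.is_available() exist) would guard each call with an availability check:

import torch

def empty_backend_cache(backend: str) -> None:
    # Only call the cache-clearing function that exists for the active backend.
    if backend == "cuda" and torch.cuda.is_available():
        torch.cuda.empty_cache()
    elif backend == "mps" and torch.backends.mps.is_available():
        torch.mps.empty_cache()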

@TristanBilot (Owner)
Thanks @DaveSprague for reporting this issue, it was indeed an important error!
