Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What version of CUDA, cuDNN are you using? #60

Open
Yanomizu opened this issue Nov 3, 2022 · 6 comments
Open

What version of CUDA, cuDNN are you using? #60

Yanomizu opened this issue Nov 3, 2022 · 6 comments

Comments

@Yanomizu
Copy link

Yanomizu commented Nov 3, 2022

Hello.

I am now trying to replicate your research.
However, I am having trouble with an error when I try to run the training.
エラーgit
I believe the cause is in the version of CUDA, cuDNN.
So I would like to know what those are if you were able to run the training.
Also, if my guess is wrong, I would like to know the cause.

Thanks.

@BoboRamirez
Copy link

Hello, I am suffering from exactly the same problem. Did you figure it out? Here is my config (which didn't work):
image

@SatyaDewangan05
Copy link

The problem is with torch1.4.0, I tried to solve it by downgrading and upgrading cuda and cudnn but the problem was the same. So, I upgraded the torch1.4.0 > torch1.7.1 then the problem is solved.

pip install torch==1.7.1+cu101 torchvision==0.8.2+cu101 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html
pip install --pre torch torchvision -f https://download.pytorch.org/whl/nightly/cu110/torch_nightly.html -U
Try this, then you should not see the error.

The code may look stuck like below but it'll be loading and processing the data.
Screenshot_272

To check that you can add for x in tqdm.tqdm(dataloader) instead of for x in dataloader on line 50 in train_avd.py

@BoboRamirez
Copy link

The problem is with torch1.4.0, I tried to solve it by downgrading and upgrading cuda and cudnn but the problem was the same. So, I upgraded the torch1.4.0 > torch1.7.1 then the problem is solved.

pip install torch==1.7.1+cu101 torchvision==0.8.2+cu101 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html pip install --pre torch torchvision -f https://download.pytorch.org/whl/nightly/cu110/torch_nightly.html -U Try this, then you should not see the error.

The code may look stuck like below but it'll be loading and processing the data. Screenshot_272

To check that you can add for x in tqdm.tqdm(dataloader) instead of for x in dataloader on line 50 in train_avd.py

Much appreciated! I'll give it a go right away!

@SatyaDewangan05
Copy link

The problem is with torch1.4.0, I tried to solve it by downgrading and upgrading cuda and cudnn but the problem was the same. So, I upgraded the torch1.4.0 > torch1.7.1 then the problem is solved.
pip install torch==1.7.1+cu101 torchvision==0.8.2+cu101 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html pip install --pre torch torchvision -f https://download.pytorch.org/whl/nightly/cu110/torch_nightly.html -U Try this, then you should not see the error.
The code may look stuck like below but it'll be loading and processing the data. Screenshot_272
To check that you can add for x in tqdm.tqdm(dataloader) instead of for x in dataloader on line 50 in train_avd.py

Much appreciated! I'll give it a go right away!

Just let me know buddy if it is working for you or not, maybe be I must have done few more things that made it work.

@BoboRamirez
Copy link

The problem is with torch1.4.0, I tried to solve it by downgrading and upgrading cuda and cudnn but the problem was the same. So, I upgraded the torch1.4.0 > torch1.7.1 then the problem is solved.
pip install torch==1.7.1+cu101 torchvision==0.8.2+cu101 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html pip install --pre torch torchvision -f https://download.pytorch.org/whl/nightly/cu110/torch_nightly.html -U Try this, then you should not see the error.
The code may look stuck like below but it'll be loading and processing the data. Screenshot_272
To check that you can add for x in tqdm.tqdm(dataloader) instead of for x in dataloader on line 50 in train_avd.py

Much appreciated! I'll give it a go right away!

Just let me know buddy if it is working for you or not, maybe be I must have done few more things that made it work.

Long story short, no, it didn't work out. I ran into another cuda runtime error. But again, I really appreciate your kindness!
P.S. I have decided to move on to other works days ago. since I suspect that this bug has something to do with my hardware, I am looking for "fresher" works to work on. XD

@Lier-Accuser
Copy link

The problem is with torch1.4.0, I tried to solve it by downgrading and upgrading cuda and cudnn but the problem was the same. So, I upgraded the torch1.4.0 > torch1.7.1 then the problem is solved.
Try this, then you should not see the error.
The code may look stuck like below but it'll be loading and processing the data. Screenshot_272
To check that you can add instead of on line 50 in train_avd.pypip install torch==1.7.1+cu101 torchvision==0.8.2+cu101 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html``pip install --pre torch torchvision -f https://download.pytorch.org/whl/nightly/cu110/torch_nightly.html -U``for x in tqdm.tqdm(dataloader)``for x in dataloader

Much appreciated! I'll give it a go right away!

Just let me know buddy if it is working for you or not, maybe be I must have done few more things that made it work.

Hello, I tried you advice above, but it still can not run. Here are the errors.
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants