Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Install脚本对torch版本的调整导致部分环境无法正常启动,出现undefined symbol: ncclCommRegister #407

Open
shanshouchen opened this issue Apr 7, 2024 · 1 comment

Comments

@shanshouchen
Copy link

shanshouchen commented Apr 7, 2024

commit: ab15288
这个错误应该是对cuda的版本依赖有关系
报错如图:
4871712454176_ pic


补充问题:
我们使用老版本的时候,多GPU无法充分的利用,不知道作者的这次修改是不是从解决多GPU利用率的问题出发的?

@shanshouchen
Copy link
Author

回退到Torch 2.0.1+cu11就可以正常工作了
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant