GPU Det and Directsum are ridiculously slow #485

Open
jeffry1829 opened this issue Oct 1, 2024 · 3 comments
Comments

@jeffry1829
Collaborator

GPU Det and Directsum are ridiculously slow

Det uses the cuSOLVER ?getrf (LU factorization) routines.

Currently not sure whether this only happens to these two methods
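
For readers unfamiliar with how a determinant comes out of a ?getrf-style routine, here is a minimal pure-Python sketch of the idea: factor the matrix with partially pivoted LU, then multiply the diagonal of U and flip the sign once per row swap. The function name `det_via_lu` is hypothetical and this is not the library's implementation; it only illustrates the arithmetic that a GPU Det built on LU factorization would be expected to perform.

```python
def det_via_lu(matrix):
    """Determinant of a square matrix via LU factorization with partial pivoting."""
    a = [[float(x) for x in row] for row in matrix]  # work on a copy
    n = len(a)
    sign = 1.0
    for k in range(n):
        # Partial pivoting: choose the row with the largest |a[i][k]| for i >= k.
        pivot = max(range(k, n), key=lambda i: abs(a[i][k]))
        if a[pivot][k] == 0.0:
            return 0.0  # whole column is zero, so the matrix is singular
        if pivot != k:
            a[k], a[pivot] = a[pivot], a[k]
            sign = -sign  # each row swap flips the sign of the determinant
        # Eliminate entries below the pivot; only the U part is needed here.
        for i in range(k + 1, n):
            factor = a[i][k] / a[k][k]
            for j in range(k + 1, n):
                a[i][j] -= factor * a[k][j]
    det = sign
    for k in range(n):
        det *= a[k][k]  # product of the diagonal of U
    return det


if __name__ == "__main__":
    print(det_via_lu([[1, 2], [3, 4]]))
```

The factorization costs on the order of n^3 operations and the final product only n, so for a Det built this way the LU step should dominate any timing.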

@IvanaGyro
Collaborator

What did you compare them against, the CPU version? How large is the input tensor? To find the cause, maybe the NVIDIA profiler can help.
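
A comparison like the one asked about could be set up with a small timing helper such as the sketch below. Everything here is an assumption for illustration: `time_call` and its `sync` hook are hypothetical names, not part of any library. The hook is there because GPU calls can return before the device has finished, so a fair GPU measurement would pass whatever device-synchronization call the library in use provides.

```python
import time


def time_call(fn, *args, repeats=5, warmup=1, sync=None):
    """Return the best wall-clock time in seconds over `repeats` runs of fn(*args)."""
    # Warm-up runs absorb one-time costs (allocation, library initialization).
    for _ in range(warmup):
        fn(*args)
    if sync is not None:
        sync()  # make sure warm-up work has finished before timing starts
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        if sync is not None:
            sync()  # wait for asynchronous work so it is included in the timing
        best = min(best, time.perf_counter() - start)
    return best


if __name__ == "__main__":
    data = list(range(100_000))
    print(f"sum over {len(data)} ints: {time_call(sum, data):.6f} s")
```

Running the same helper on the CPU and GPU code paths for several tensor sizes would answer both questions at once: what the baseline is, and at which size the slowdown appears.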

@yingjerkao
Collaborator

> GPU Det and Directsum are ridiculously slow
>
> Det uses cusolver ?getrf
>
> Currently not sure whether this only happens to these two methods

Are you benchmarking against the CPU version, or against the old MAGMA version?

@yingjerkao
Collaborator

I believe this issue occurred because our DGX II was hacked. We should rerun the benchmark on other machines.
