Handle wide matrices in 2D kernels gracefully. #110

Open
benson31 opened this issue Sep 29, 2020 · 2 comments

@benson31
Collaborator

CUDA has a maximum .y and .z grid dimension of 65535. We should handle cases where the local matrix is wider than that.

@timmoon10
Collaborator

And if we want to be extra careful, the max .x dimension is 2147483647.

@benson31
Collaborator Author

Now that I think about it, to be technically precise, these limits are a property of the device and its compute capability, IIRC. So we should query them during initialization and use the dynamically determined maximum values for the best portability.
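
For illustration only, here is a minimal host-side sketch of one way to handle this: split the grid blocks needed along .y into several launches, each within the limit. The names `GridChunk` and `split_grid_dim` are hypothetical and not part of this project. The limit is passed in as a parameter, so it could come from a value queried at initialization (for example `cudaDeviceProp::maxGridSize[1]`) rather than a hard-coded constant.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical description of one kernel launch: it covers grid blocks
// [block_offset, block_offset + num_blocks) along a single grid dimension.
struct GridChunk
{
    std::size_t block_offset;
    std::size_t num_blocks;
};

// Split `total_blocks` into chunks of at most `max_grid_dim` blocks each.
// `max_grid_dim` is assumed to be the device limit for that dimension.
std::vector<GridChunk> split_grid_dim(std::size_t total_blocks,
                                      std::size_t max_grid_dim)
{
    std::vector<GridChunk> chunks;
    if (max_grid_dim == 0)
        return chunks; // No valid launch configuration exists.

    for (std::size_t offset = 0; offset < total_blocks; offset += max_grid_dim)
    {
        // The last chunk may be smaller than the limit.
        chunks.push_back({offset, std::min(max_grid_dim, total_blocks - offset)});
    }
    return chunks;
}
```

Each chunk would correspond to one launch with `gridDim.y = num_blocks`, and the kernel would add `block_offset` to `blockIdx.y` when computing its column index. A grid-stride loop inside the kernel would be an alternative that avoids multiple launches.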
