-
Notifications
You must be signed in to change notification settings - Fork 178
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[STF] Do not keep track of dangling events in a CUDA graph backend #3327
Merged
caugonnet
merged 5 commits into
NVIDIA:main
from
caugonnet:stf_graph_no_dangling_events
Jan 13, 2025
Merged
[STF] Do not keep track of dangling events in a CUDA graph backend #3327
caugonnet
merged 5 commits into
NVIDIA:main
from
caugonnet:stf_graph_no_dangling_events
Jan 13, 2025
+56
−8
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
… done when the CUDA graph completes. Therefore keeping track of "dangling events" is a waste of time and resources.
/ok to test |
caugonnet
commented
Jan 10, 2025
* @brief Indicate if the backend needs to keep track of dangling events, or if these will be automatically | ||
* synchronized | ||
*/ | ||
virtual bool can_ignore_dangling_events() const = 0; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
instead of keeping track if we can ignore them, we should see if we need them to avoid !no_dangling ...
…ds to more readable code
🟨 CI finished in 31m 42s: Pass: 90%/20 | Total: 4h 07m | Avg: 12m 21s | Max: 15m 44s | Hits: 582%/312
|
Project | |
---|---|
CCCL Infrastructure | |
libcu++ | |
CUB | |
Thrust | |
+/- | CUDA Experimental |
python | |
CCCL C Parallel Library | |
Catch2Helper |
Modifications in project or dependencies?
Project | |
---|---|
CCCL Infrastructure | |
libcu++ | |
CUB | |
Thrust | |
+/- | CUDA Experimental |
python | |
CCCL C Parallel Library | |
Catch2Helper |
🏃 Runner counts (total jobs: 20)
# | Runner |
---|---|
12 | linux-amd64-cpu16 |
4 | linux-arm64-cpu16 |
2 | windows-amd64-cpu16 |
2 | linux-amd64-gpu-v100-latest-1 |
/ok to test |
miscco
approved these changes
Jan 13, 2025
🟨 CI finished in 33m 52s: Pass: 90%/20 | Total: 4h 13m | Avg: 12m 39s | Max: 18m 03s | Hits: 574%/312
|
Project | |
---|---|
CCCL Infrastructure | |
libcu++ | |
CUB | |
Thrust | |
+/- | CUDA Experimental |
python | |
CCCL C Parallel Library | |
Catch2Helper |
Modifications in project or dependencies?
Project | |
---|---|
CCCL Infrastructure | |
libcu++ | |
CUB | |
Thrust | |
+/- | CUDA Experimental |
python | |
CCCL C Parallel Library | |
Catch2Helper |
🏃 Runner counts (total jobs: 20)
# | Runner |
---|---|
12 | linux-amd64-cpu16 |
4 | linux-arm64-cpu16 |
2 | windows-amd64-cpu16 |
2 | linux-amd64-gpu-v100-latest-1 |
…t operations that were producing these events !
/ok to test |
/ok to test |
🟩 CI finished in 24m 42s: Pass: 100%/20 | Total: 1h 51m | Avg: 5m 35s | Max: 20m 24s | Hits: 582%/312
|
Project | |
---|---|
CCCL Infrastructure | |
libcu++ | |
CUB | |
Thrust | |
+/- | CUDA Experimental |
python | |
CCCL C Parallel Library | |
Catch2Helper |
Modifications in project or dependencies?
Project | |
---|---|
CCCL Infrastructure | |
libcu++ | |
CUB | |
Thrust | |
+/- | CUDA Experimental |
python | |
CCCL C Parallel Library | |
Catch2Helper |
🏃 Runner counts (total jobs: 20)
# | Runner |
---|---|
12 | linux-amd64-cpu16 |
4 | linux-arm64-cpu16 |
2 | windows-amd64-cpu16 |
2 | linux-amd64-gpu-v100-latest-1 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
Unlike the CUDA stream backend, nodes in a CUDA graph are necessarily done when the CUDA graph completes. Therefore keeping track of "dangling events" is a waste of time and resources.
closes
Checklist