Use cuda::std::addressof in Thrust #3363

Merged 1 commit on Jan 13, 2025

Conversation

bernhardmgruber
Contributor

No description provided.

@bernhardmgruber bernhardmgruber requested review from a team as code owners January 13, 2025 00:45
@bernhardmgruber bernhardmgruber added the thrust For all items related to Thrust. label Jan 13, 2025

🟩 CI finished in 1h 26m: Pass: 100%/78 | Total: 16h 03m | Avg: 12m 21s | Max: 37m 26s | Hits: 410%/12368
  • 🟩 cub: Pass: 100%/38 | Total: 6h 46m | Avg: 10m 41s | Max: 30m 47s | Hits: 599%/3108

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  6h 36m | Avg: 11m 01s | Max: 30m 47s | Hits: 599%/3108  
      🟩 arm64              Pass: 100%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 50s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 49m 11s | Avg:  9m 50s | Max: 28m 32s | Hits: 599%/777   
      🟩 12.5               Pass: 100%/2   | Total: 18m 07s | Avg:  9m 03s | Max:  9m 10s
      🟩 12.6               Pass: 100%/31  | Total:  5h 38m | Avg: 10m 55s | Max: 30m 47s | Hits: 599%/2331  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 42s | Avg:  4m 21s | Max:  4m 29s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 49m 11s | Avg:  9m 50s | Max: 28m 32s | Hits: 599%/777   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 07s | Avg:  9m 03s | Max:  9m 10s
      🟩 nvcc12.6           Pass: 100%/29  | Total:  5h 30m | Avg: 11m 23s | Max: 30m 47s | Hits: 599%/2331  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 42s | Avg:  4m 21s | Max:  4m 29s
      🟩 nvcc               Pass: 100%/36  | Total:  6h 37m | Avg: 11m 02s | Max: 30m 47s | Hits: 599%/3108  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 58s | Avg:  5m 14s | Max:  5m 26s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 10m | Avg: 10m 05s | Max: 27m 58s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 17s | Avg:  5m 08s | Max:  5m 16s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 28s | Avg:  5m 28s | Max:  5m 28s
      🟩 GCC9               Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 28s
      🟩 GCC10              Pass: 100%/1   | Total:  5m 32s | Avg:  5m 32s | Max:  5m 32s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 18s | Avg:  5m 18s | Max:  5m 18s
      🟩 GCC12              Pass: 100%/3   | Total: 26m 17s | Avg:  8m 45s | Max: 16m 02s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 37m | Avg: 12m 13s | Max: 27m 40s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 58m 25s | Avg: 29m 12s | Max: 29m 53s | Hits: 599%/1554  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 10s | Max: 30m 47s | Hits: 599%/1554  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 07s | Avg:  9m 03s | Max:  9m 10s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  1h 47m | Avg:  7m 41s | Max: 27m 58s
      🟩 GCC                Pass: 100%/18  | Total:  2h 41m | Avg:  8m 58s | Max: 27m 40s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 58m | Avg: 29m 41s | Max: 30m 47s | Hits: 599%/3108  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 07s | Avg:  9m 03s | Max:  9m 10s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 20m 21s | Avg: 10m 10s | Max: 16m 02s
      🟩 v100               Pass: 100%/36  | Total:  6h 25m | Avg: 10m 42s | Max: 30m 47s | Hits: 599%/3108  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  4h 26m | Avg:  8m 35s | Max: 30m 47s | Hits: 599%/3108  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 17m 05s | Avg: 17m 05s | Max: 17m 05s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 22s | Avg: 15m 22s | Max: 15m 22s
      🟩 HostLaunch         Pass: 100%/3   | Total: 51m 50s | Avg: 17m 16s | Max: 18m 27s
      🟩 TestGPU            Pass: 100%/2   | Total: 55m 38s | Avg: 27m 49s | Max: 27m 58s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 21s | Avg: 10m 10s | Max: 16m 02s
      🟩 90a                Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  2h 29m | Avg: 10m 38s | Max: 29m 53s | Hits: 599%/2331  
      🟩 20                 Pass: 100%/24  | Total:  4h 17m | Avg: 10m 42s | Max: 30m 47s | Hits: 599%/777   
    
  • 🟩 thrust: Pass: 100%/37 | Total: 8h 43m | Avg: 14m 08s | Max: 37m 26s | Hits: 347%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 05s | Avg: 10m 32s | Max: 10m 53s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  8h 23m | Avg: 14m 22s | Max: 37m 26s | Hits: 347%/9260  
      🟩 arm64              Pass: 100%/2   | Total: 20m 06s | Avg: 10m 03s | Max: 10m 58s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 12m | Avg: 14m 30s | Max: 33m 01s | Hits: 343%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 43m 30s | Avg: 21m 45s | Max: 22m 48s
      🟩 12.6               Pass: 100%/30  | Total:  6h 47m | Avg: 13m 34s | Max: 37m 26s | Hits: 348%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 20m 56s | Avg: 10m 28s | Max: 10m 30s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 12m | Avg: 14m 30s | Max: 33m 01s | Hits: 343%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 43m 30s | Avg: 21m 45s | Max: 22m 48s
      🟩 nvcc12.6           Pass: 100%/28  | Total:  6h 26m | Avg: 13m 47s | Max: 37m 26s | Hits: 348%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 20m 56s | Avg: 10m 28s | Max: 10m 30s
      🟩 nvcc               Pass: 100%/35  | Total:  8h 22m | Avg: 14m 20s | Max: 37m 26s | Hits: 347%/9260  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 38m 17s | Avg:  9m 34s | Max:  9m 43s
      🟩 Clang15            Pass: 100%/1   | Total:  9m 54s | Avg:  9m 54s | Max:  9m 54s
      🟩 Clang16            Pass: 100%/1   | Total: 10m 39s | Avg: 10m 39s | Max: 10m 39s
      🟩 Clang17            Pass: 100%/1   | Total: 10m 58s | Avg: 10m 58s | Max: 10m 58s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 11m | Avg: 10m 10s | Max: 13m 09s
      🟩 GCC7               Pass: 100%/2   | Total: 20m 18s | Avg: 10m 09s | Max: 10m 38s
      🟩 GCC8               Pass: 100%/1   | Total: 11m 01s | Avg: 11m 01s | Max: 11m 01s
      🟩 GCC9               Pass: 100%/2   | Total: 21m 02s | Avg: 10m 31s | Max: 11m 11s
      🟩 GCC10              Pass: 100%/1   | Total: 11m 21s | Avg: 11m 21s | Max: 11m 21s
      🟩 GCC11              Pass: 100%/1   | Total: 11m 08s | Avg: 11m 08s | Max: 11m 08s
      🟩 GCC12              Pass: 100%/1   | Total: 11m 45s | Avg: 11m 45s | Max: 11m 45s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 20m | Avg: 10m 06s | Max: 11m 41s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 59s | Max: 33m 01s | Hits: 343%/3704  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 47m | Avg: 35m 44s | Max: 37m 26s | Hits: 350%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 43m 30s | Avg: 21m 45s | Max: 22m 48s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  2h 20m | Avg: 10m 04s | Max: 13m 09s
      🟩 GCC                Pass: 100%/16  | Total:  2h 47m | Avg: 10m 28s | Max: 11m 45s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 51m | Avg: 34m 14s | Max: 37m 26s | Hits: 347%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total: 43m 30s | Avg: 21m 45s | Max: 22m 48s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  8h 43m | Avg: 14m 08s | Max: 37m 26s | Hits: 347%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  7h 15m | Avg: 14m 02s | Max: 35m 39s | Hits: 343%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 53m 11s | Avg: 17m 43s | Max: 37m 26s | Hits: 365%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 34m 51s | Avg: 11m 37s | Max: 13m 09s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  6m 49s | Avg:  6m 49s | Max:  6m 49s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  3h 44m | Avg: 16m 01s | Max: 35m 39s | Hits: 343%/5556  
      🟩 20                 Pass: 100%/21  | Total:  4h 37m | Avg: 13m 13s | Max: 37m 26s | Hits: 354%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 53s | Avg: 4m 26s | Max: 6m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  6m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  6m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  6m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  6m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  6m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  6m 51s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  6m 51s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
      🟩 Test               Pass: 100%/1   | Total:  6m 51s | Avg:  6m 51s | Max:  6m 51s
    
  • 🟩 python: Pass: 100%/1 | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 25m 37s | Avg: 25m 37s | Max: 25m 37s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 78)

# Runner
53 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@miscco miscco merged commit b5b6e03 into NVIDIA:main Jan 13, 2025
94 checks passed
@bernhardmgruber bernhardmgruber deleted the addressof branch January 13, 2025 08:17
Labels
thrust For all items related to Thrust.
Projects
Status: Done

2 participants