Backport to 2.8: Deprecate block/warp algo specializations (#3455) #3481

Merged

bernhardmgruber
Contributor

No description provided.

Contributor

🟩 CI finished in 2h 07m: Pass: 100%/96 | Total: 2d 18h | Avg: 41m 29s | Max: 1h 16m | Hits: 215%/12392
  • 🟩 cub: Pass: 100%/47 | Total: 1d 16h | Avg: 51m 04s | Max: 1h 16m | Hits: 206%/3132

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 14h | Avg: 50m 46s | Max:  1h 16m | Hits: 206%/3132  
      🟩 arm64              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 51s | Max: 58m 38s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  5h 49m | Avg: 49m 58s | Max:  1h 01m | Hits: 206%/783   
      🟩 12.5               Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 09m
      🟩 12.6               Pass: 100%/38  | Total:  1d 07h | Avg: 50m 28s | Max:  1h 16m | Hits: 206%/2349  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 04m
      🟩 nvcc11.1           Pass: 100%/7   | Total:  5h 49m | Avg: 49m 58s | Max:  1h 01m | Hits: 206%/783   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 09m
      🟩 nvcc12.6           Pass: 100%/36  | Total:  1d 05h | Avg: 49m 44s | Max:  1h 16m | Hits: 206%/2349  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 04m
      🟩 nvcc               Pass: 100%/45  | Total:  1d 13h | Avg: 50m 30s | Max:  1h 16m | Hits: 206%/3132  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  3h 35m | Avg: 53m 55s | Max:  1h 00m
      🟩 Clang10            Pass: 100%/1   | Total: 57m 55s | Avg: 57m 55s | Max: 57m 55s
      🟩 Clang11            Pass: 100%/1   | Total: 55m 23s | Avg: 55m 23s | Max: 55m 23s
      🟩 Clang12            Pass: 100%/1   | Total: 55m 36s | Avg: 55m 36s | Max: 55m 36s
      🟩 Clang13            Pass: 100%/1   | Total: 56m 00s | Avg: 56m 00s | Max: 56m 00s
      🟩 Clang14            Pass: 100%/1   | Total: 57m 52s | Avg: 57m 52s | Max: 57m 52s
      🟩 Clang15            Pass: 100%/1   | Total: 54m 53s | Avg: 54m 53s | Max: 54m 53s
      🟩 Clang16            Pass: 100%/1   | Total: 58m 32s | Avg: 58m 32s | Max: 58m 32s
      🟩 Clang17            Pass: 100%/1   | Total: 56m 36s | Avg: 56m 36s | Max: 56m 36s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 49m | Avg: 49m 53s | Max:  1h 04m
      🟩 GCC6               Pass: 100%/2   | Total:  1h 35m | Avg: 47m 58s | Max: 49m 48s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 51m | Avg: 55m 35s | Max:  1h 00m
      🟩 GCC8               Pass: 100%/1   | Total: 55m 46s | Avg: 55m 46s | Max: 55m 46s
      🟩 GCC9               Pass: 100%/3   | Total:  2h 29m | Avg: 49m 46s | Max: 54m 43s
      🟩 GCC10              Pass: 100%/1   | Total: 59m 01s | Avg: 59m 01s | Max: 59m 01s
      🟩 GCC11              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
      🟩 GCC12              Pass: 100%/3   | Total:  1h 43m | Avg: 34m 24s | Max:  1h 01m
      🟩 GCC13              Pass: 100%/8   | Total:  4h 38m | Avg: 34m 48s | Max:  1h 02m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 55m 43s | Avg: 55m 43s | Max: 55m 43s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m | Hits: 206%/783   
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m | Hits: 206%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 16m | Hits: 206%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 09m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 16h 57m | Avg: 53m 33s | Max:  1h 04m
      🟩 GCC                Pass: 100%/21  | Total: 15h 14m | Avg: 43m 33s | Max:  1h 02m
      🟩 Intel              Pass: 100%/1   | Total: 55m 43s | Avg: 55m 43s | Max: 55m 43s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 39m | Avg:  1h 09m | Max:  1h 16m | Hits: 206%/3132  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 09m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 42m 02s | Avg: 21m 01s | Max: 25m 43s
      🟩 v100               Pass: 100%/45  | Total:  1d 15h | Avg: 52m 24s | Max:  1h 16m | Hits: 206%/3132  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 13h | Avg: 56m 32s | Max:  1h 16m | Hits: 206%/3132  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 32s | Avg: 18m 32s | Max: 18m 32s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s
      🟩 HostLaunch         Pass: 100%/3   | Total: 56m 29s | Avg: 18m 49s | Max: 21m 23s
      🟩 TestGPU            Pass: 100%/2   | Total: 48m 48s | Avg: 24m 24s | Max: 25m 28s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 42m 02s | Avg: 21m 01s | Max: 25m 43s
      🟩 90a                Pass: 100%/1   | Total: 25m 56s | Avg: 25m 56s | Max: 25m 56s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  4h 17m | Avg: 51m 30s | Max: 58m 09s
      🟩 14                 Pass: 100%/4   | Total:  3h 48m | Avg: 57m 10s | Max:  1h 01m | Hits: 206%/783   
      🟩 17                 Pass: 100%/12  | Total: 11h 38m | Avg: 58m 13s | Max:  1h 11m | Hits: 206%/1566  
      🟩 20                 Pass: 100%/26  | Total: 20h 15m | Avg: 46m 45s | Max:  1h 16m | Hits: 205%/783   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 1d 01h | Avg: 33m 34s | Max: 1h 01m | Hits: 217%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 43m 52s | Avg: 21m 56s | Max: 30m 30s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total:  1d 00h | Avg: 33m 37s | Max:  1h 01m | Hits: 217%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 22s | Max: 33m 34s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  3h 45m | Avg: 32m 15s | Max: 59m 54s | Hits: 188%/1852  
      🟩 12.5               Pass: 100%/2   | Total:  1h 49m | Avg: 54m 45s | Max: 57m 18s
      🟩 12.6               Pass: 100%/37  | Total: 20h 09m | Avg: 32m 40s | Max:  1h 01m | Hits: 225%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 59m 47s | Avg: 29m 53s | Max: 30m 12s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  3h 45m | Avg: 32m 15s | Max: 59m 54s | Hits: 188%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 49m | Avg: 54m 45s | Max: 57m 18s
      🟩 nvcc12.6           Pass: 100%/35  | Total: 19h 09m | Avg: 32m 50s | Max:  1h 01m | Hits: 225%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 59m 47s | Avg: 29m 53s | Max: 30m 12s
      🟩 nvcc               Pass: 100%/44  | Total:  1d 00h | Avg: 33m 44s | Max:  1h 01m | Hits: 217%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  1h 59m | Avg: 29m 59s | Max: 35m 56s
      🟩 Clang10            Pass: 100%/1   | Total: 33m 59s | Avg: 33m 59s | Max: 33m 59s
      🟩 Clang11            Pass: 100%/1   | Total: 33m 07s | Avg: 33m 07s | Max: 33m 07s
      🟩 Clang12            Pass: 100%/1   | Total: 34m 17s | Avg: 34m 17s | Max: 34m 17s
      🟩 Clang13            Pass: 100%/1   | Total: 32m 28s | Avg: 32m 28s | Max: 32m 28s
      🟩 Clang14            Pass: 100%/1   | Total: 30m 35s | Avg: 30m 35s | Max: 30m 35s
      🟩 Clang15            Pass: 100%/1   | Total: 35m 13s | Avg: 35m 13s | Max: 35m 13s
      🟩 Clang16            Pass: 100%/1   | Total: 33m 50s | Avg: 33m 50s | Max: 33m 50s
      🟩 Clang17            Pass: 100%/1   | Total: 31m 32s | Avg: 31m 32s | Max: 31m 32s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 01m | Avg: 25m 57s | Max: 37m 23s
      🟩 GCC6               Pass: 100%/2   | Total: 57m 01s | Avg: 28m 30s | Max: 29m 48s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 24s | Max: 35m 14s
      🟩 GCC8               Pass: 100%/1   | Total: 35m 38s | Avg: 35m 38s | Max: 35m 38s
      🟩 GCC9               Pass: 100%/3   | Total:  1h 26m | Avg: 28m 55s | Max: 33m 54s
      🟩 GCC10              Pass: 100%/1   | Total: 33m 49s | Avg: 33m 49s | Max: 33m 49s
      🟩 GCC11              Pass: 100%/1   | Total: 35m 21s | Avg: 35m 21s | Max: 35m 21s
      🟩 GCC12              Pass: 100%/1   | Total: 35m 41s | Avg: 35m 41s | Max: 35m 41s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 18m | Avg: 24m 51s | Max: 40m 21s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 44m 19s | Avg: 44m 19s | Max: 44m 19s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 59m 54s | Avg: 59m 54s | Max: 59m 54s | Hits: 188%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m | Hits: 178%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 36m | Avg: 52m 06s | Max:  1h 01m | Hits: 240%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 45s | Max: 57m 18s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  9h 26m | Avg: 29m 49s | Max: 37m 23s
      🟩 GCC                Pass: 100%/19  | Total:  9h 05m | Avg: 28m 44s | Max: 40m 21s
      🟩 Intel              Pass: 100%/1   | Total: 44m 19s | Avg: 44m 19s | Max: 44m 19s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 37m | Avg: 55m 35s | Max:  1h 01m | Hits: 217%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 49m | Avg: 54m 45s | Max: 57m 18s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total:  1d 01h | Avg: 33m 34s | Max:  1h 01m | Hits: 217%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 00h | Avg: 36m 13s | Max:  1h 01m | Hits: 180%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 03s | Avg: 17m 01s | Max: 35m 25s | Hits: 365%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 44m 32s | Avg: 14m 50s | Max: 18m 08s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 19m 48s | Avg: 19m 48s | Max: 19m 48s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  2h 11m | Avg: 26m 13s | Max: 28m 02s
      🟩 14                 Pass: 100%/4   | Total:  2h 40m | Avg: 40m 13s | Max: 59m 54s | Hits: 188%/1852  
      🟩 17                 Pass: 100%/12  | Total:  8h 10m | Avg: 40m 54s | Max:  1h 01m | Hits: 178%/3704  
      🟩 20                 Pass: 100%/23  | Total: 11h 57m | Avg: 31m 12s | Max: 59m 35s | Hits: 272%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 19s | Avg: 4m 39s | Max: 7m 07s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 07s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 07s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 07s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 07s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 07s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 07s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 07s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
      🟩 Test               Pass: 100%/1   | Total:  7m 07s | Avg:  7m 07s | Max:  7m 07s
    
  • 🟩 python: Pass: 100%/1 | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 28m 37s | Avg: 28m 37s | Max: 28m 37s
    

👃 Inspect Changes

Modifications in project?

+/-  Project
     CCCL Infrastructure
     libcu++
+/-  CUB
     Thrust
     CUDA Experimental
     python
     CCCL C Parallel Library
     Catch2Helper

Modifications in project or dependencies?

+/-  Project
     CCCL Infrastructure
     libcu++
+/-  CUB
+/-  Thrust
     CUDA Experimental
+/-  python
+/-  CCCL C Parallel Library
+/-  Catch2Helper

🏃‍ Runner counts (total jobs: 96)

# Runner
71 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber bernhardmgruber enabled auto-merge (squash) January 22, 2025 19:36
Contributor

🟩 CI finished in 2h 59m: Pass: 100%/96 | Total: 2d 10h | Avg: 36m 31s | Max: 1h 12m | Hits: 215%/12392
  • 🟩 cub: Pass: 100%/47 | Total: 1d 11h | Avg: 44m 50s | Max: 1h 12m | Hits: 207%/3132

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total:  1d 09h | Avg: 44m 24s | Max:  1h 12m | Hits: 207%/3132  
      🟩 arm64              Pass: 100%/2   | Total:  1h 49m | Avg: 54m 51s | Max: 54m 54s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 30m | Avg: 12m 53s | Max:  1h 04m | Hits: 208%/783   
      🟩 12.5               Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 12m
      🟩 12.6               Pass: 100%/38  | Total:  1d 07h | Avg: 49m 18s | Max:  1h 12m | Hits: 206%/2349  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 30m | Avg: 12m 53s | Max:  1h 04m | Hits: 208%/783   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 12m
      🟩 nvcc12.6           Pass: 100%/36  | Total:  1d 05h | Avg: 48m 31s | Max:  1h 12m | Hits: 206%/2349  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🟩 nvcc               Pass: 100%/45  | Total:  1d 09h | Avg: 44m 01s | Max:  1h 12m | Hits: 207%/3132  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  2h 03m | Avg: 30m 55s | Max: 58m 32s
      🟩 Clang10            Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 Clang11            Pass: 100%/1   | Total: 52m 59s | Avg: 52m 59s | Max: 52m 59s
      🟩 Clang12            Pass: 100%/1   | Total: 52m 56s | Avg: 52m 56s | Max: 52m 56s
      🟩 Clang13            Pass: 100%/1   | Total: 55m 27s | Avg: 55m 27s | Max: 55m 27s
      🟩 Clang14            Pass: 100%/1   | Total: 53m 33s | Avg: 53m 33s | Max: 53m 33s
      🟩 Clang15            Pass: 100%/1   | Total: 52m 41s | Avg: 52m 41s | Max: 52m 41s
      🟩 Clang16            Pass: 100%/1   | Total: 54m 49s | Avg: 54m 49s | Max: 54m 49s
      🟩 Clang17            Pass: 100%/1   | Total: 57m 57s | Avg: 57m 57s | Max: 57m 57s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 35m | Avg: 47m 52s | Max:  1h 03m
      🟩 GCC6               Pass: 100%/2   | Total:  8m 19s | Avg:  4m 09s | Max:  4m 20s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 02s | Max:  1h 00m
      🟩 GCC8               Pass: 100%/1   | Total: 55m 40s | Avg: 55m 40s | Max: 55m 40s
      🟩 GCC9               Pass: 100%/3   | Total:  1h 09m | Avg: 23m 11s | Max:  1h 00m
      🟩 GCC10              Pass: 100%/1   | Total: 53m 01s | Avg: 53m 01s | Max: 53m 01s
      🟩 GCC11              Pass: 100%/1   | Total: 53m 55s | Avg: 53m 55s | Max: 53m 55s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 36m | Avg: 32m 12s | Max: 53m 34s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 41m | Avg: 35m 12s | Max: 57m 33s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 58m 17s | Avg: 58m 17s | Max: 58m 17s
      🟩 MSVC14.16          Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits: 208%/783   
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 09m | Avg:  1h 09m | Max:  1h 09m | Hits: 206%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m | Hits: 206%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 12m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 14h 59m | Avg: 47m 20s | Max:  1h 03m
      🟩 GCC                Pass: 100%/21  | Total: 12h 12m | Avg: 34m 53s | Max:  1h 00m
      🟩 Intel              Pass: 100%/1   | Total: 58m 17s | Avg: 58m 17s | Max: 58m 17s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 33m | Avg:  1h 08m | Max:  1h 12m | Hits: 207%/3132  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 12m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 43m 02s | Avg: 21m 31s | Max: 26m 50s
      🟩 v100               Pass: 100%/45  | Total:  1d 10h | Avg: 45m 53s | Max:  1h 12m | Hits: 207%/3132  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  1d 08h | Avg: 48m 59s | Max:  1h 12m | Hits: 207%/3132  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 08s | Avg: 18m 08s | Max: 18m 08s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 17s | Avg: 19m 17s | Max: 19m 17s
      🟩 HostLaunch         Pass: 100%/3   | Total: 56m 02s | Avg: 18m 40s | Max: 22m 01s
      🟩 TestGPU            Pass: 100%/2   | Total: 54m 31s | Avg: 27m 15s | Max: 28m 52s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 43m 02s | Avg: 21m 31s | Max: 26m 50s
      🟩 90a                Pass: 100%/1   | Total: 25m 38s | Avg: 25m 38s | Max: 25m 38s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  2h 04m | Avg: 24m 49s | Max: 58m 32s
      🟩 14                 Pass: 100%/4   | Total:  3h 05m | Avg: 46m 23s | Max:  1h 04m | Hits: 208%/783   
      🟩 17                 Pass: 100%/12  | Total: 10h 25m | Avg: 52m 07s | Max:  1h 11m | Hits: 206%/1566  
      🟩 20                 Pass: 100%/26  | Total: 19h 32m | Avg: 45m 06s | Max:  1h 12m | Hits: 205%/783   
    
  • 🟩 thrust: Pass: 100%/46 | Total: 22h 42m | Avg: 29m 37s | Max: 1h 03m | Hits: 218%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 40m 30s | Avg: 20m 15s | Max: 28m 51s
    🟩 cpu
      🟩 amd64              Pass: 100%/44  | Total: 21h 35m | Avg: 29m 26s | Max:  1h 03m | Hits: 218%/9260  
      🟩 arm64              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 47s | Max: 36m 52s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  1h 22m | Avg: 11m 43s | Max: 56m 35s | Hits: 193%/1852  
      🟩 12.5               Pass: 100%/2   | Total:  1h 47m | Avg: 53m 44s | Max: 57m 02s
      🟩 12.6               Pass: 100%/37  | Total: 19h 33m | Avg: 31m 42s | Max:  1h 03m | Hits: 225%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 54m 09s | Avg: 27m 04s | Max: 27m 55s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  1h 22m | Avg: 11m 43s | Max: 56m 35s | Hits: 193%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 47m | Avg: 53m 44s | Max: 57m 02s
      🟩 nvcc12.6           Pass: 100%/35  | Total: 18h 38m | Avg: 31m 58s | Max:  1h 03m | Hits: 225%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 54m 09s | Avg: 27m 04s | Max: 27m 55s
      🟩 nvcc               Pass: 100%/44  | Total: 21h 48m | Avg: 29m 44s | Max:  1h 03m | Hits: 218%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total:  1h 10m | Avg: 17m 30s | Max: 34m 18s
      🟩 Clang10            Pass: 100%/1   | Total: 36m 22s | Avg: 36m 22s | Max: 36m 22s
      🟩 Clang11            Pass: 100%/1   | Total: 32m 53s | Avg: 32m 53s | Max: 32m 53s
      🟩 Clang12            Pass: 100%/1   | Total: 31m 41s | Avg: 31m 41s | Max: 31m 41s
      🟩 Clang13            Pass: 100%/1   | Total: 29m 52s | Avg: 29m 52s | Max: 29m 52s
      🟩 Clang14            Pass: 100%/1   | Total: 30m 58s | Avg: 30m 58s | Max: 30m 58s
      🟩 Clang15            Pass: 100%/1   | Total: 30m 55s | Avg: 30m 55s | Max: 30m 55s
      🟩 Clang16            Pass: 100%/1   | Total: 35m 00s | Avg: 35m 00s | Max: 35m 00s
      🟩 Clang17            Pass: 100%/1   | Total: 31m 05s | Avg: 31m 05s | Max: 31m 05s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 49m | Avg: 24m 16s | Max: 33m 07s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 12s | Avg:  4m 06s | Max:  4m 22s
      🟩 GCC7               Pass: 100%/2   | Total: 55m 07s | Avg: 27m 33s | Max: 29m 20s
      🟩 GCC8               Pass: 100%/1   | Total: 35m 50s | Avg: 35m 50s | Max: 35m 50s
      🟩 GCC9               Pass: 100%/3   | Total: 39m 13s | Avg: 13m 04s | Max: 30m 38s
      🟩 GCC10              Pass: 100%/1   | Total: 33m 12s | Avg: 33m 12s | Max: 33m 12s
      🟩 GCC11              Pass: 100%/1   | Total: 31m 03s | Avg: 31m 03s | Max: 31m 03s
      🟩 GCC12              Pass: 100%/1   | Total: 40m 56s | Avg: 40m 56s | Max: 40m 56s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 19m | Avg: 24m 59s | Max: 36m 52s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 43m 07s | Avg: 43m 07s | Max: 43m 07s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 56m 35s | Avg: 56m 35s | Max: 56m 35s | Hits: 193%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 58m 09s | Avg: 58m 09s | Max: 58m 09s | Hits: 178%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 35m | Avg: 51m 41s | Max:  1h 03m | Hits: 240%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 44s | Max: 57m 02s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  8h 18m | Avg: 26m 15s | Max: 36m 22s
      🟩 GCC                Pass: 100%/19  | Total:  7h 23m | Avg: 23m 20s | Max: 40m 56s
      🟩 Intel              Pass: 100%/1   | Total: 43m 07s | Avg: 43m 07s | Max: 43m 07s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 29m | Avg: 53m 57s | Max:  1h 03m | Hits: 218%/9260  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 47m | Avg: 53m 44s | Max: 57m 02s
    🟩 gpu
      🟩 v100               Pass: 100%/46  | Total: 22h 42m | Avg: 29m 37s | Max:  1h 03m | Hits: 218%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total: 21h 04m | Avg: 31m 37s | Max:  1h 03m | Hits: 182%/7408  
      🟩 TestCPU            Pass: 100%/3   | Total: 53m 11s | Avg: 17m 43s | Max: 37m 18s | Hits: 365%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 44m 46s | Avg: 14m 55s | Max: 21m 07s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 21m 18s | Avg: 21m 18s | Max: 21m 18s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  1h 04m | Avg: 12m 51s | Max: 27m 01s
      🟩 14                 Pass: 100%/4   | Total:  2h 04m | Avg: 31m 08s | Max: 56m 35s | Hits: 193%/1852  
      🟩 17                 Pass: 100%/12  | Total:  7h 03m | Avg: 35m 15s | Max: 58m 09s | Hits: 178%/3704  
      🟩 20                 Pass: 100%/23  | Total: 11h 50m | Avg: 30m 52s | Max:  1h 03m | Hits: 272%/3704  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 55s | Avg: 4m 57s | Max: 7m 41s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 41s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 41s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 41s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 41s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 41s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 41s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 41s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 14s | Avg:  2m 14s | Max:  2m 14s
      🟩 Test               Pass: 100%/1   | Total:  7m 41s | Avg:  7m 41s | Max:  7m 41s
    
  • 🟩 python: Pass: 100%/1 | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 25m 30s | Avg: 25m 30s | Max: 25m 30s
    

👃 Inspect Changes

Modifications in project?

+/-  Project
     CCCL Infrastructure
     libcu++
+/-  CUB
     Thrust
     CUDA Experimental
     python
     CCCL C Parallel Library
     Catch2Helper

Modifications in project or dependencies?

+/-  Project
     CCCL Infrastructure
     libcu++
+/-  CUB
+/-  Thrust
     CUDA Experimental
+/-  python
+/-  CCCL C Parallel Library
+/-  Catch2Helper

🏃‍ Runner counts (total jobs: 96)

# Runner
71 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber bernhardmgruber merged commit 07d5184 into NVIDIA:branch/2.8.x Jan 22, 2025
111 checks passed
@bernhardmgruber bernhardmgruber deleted the backport_warp_spec branch January 22, 2025 22:30
Labels
None yet
Projects
Status: Done
2 participants