Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Backport to 2.8: Deprecate CUB iterators existing in Thrust (#3304) #3534

Open
wants to merge 2 commits into
base: branch/2.8.x
Choose a base branch
from

Conversation

bernhardmgruber
Copy link
Contributor

We keep CUB iterators inside the CUB implementation headers, because some of those are exposed to NVRTC which cannot compile Thrust iterators yet.

We keep CUB iterators inside the CUB implementation headers, because some of those are exposed to NVRTC which cannot compile Thrust iterators yet.
@bernhardmgruber
Copy link
Contributor Author

And my favorite again:

  /home/coder/cccl/cub/test/catch2_test_iterator.cu(104): internal error #2656: assertion failed: alloc_copy_of_pending_pragma: copied pragma has source sequence entry (pragma.c, line 518 in alloc_copy_of_pending_pragma)

@bernhardmgruber
Copy link
Contributor Author

So MatX now fails with:

/home/coder/cccl/build/cuda12.6ext-gcc13/matx/MatX/include/matx/transforms/cub.h:994:110: error: ‘template<class ValueType, class OffsetT> class cub::CUB_200800_SM_600_700_800::CountingInputIterator’ is deprecated: Use thrust::counting_iterator instead [-Werror=deprecated-declarations]
  994 |         cub::DeviceSelect::If(d_temp,

which is the correct behavior. I wonder why this did not happen when we merged these changes into main.

Copy link
Contributor

🟨 CI finished in 3h 13m: Pass: 98%/96 | Total: 20h 18m | Avg: 12m 41s | Max: 40m 10s | Hits: 435%/10540
  • 🟨 thrust: Pass: 97%/46 | Total: 7h 27m | Avg: 9m 44s | Max: 35m 29s | Hits: 366%/7408

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/44  | Total:  7h 18m | Avg:  9m 57s | Max: 35m 29s | Hits: 366%/7408  
      🟩 arm64              Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  4m 55s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/7   | Total: 51m 36s | Avg:  7m 22s | Max: 25m 48s | Hits: 368%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 15m 26s
      🔍 12.6               Pass:  97%/37  | Total:  6h 07m | Avg:  9m 55s | Max: 35m 29s | Hits: 365%/5556  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 53s | Avg:  5m 26s | Max:  5m 29s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 51m 36s | Avg:  7m 22s | Max: 25m 48s | Hits: 368%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 15m 26s
      🔍 nvcc12.6           Pass:  97%/35  | Total:  5h 56m | Avg: 10m 10s | Max: 35m 29s | Hits: 365%/5556  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 53s | Avg:  5m 26s | Max:  5m 29s
      🔍 nvcc               Pass:  97%/44  | Total:  7h 17m | Avg:  9m 55s | Max: 35m 29s | Hits: 366%/7408  
    🔍 cxx: MSVC14.39 🔍
      🟩 Clang9             Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  6m 43s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 23s | Avg:  6m 23s | Max:  6m 23s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 26s | Avg:  5m 26s | Max:  5m 26s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 13s | Avg:  5m 13s | Max:  5m 13s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 25s | Avg:  5m 25s | Max:  5m 25s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 12s | Avg:  5m 12s | Max:  5m 12s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 Clang18            Pass: 100%/7   | Total: 45m 40s | Avg:  6m 31s | Max: 11m 49s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 11s | Avg:  4m 05s | Max:  4m 31s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 12s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 21s | Avg:  5m 21s | Max:  5m 21s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 57s | Avg:  4m 59s | Max:  5m 56s
      🟩 GCC10              Pass: 100%/1   | Total: 35m 29s | Avg: 35m 29s | Max: 35m 29s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 07m | Avg:  8m 23s | Max: 20m 12s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 40s | Avg:  6m 40s | Max:  6m 40s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 25m 48s | Avg: 25m 48s | Max: 25m 48s | Hits: 368%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 28m 38s | Avg: 28m 38s | Max: 28m 38s | Hits: 365%/1852  
      🔍 MSVC14.39          Pass:  66%/3   | Total:  1h 34m | Avg: 31m 21s | Max: 31m 46s | Hits: 365%/3704  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 15m 26s
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/19  | Total:  1h 51m | Avg:  5m 50s | Max: 11m 49s
      🟩 GCC                Pass: 100%/19  | Total:  2h 32m | Avg:  8m 01s | Max: 35m 29s
      🟩 Intel              Pass: 100%/1   | Total:  6m 40s | Avg:  6m 40s | Max:  6m 40s
      🔍 MSVC               Pass:  80%/5   | Total:  2h 28m | Avg: 29m 42s | Max: 31m 46s | Hits: 366%/7408  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 15m 26s
    🔍 jobs: TestCPU 🔍
      🟩 Build              Pass: 100%/40  | Total:  5h 57m | Avg:  8m 55s | Max: 35m 29s | Hits: 366%/7408  
      🔍 TestCPU            Pass:  66%/3   | Total: 45m 42s | Avg: 15m 14s | Max: 30m 47s
      🟩 TestGPU            Pass: 100%/3   | Total: 45m 05s | Avg: 15m 01s | Max: 20m 12s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/5   | Total: 22m 27s | Avg:  4m 29s | Max:  5m 56s
      🟩 14                 Pass: 100%/4   | Total: 42m 14s | Avg: 10m 33s | Max: 25m 48s | Hits: 368%/1852  
      🟩 17                 Pass: 100%/12  | Total:  2h 04m | Avg: 10m 23s | Max: 31m 31s | Hits: 365%/3704  
      🔍 20                 Pass:  95%/23  | Total:  3h 52m | Avg: 10m 07s | Max: 35m 29s | Hits: 365%/1852  
    🟨 gpu
      🟨 v100               Pass:  97%/46  | Total:  7h 27m | Avg:  9m 44s | Max: 35m 29s | Hits: 366%/7408  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 25m 42s | Avg: 12m 51s | Max: 20m 12s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 39s | Avg:  4m 39s | Max:  4m 39s
    
  • 🟩 cub: Pass: 100%/47 | Total: 12h 15m | Avg: 15m 38s | Max: 40m 10s | Hits: 598%/3132

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 11h 57m | Avg: 15m 56s | Max: 40m 10s | Hits: 598%/3132  
      🟩 arm64              Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max:  9m 25s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  3h 00m | Avg: 25m 45s | Max: 40m 10s | Hits: 598%/783   
      🟩 12.5               Pass: 100%/2   | Total: 46m 05s | Avg: 23m 02s | Max: 24m 09s
      🟩 12.6               Pass: 100%/38  | Total:  8h 28m | Avg: 13m 23s | Max: 34m 58s | Hits: 598%/2349  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 13m 10s | Avg:  6m 35s | Max:  7m 00s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  3h 00m | Avg: 25m 45s | Max: 40m 10s | Hits: 598%/783   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 46m 05s | Avg: 23m 02s | Max: 24m 09s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  8h 15m | Avg: 13m 45s | Max: 34m 58s | Hits: 598%/2349  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 13m 10s | Avg:  6m 35s | Max:  7m 00s
      🟩 nvcc               Pass: 100%/45  | Total: 12h 01m | Avg: 16m 02s | Max: 40m 10s | Hits: 598%/3132  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 37m 59s | Avg:  9m 29s | Max: 10m 44s
      🟩 Clang10            Pass: 100%/1   | Total:  9m 33s | Avg:  9m 33s | Max:  9m 33s
      🟩 Clang11            Pass: 100%/1   | Total:  7m 51s | Avg:  7m 51s | Max:  7m 51s
      🟩 Clang12            Pass: 100%/1   | Total:  8m 07s | Avg:  8m 07s | Max:  8m 07s
      🟩 Clang13            Pass: 100%/1   | Total:  7m 43s | Avg:  7m 43s | Max:  7m 43s
      🟩 Clang14            Pass: 100%/1   | Total:  8m 36s | Avg:  8m 36s | Max:  8m 36s
      🟩 Clang15            Pass: 100%/1   | Total:  9m 04s | Avg:  9m 04s | Max:  9m 04s
      🟩 Clang16            Pass: 100%/1   | Total:  7m 50s | Avg:  7m 50s | Max:  7m 50s
      🟩 Clang17            Pass: 100%/1   | Total:  8m 21s | Avg:  8m 21s | Max:  8m 21s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 40m | Avg: 14m 20s | Max: 32m 00s
      🟩 GCC6               Pass: 100%/2   | Total: 59m 07s | Avg: 29m 33s | Max: 29m 36s
      🟩 GCC7               Pass: 100%/2   | Total: 16m 04s | Avg:  8m 02s | Max:  8m 04s
      🟩 GCC8               Pass: 100%/1   | Total:  9m 24s | Avg:  9m 24s | Max:  9m 24s
      🟩 GCC9               Pass: 100%/3   | Total:  1h 10m | Avg: 23m 21s | Max: 31m 08s
      🟩 GCC10              Pass: 100%/1   | Total:  8m 34s | Avg:  8m 34s | Max:  8m 34s
      🟩 GCC11              Pass: 100%/1   | Total:  9m 12s | Avg:  9m 12s | Max:  9m 12s
      🟩 GCC12              Pass: 100%/3   | Total: 31m 49s | Avg: 10m 36s | Max: 16m 10s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 03m | Avg: 15m 22s | Max: 25m 40s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 40m 10s | Avg: 40m 10s | Max: 40m 10s | Hits: 598%/783   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 32m 20s | Avg: 32m 20s | Max: 32m 20s | Hits: 598%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 17s | Max: 34m 58s | Hits: 598%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 46m 05s | Avg: 23m 02s | Max: 24m 09s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  3h 25m | Avg: 10m 48s | Max: 32m 00s
      🟩 GCC                Pass: 100%/21  | Total:  5h 27m | Avg: 15m 35s | Max: 31m 08s
      🟩 Intel              Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 21m | Avg: 35m 16s | Max: 40m 10s | Hits: 598%/3132  
      🟩 NVHPC              Pass: 100%/2   | Total: 46m 05s | Avg: 23m 02s | Max: 24m 09s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 25s | Avg: 11m 12s | Max: 16m 10s
      🟩 v100               Pass: 100%/45  | Total: 11h 52m | Avg: 15m 50s | Max: 40m 10s | Hits: 598%/3132  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  9h 28m | Avg: 14m 12s | Max: 40m 10s | Hits: 598%/3132  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 10s | Avg: 22m 10s | Max: 22m 10s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 01s | Avg: 18m 01s | Max: 18m 01s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 51s | Max: 30m 20s
      🟩 TestGPU            Pass: 100%/2   | Total: 57m 40s | Avg: 28m 50s | Max: 32m 00s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 22m 25s | Avg: 11m 12s | Max: 16m 10s
      🟩 90a                Pass: 100%/1   | Total:  6m 27s | Avg:  6m 27s | Max:  6m 27s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  1h 27m | Avg: 17m 32s | Max: 31m 08s
      🟩 14                 Pass: 100%/4   | Total:  1h 26m | Avg: 21m 44s | Max: 40m 10s | Hits: 598%/783   
      🟩 17                 Pass: 100%/12  | Total:  3h 18m | Avg: 16m 31s | Max: 34m 58s | Hits: 598%/1566  
      🟩 20                 Pass: 100%/26  | Total:  6h 02m | Avg: 13m 55s | Max: 33m 37s | Hits: 598%/783   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 01s | Avg: 5m 00s | Max: 7m 24s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 37s | Avg:  2m 37s | Max:  2m 37s
      🟩 Test               Pass: 100%/1   | Total:  7m 24s | Avg:  7m 24s | Max:  7m 24s
    
  • 🟩 python: Pass: 100%/1 | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 96)

# Runner
71 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Copy link
Contributor

🟨 CI finished in 12h 33m: Pass: 98%/96 | Total: 20h 21m | Avg: 12m 43s | Max: 40m 10s | Hits: 435%/10540
  • 🟨 thrust: Pass: 97%/46 | Total: 7h 30m | Avg: 9m 47s | Max: 35m 29s | Hits: 366%/7408

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/44  | Total:  7h 21m | Avg: 10m 01s | Max: 35m 29s | Hits: 366%/7408  
      🟩 arm64              Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  4m 55s
    🔍 ctk: 12.6 🔍
      🟩 11.1               Pass: 100%/7   | Total: 51m 36s | Avg:  7m 22s | Max: 25m 48s | Hits: 368%/1852  
      🟩 12.5               Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 15m 26s
      🔍 12.6               Pass:  97%/37  | Total:  6h 09m | Avg:  9m 59s | Max: 35m 29s | Hits: 365%/5556  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 53s | Avg:  5m 26s | Max:  5m 29s
      🟩 nvcc11.1           Pass: 100%/7   | Total: 51m 36s | Avg:  7m 22s | Max: 25m 48s | Hits: 368%/1852  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 15m 26s
      🔍 nvcc12.6           Pass:  97%/35  | Total:  5h 58m | Avg: 10m 15s | Max: 35m 29s | Hits: 365%/5556  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 53s | Avg:  5m 26s | Max:  5m 29s
      🔍 nvcc               Pass:  97%/44  | Total:  7h 19m | Avg:  9m 59s | Max: 35m 29s | Hits: 366%/7408  
    🔍 cxx: MSVC14.39 🔍
      🟩 Clang9             Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  6m 43s
      🟩 Clang10            Pass: 100%/1   | Total:  6m 23s | Avg:  6m 23s | Max:  6m 23s
      🟩 Clang11            Pass: 100%/1   | Total:  5m 26s | Avg:  5m 26s | Max:  5m 26s
      🟩 Clang12            Pass: 100%/1   | Total:  5m 13s | Avg:  5m 13s | Max:  5m 13s
      🟩 Clang13            Pass: 100%/1   | Total:  5m 25s | Avg:  5m 25s | Max:  5m 25s
      🟩 Clang14            Pass: 100%/1   | Total:  5m 12s | Avg:  5m 12s | Max:  5m 12s
      🟩 Clang15            Pass: 100%/1   | Total:  5m 16s | Avg:  5m 16s | Max:  5m 16s
      🟩 Clang16            Pass: 100%/1   | Total:  5m 52s | Avg:  5m 52s | Max:  5m 52s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
      🟩 Clang18            Pass: 100%/7   | Total: 45m 40s | Avg:  6m 31s | Max: 11m 49s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 11s | Avg:  4m 05s | Max:  4m 31s
      🟩 GCC7               Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 12s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 21s | Avg:  5m 21s | Max:  5m 21s
      🟩 GCC9               Pass: 100%/3   | Total: 14m 57s | Avg:  4m 59s | Max:  5m 56s
      🟩 GCC10              Pass: 100%/1   | Total: 35m 29s | Avg: 35m 29s | Max: 35m 29s
      🟩 GCC11              Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 31s | Avg:  5m 31s | Max:  5m 31s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 07m | Avg:  8m 23s | Max: 20m 12s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  6m 40s | Avg:  6m 40s | Max:  6m 40s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 25m 48s | Avg: 25m 48s | Max: 25m 48s | Hits: 368%/1852  
      🟩 MSVC14.29          Pass: 100%/1   | Total: 28m 38s | Avg: 28m 38s | Max: 28m 38s | Hits: 365%/1852  
      🔍 MSVC14.39          Pass:  66%/3   | Total:  1h 36m | Avg: 32m 12s | Max: 33m 20s | Hits: 365%/3704  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 15m 26s
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/19  | Total:  1h 51m | Avg:  5m 50s | Max: 11m 49s
      🟩 GCC                Pass: 100%/19  | Total:  2h 32m | Avg:  8m 01s | Max: 35m 29s
      🟩 Intel              Pass: 100%/1   | Total:  6m 40s | Avg:  6m 40s | Max:  6m 40s
      🔍 MSVC               Pass:  80%/5   | Total:  2h 31m | Avg: 30m 12s | Max: 33m 20s | Hits: 366%/7408  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 12s | Avg: 14m 36s | Max: 15m 26s
    🔍 jobs: TestCPU 🔍
      🟩 Build              Pass: 100%/40  | Total:  5h 57m | Avg:  8m 55s | Max: 35m 29s | Hits: 366%/7408  
      🔍 TestCPU            Pass:  66%/3   | Total: 48m 15s | Avg: 16m 05s | Max: 33m 20s
      🟩 TestGPU            Pass: 100%/3   | Total: 45m 05s | Avg: 15m 01s | Max: 20m 12s
    🔍 std: 20 🔍
      🟩 11                 Pass: 100%/5   | Total: 22m 27s | Avg:  4m 29s | Max:  5m 56s
      🟩 14                 Pass: 100%/4   | Total: 42m 14s | Avg: 10m 33s | Max: 25m 48s | Hits: 368%/1852  
      🟩 17                 Pass: 100%/12  | Total:  2h 04m | Avg: 10m 23s | Max: 31m 31s | Hits: 365%/3704  
      🔍 20                 Pass:  95%/23  | Total:  3h 55m | Avg: 10m 13s | Max: 35m 29s | Hits: 365%/1852  
    🟨 gpu
      🟨 v100               Pass:  97%/46  | Total:  7h 30m | Avg:  9m 47s | Max: 35m 29s | Hits: 366%/7408  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 25m 42s | Avg: 12m 51s | Max: 20m 12s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 39s | Avg:  4m 39s | Max:  4m 39s
    
  • 🟩 cub: Pass: 100%/47 | Total: 12h 15m | Avg: 15m 38s | Max: 40m 10s | Hits: 598%/3132

    🟩 cpu
      🟩 amd64              Pass: 100%/45  | Total: 11h 57m | Avg: 15m 56s | Max: 40m 10s | Hits: 598%/3132  
      🟩 arm64              Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max:  9m 25s
    🟩 ctk
      🟩 11.1               Pass: 100%/7   | Total:  3h 00m | Avg: 25m 45s | Max: 40m 10s | Hits: 598%/783   
      🟩 12.5               Pass: 100%/2   | Total: 46m 05s | Avg: 23m 02s | Max: 24m 09s
      🟩 12.6               Pass: 100%/38  | Total:  8h 28m | Avg: 13m 23s | Max: 34m 58s | Hits: 598%/2349  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 13m 10s | Avg:  6m 35s | Max:  7m 00s
      🟩 nvcc11.1           Pass: 100%/7   | Total:  3h 00m | Avg: 25m 45s | Max: 40m 10s | Hits: 598%/783   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 46m 05s | Avg: 23m 02s | Max: 24m 09s
      🟩 nvcc12.6           Pass: 100%/36  | Total:  8h 15m | Avg: 13m 45s | Max: 34m 58s | Hits: 598%/2349  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 13m 10s | Avg:  6m 35s | Max:  7m 00s
      🟩 nvcc               Pass: 100%/45  | Total: 12h 01m | Avg: 16m 02s | Max: 40m 10s | Hits: 598%/3132  
    🟩 cxx
      🟩 Clang9             Pass: 100%/4   | Total: 37m 59s | Avg:  9m 29s | Max: 10m 44s
      🟩 Clang10            Pass: 100%/1   | Total:  9m 33s | Avg:  9m 33s | Max:  9m 33s
      🟩 Clang11            Pass: 100%/1   | Total:  7m 51s | Avg:  7m 51s | Max:  7m 51s
      🟩 Clang12            Pass: 100%/1   | Total:  8m 07s | Avg:  8m 07s | Max:  8m 07s
      🟩 Clang13            Pass: 100%/1   | Total:  7m 43s | Avg:  7m 43s | Max:  7m 43s
      🟩 Clang14            Pass: 100%/1   | Total:  8m 36s | Avg:  8m 36s | Max:  8m 36s
      🟩 Clang15            Pass: 100%/1   | Total:  9m 04s | Avg:  9m 04s | Max:  9m 04s
      🟩 Clang16            Pass: 100%/1   | Total:  7m 50s | Avg:  7m 50s | Max:  7m 50s
      🟩 Clang17            Pass: 100%/1   | Total:  8m 21s | Avg:  8m 21s | Max:  8m 21s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 40m | Avg: 14m 20s | Max: 32m 00s
      🟩 GCC6               Pass: 100%/2   | Total: 59m 07s | Avg: 29m 33s | Max: 29m 36s
      🟩 GCC7               Pass: 100%/2   | Total: 16m 04s | Avg:  8m 02s | Max:  8m 04s
      🟩 GCC8               Pass: 100%/1   | Total:  9m 24s | Avg:  9m 24s | Max:  9m 24s
      🟩 GCC9               Pass: 100%/3   | Total:  1h 10m | Avg: 23m 21s | Max: 31m 08s
      🟩 GCC10              Pass: 100%/1   | Total:  8m 34s | Avg:  8m 34s | Max:  8m 34s
      🟩 GCC11              Pass: 100%/1   | Total:  9m 12s | Avg:  9m 12s | Max:  9m 12s
      🟩 GCC12              Pass: 100%/3   | Total: 31m 49s | Avg: 10m 36s | Max: 16m 10s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 03m | Avg: 15m 22s | Max: 25m 40s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 40m 10s | Avg: 40m 10s | Max: 40m 10s | Hits: 598%/783   
      🟩 MSVC14.29          Pass: 100%/1   | Total: 32m 20s | Avg: 32m 20s | Max: 32m 20s | Hits: 598%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 17s | Max: 34m 58s | Hits: 598%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 46m 05s | Avg: 23m 02s | Max: 24m 09s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  3h 25m | Avg: 10m 48s | Max: 32m 00s
      🟩 GCC                Pass: 100%/21  | Total:  5h 27m | Avg: 15m 35s | Max: 31m 08s
      🟩 Intel              Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 21m | Avg: 35m 16s | Max: 40m 10s | Hits: 598%/3132  
      🟩 NVHPC              Pass: 100%/2   | Total: 46m 05s | Avg: 23m 02s | Max: 24m 09s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 25s | Avg: 11m 12s | Max: 16m 10s
      🟩 v100               Pass: 100%/45  | Total: 11h 52m | Avg: 15m 50s | Max: 40m 10s | Hits: 598%/3132  
    🟩 jobs
      🟩 Build              Pass: 100%/40  | Total:  9h 28m | Avg: 14m 12s | Max: 40m 10s | Hits: 598%/3132  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 10s | Avg: 22m 10s | Max: 22m 10s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 01s | Avg: 18m 01s | Max: 18m 01s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 08m | Avg: 22m 51s | Max: 30m 20s
      🟩 TestGPU            Pass: 100%/2   | Total: 57m 40s | Avg: 28m 50s | Max: 32m 00s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 22m 25s | Avg: 11m 12s | Max: 16m 10s
      🟩 90a                Pass: 100%/1   | Total:  6m 27s | Avg:  6m 27s | Max:  6m 27s
    🟩 std
      🟩 11                 Pass: 100%/5   | Total:  1h 27m | Avg: 17m 32s | Max: 31m 08s
      🟩 14                 Pass: 100%/4   | Total:  1h 26m | Avg: 21m 44s | Max: 40m 10s | Hits: 598%/783   
      🟩 17                 Pass: 100%/12  | Total:  3h 18m | Avg: 16m 31s | Max: 34m 58s | Hits: 598%/1566  
      🟩 20                 Pass: 100%/26  | Total:  6h 02m | Avg: 13m 55s | Max: 33m 37s | Hits: 598%/783   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 01s | Avg: 5m 00s | Max: 7m 24s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  7m 24s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 37s | Avg:  2m 37s | Max:  2m 37s
      🟩 Test               Pass: 100%/1   | Total:  7m 24s | Avg:  7m 24s | Max:  7m 24s
    
  • 🟩 python: Pass: 100%/1 | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 25m 46s | Avg: 25m 46s | Max: 25m 46s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 96)

# Runner
71 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

3 participants