Deprecate and replace thrust::cuda_cub iterators #3422

Merged (2 commits) on Jan 23, 2025

@bernhardmgruber commented on Jan 16, 2025

  • No SASS changes for thrust.test.count on SM86 except kernel symbol names
  • No SASS changes for thrust.test.min_element on SM86 except kernel symbol names
  • No SASS changes for thrust.test.find on SM86 except kernel symbol names
  • No SASS changes for thrust.test.inner_product on SM86 except kernel symbol names
  • SASS changes for thrust.test.mismatch on SM86: kernel symbol names and addresses into constant memory, but same instructions
  • No SASS changes for thrust.test.transform_scan on SM86 except kernel symbol names
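
A comparison like the one summarized above could be sketched as follows. This is only an illustrative sketch, not the procedure actually used for this PR: it assumes the SASS of the old and new test binaries was dumped to text (for example with `cuobjdump -sass`), that kernel symbols appear as Itanium-mangled names starting with `_Z`, and that constant memory operands look like `c[0x0][0x160]`. The helper names `normalize_sass` and `same_sass` are hypothetical.

```cpp
#include <cassert>
#include <regex>
#include <sstream>
#include <string>

// Hypothetical helper: rewrite a SASS dump so that details expected to change
// (kernel symbol names, optionally constant memory addresses) are masked out.
std::string normalize_sass(const std::string& sass, bool mask_constant_addresses = false)
{
  // Assumption: kernel symbols are Itanium-mangled identifiers starting with _Z.
  static const std::regex mangled_name(R"(_Z[A-Za-z0-9_]+)");
  // Assumption: constant memory operands are printed as c[0x<bank>][0x<offset>].
  static const std::regex constant_ref(R"(c\[0x[0-9a-fA-F]+\]\[0x[0-9a-fA-F]+\])");

  std::istringstream in(sass);
  std::ostringstream out;
  std::string line;
  while (std::getline(in, line))
  {
    line = std::regex_replace(line, mangled_name, "<symbol>");
    if (mask_constant_addresses)
    {
      line = std::regex_replace(line, constant_ref, "c[?][?]");
    }
    out << line << '\n';
  }
  return out.str();
}

// Hypothetical helper: true if two dumps are identical after normalization.
bool same_sass(const std::string& before, const std::string& after, bool mask_constant_addresses = false)
{
  return normalize_sass(before, mask_constant_addresses) == normalize_sass(after, mask_constant_addresses);
}
```

Under these assumptions, `same_sass(before, after)` would correspond to the "no SASS changes except kernel symbol names" cases, while `thrust.test.mismatch` would only compare equal with `mask_constant_addresses` set to `true`.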

@bernhardmgruber requested a review from a team as a code owner on January 16, 2025 15:01
@bernhardmgruber added the labels "thrust" (For all items related to Thrust.) and "backport branch/2.8.x" on Jan 16, 2025

🟨 CI finished in 2h 01m: Pass: 97%/78 | Total: 2d 08h | Avg: 43m 27s | Max: 1h 15m | Hits: 230%/12720
  • 🟨 cub: Pass: 97%/38 | Total: 1d 08h | Avg: 51m 03s | Max: 1h 13m | Hits: 338%/3540

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/36  | Total:  1d 06h | Avg: 50m 47s | Max:  1h 13m | Hits: 338%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  1h 51m | Avg: 55m 48s | Max: 57m 03s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 44m | Avg: 56m 56s | Max:  1h 02m | Hits: 338%/885   
      🟩 12.5               Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🔍 12.6               Pass:  96%/31  | Total:  1d 01h | Avg: 49m 19s | Max:  1h 13m | Hits: 338%/2655  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 03m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 44m | Avg: 56m 56s | Max:  1h 02m | Hits: 338%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🔍 nvcc12.6           Pass:  96%/29  | Total: 23h 29m | Avg: 48m 35s | Max:  1h 13m | Hits: 338%/2655  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 03m
      🔍 nvcc               Pass:  97%/36  | Total:  1d 06h | Avg: 50m 33s | Max:  1h 13m | Hits: 338%/3540  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  3h 42m | Avg: 55m 31s | Max: 57m 55s
      🟩 Clang15            Pass: 100%/1   | Total: 56m 16s | Avg: 56m 16s | Max: 56m 16s
      🟩 Clang16            Pass: 100%/1   | Total: 57m 54s | Avg: 57m 54s | Max: 57m 54s
      🟩 Clang17            Pass: 100%/1   | Total: 55m 06s | Avg: 55m 06s | Max: 55m 06s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 58m | Avg: 51m 11s | Max:  1h 03m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 09s | Max: 57m 43s
      🟩 GCC8               Pass: 100%/1   | Total: 59m 02s | Avg: 59m 02s | Max: 59m 02s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max:  1h 00m
      🟩 GCC10              Pass: 100%/1   | Total: 58m 19s | Avg: 58m 19s | Max: 58m 19s
      🟩 GCC11              Pass: 100%/1   | Total: 52m 33s | Avg: 52m 33s | Max: 52m 33s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 47m | Avg: 35m 41s | Max:  1h 00m
      🔍 GCC13              Pass:  87%/8   | Total:  4h 39m | Avg: 34m 56s | Max: 57m 03s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 10m | Hits: 338%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 13m | Hits: 338%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/14  | Total: 12h 29m | Avg: 53m 33s | Max:  1h 03m
      🔍 GCC                Pass:  94%/18  | Total: 13h 06m | Avg: 43m 41s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 37m | Avg:  1h 09m | Max:  1h 13m | Hits: 338%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 46m 16s | Avg: 23m 08s | Max: 26m 30s
      🔍 v100               Pass:  97%/36  | Total:  1d 07h | Avg: 52m 36s | Max:  1h 13m | Hits: 338%/3540  
    🔍 jobs: TestGPU 🔍
      🟩 Build              Pass: 100%/31  | Total:  1d 05h | Avg: 56m 36s | Max:  1h 13m | Hits: 338%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 28m 08s | Avg: 28m 08s | Max: 28m 08s
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 30s | Avg: 24m 30s | Max: 24m 30s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 30m | Avg: 30m 19s | Max: 39m 30s
      🔍 TestGPU            Pass:  50%/2   | Total: 41m 46s | Avg: 20m 53s | Max: 37m 08s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/14  | Total: 13h 52m | Avg: 59m 27s | Max:  1h 11m | Hits: 338%/2655  
      🔍 20                 Pass:  95%/24  | Total: 18h 27m | Avg: 46m 09s | Max:  1h 13m | Hits: 337%/885   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 46m 16s | Avg: 23m 08s | Max: 26m 30s
      🟩 90a                Pass: 100%/1   | Total: 24m 11s | Avg: 24m 11s | Max: 24m 11s
    
  • 🟨 thrust: Pass: 97%/37 | Total: 23h 10m | Avg: 37m 35s | Max: 1h 15m | Hits: 188%/9180

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/35  | Total: 22h 03m | Avg: 37m 49s | Max:  1h 15m | Hits: 188%/9180  
      🟩 arm64              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 31s | Max: 34m 22s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  3h 23m | Avg: 40m 45s | Max:  1h 06m | Hits: 140%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
      🔍 12.6               Pass:  96%/30  | Total: 17h 15m | Avg: 34m 31s | Max:  1h 15m | Hits: 200%/7344  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 59m 01s | Avg: 29m 30s | Max: 30m 24s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 23m | Avg: 40m 45s | Max:  1h 06m | Hits: 140%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
      🔍 nvcc12.6           Pass:  96%/28  | Total: 16h 16m | Avg: 34m 53s | Max:  1h 15m | Hits: 200%/7344  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 59m 01s | Avg: 29m 30s | Max: 30m 24s
      🔍 nvcc               Pass:  97%/35  | Total: 22h 11m | Avg: 38m 02s | Max:  1h 15m | Hits: 188%/9180  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 18s | Max: 35m 47s
      🟩 Clang15            Pass: 100%/1   | Total: 33m 31s | Avg: 33m 31s | Max: 33m 31s
      🟩 Clang16            Pass: 100%/1   | Total: 39m 39s | Avg: 39m 39s | Max: 39m 39s
      🟩 Clang17            Pass: 100%/1   | Total: 37m 58s | Avg: 37m 58s | Max: 37m 58s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 03m | Avg: 26m 15s | Max: 36m 32s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 35s | Max: 34m 03s
      🟩 GCC8               Pass: 100%/1   | Total: 38m 25s | Avg: 38m 25s | Max: 38m 25s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 28s | Max: 35m 46s
      🟩 GCC10              Pass: 100%/1   | Total: 36m 21s | Avg: 36m 21s | Max: 36m 21s
      🟩 GCC11              Pass: 100%/1   | Total: 37m 14s | Avg: 37m 14s | Max: 37m 14s
      🟩 GCC12              Pass: 100%/1   | Total: 41m 06s | Avg: 41m 06s | Max: 41m 06s
      🔍 GCC13              Pass:  87%/8   | Total:  3h 21m | Avg: 25m 09s | Max: 35m 35s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 06m | Hits: 145%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 01m | Avg:  1h 00m | Max:  1h 15m | Hits: 216%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/14  | Total:  7h 12m | Avg: 30m 52s | Max: 39m 39s
      🔍 GCC                Pass:  93%/16  | Total:  8h 12m | Avg: 30m 46s | Max: 41m 06s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 15m | Hits: 188%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
    🔍 jobs: TestGPU 🔍
      🟩 Build              Pass: 100%/31  | Total: 21h 33m | Avg: 41m 44s | Max:  1h 15m | Hits: 143%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 30s | Avg: 16m 30s | Max: 34m 18s | Hits: 365%/1836  
      🔍 TestGPU            Pass:  66%/3   | Total: 47m 17s | Avg: 15m 45s | Max: 17m 15s
    🟨 cmake_options
      🟨 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:  50%/2   | Total: 49m 14s | Avg: 24m 37s | Max: 31m 59s
    🟨 gpu
      🟨 v100               Pass:  97%/37  | Total: 23h 10m | Avg: 37m 35s | Max:  1h 15m | Hits: 188%/9180  
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 22m 01s | Avg: 22m 01s | Max: 22m 01s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 10h 28m | Avg: 44m 52s | Max:  1h 15m | Hits: 145%/5508  
      🟩 20                 Pass: 100%/21  | Total: 11h 53m | Avg: 33m 57s | Max:  1h 15m | Hits: 252%/3672  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 57s | Avg: 5m 58s | Max: 9m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
      🟩 Test               Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s
    
  • 🟩 python: Pass: 100%/1 | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 78)

# Runner
53 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

🟩 CI finished in 3h 24m: Pass: 100%/78 | Total: 2d 08h | Avg: 43m 38s | Max: 1h 15m | Hits: 230%/12720
  • 🟩 cub: Pass: 100%/38 | Total: 1d 08h | Avg: 51m 34s | Max: 1h 13m | Hits: 338%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  1d 06h | Avg: 51m 19s | Max:  1h 13m | Hits: 338%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  1h 51m | Avg: 55m 48s | Max: 57m 03s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 44m | Avg: 56m 56s | Max:  1h 02m | Hits: 338%/885   
      🟩 12.5               Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🟩 12.6               Pass: 100%/31  | Total:  1d 01h | Avg: 49m 56s | Max:  1h 13m | Hits: 338%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 03m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 44m | Avg: 56m 56s | Max:  1h 02m | Hits: 338%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🟩 nvcc12.6           Pass: 100%/29  | Total: 23h 48m | Avg: 49m 15s | Max:  1h 13m | Hits: 338%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 03m
      🟩 nvcc               Pass: 100%/36  | Total:  1d 06h | Avg: 51m 06s | Max:  1h 13m | Hits: 338%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 42m | Avg: 55m 31s | Max: 57m 55s
      🟩 Clang15            Pass: 100%/1   | Total: 56m 16s | Avg: 56m 16s | Max: 56m 16s
      🟩 Clang16            Pass: 100%/1   | Total: 57m 54s | Avg: 57m 54s | Max: 57m 54s
      🟩 Clang17            Pass: 100%/1   | Total: 55m 06s | Avg: 55m 06s | Max: 55m 06s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 58m | Avg: 51m 11s | Max:  1h 03m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 09s | Max: 57m 43s
      🟩 GCC8               Pass: 100%/1   | Total: 59m 02s | Avg: 59m 02s | Max: 59m 02s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max:  1h 00m
      🟩 GCC10              Pass: 100%/1   | Total: 58m 19s | Avg: 58m 19s | Max: 58m 19s
      🟩 GCC11              Pass: 100%/1   | Total: 52m 33s | Avg: 52m 33s | Max: 52m 33s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 47m | Avg: 35m 41s | Max:  1h 00m
      🟩 GCC13              Pass: 100%/8   | Total:  4h 58m | Avg: 37m 21s | Max: 57m 03s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 10m | Hits: 338%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 13m | Hits: 338%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total: 12h 29m | Avg: 53m 33s | Max:  1h 03m
      🟩 GCC                Pass: 100%/18  | Total: 13h 25m | Avg: 44m 45s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 37m | Avg:  1h 09m | Max:  1h 13m | Hits: 338%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 46m 16s | Avg: 23m 08s | Max: 26m 30s
      🟩 v100               Pass: 100%/36  | Total:  1d 07h | Avg: 53m 08s | Max:  1h 13m | Hits: 338%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  1d 05h | Avg: 56m 36s | Max:  1h 13m | Hits: 338%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 28m 08s | Avg: 28m 08s | Max: 28m 08s
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 30s | Avg: 24m 30s | Max: 24m 30s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 30m | Avg: 30m 19s | Max: 39m 30s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 01m | Avg: 30m 36s | Max: 37m 08s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 46m 16s | Avg: 23m 08s | Max: 26m 30s
      🟩 90a                Pass: 100%/1   | Total: 24m 11s | Avg: 24m 11s | Max: 24m 11s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 13h 52m | Avg: 59m 27s | Max:  1h 11m | Hits: 338%/2655  
      🟩 20                 Pass: 100%/24  | Total: 18h 47m | Avg: 46m 58s | Max:  1h 13m | Hits: 337%/885   
    
  • 🟩 thrust: Pass: 100%/37 | Total: 23h 06m | Avg: 37m 28s | Max: 1h 15m | Hits: 188%/9180

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 45m 01s | Avg: 22m 30s | Max: 31m 59s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total: 21h 59m | Avg: 37m 41s | Max:  1h 15m | Hits: 188%/9180  
      🟩 arm64              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 31s | Max: 34m 22s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 23m | Avg: 40m 45s | Max:  1h 06m | Hits: 140%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
      🟩 12.6               Pass: 100%/30  | Total: 17h 11m | Avg: 34m 23s | Max:  1h 15m | Hits: 200%/7344  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 59m 01s | Avg: 29m 30s | Max: 30m 24s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 23m | Avg: 40m 45s | Max:  1h 06m | Hits: 140%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
      🟩 nvcc12.6           Pass: 100%/28  | Total: 16h 12m | Avg: 34m 44s | Max:  1h 15m | Hits: 200%/7344  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 59m 01s | Avg: 29m 30s | Max: 30m 24s
      🟩 nvcc               Pass: 100%/35  | Total: 22h 07m | Avg: 37m 55s | Max:  1h 15m | Hits: 188%/9180  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 18s | Max: 35m 47s
      🟩 Clang15            Pass: 100%/1   | Total: 33m 31s | Avg: 33m 31s | Max: 33m 31s
      🟩 Clang16            Pass: 100%/1   | Total: 39m 39s | Avg: 39m 39s | Max: 39m 39s
      🟩 Clang17            Pass: 100%/1   | Total: 37m 58s | Avg: 37m 58s | Max: 37m 58s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 03m | Avg: 26m 15s | Max: 36m 32s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 35s | Max: 34m 03s
      🟩 GCC8               Pass: 100%/1   | Total: 38m 25s | Avg: 38m 25s | Max: 38m 25s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 28s | Max: 35m 46s
      🟩 GCC10              Pass: 100%/1   | Total: 36m 21s | Avg: 36m 21s | Max: 36m 21s
      🟩 GCC11              Pass: 100%/1   | Total: 37m 14s | Avg: 37m 14s | Max: 37m 14s
      🟩 GCC12              Pass: 100%/1   | Total: 41m 06s | Avg: 41m 06s | Max: 41m 06s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 17m | Avg: 24m 37s | Max: 35m 35s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 06m | Hits: 145%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 01m | Avg:  1h 00m | Max:  1h 15m | Hits: 216%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  7h 12m | Avg: 30m 52s | Max: 39m 39s
      🟩 GCC                Pass: 100%/16  | Total:  8h 08m | Avg: 30m 31s | Max: 41m 06s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 15m | Hits: 188%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total: 23h 06m | Avg: 37m 28s | Max:  1h 15m | Hits: 188%/9180  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total: 21h 33m | Avg: 41m 44s | Max:  1h 15m | Hits: 143%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 30s | Avg: 16m 30s | Max: 34m 18s | Hits: 365%/1836  
      🟩 TestGPU            Pass: 100%/3   | Total: 43m 04s | Avg: 14m 21s | Max: 16m 59s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 22m 01s | Avg: 22m 01s | Max: 22m 01s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 10h 28m | Avg: 44m 52s | Max:  1h 15m | Hits: 145%/5508  
      🟩 20                 Pass: 100%/21  | Total: 11h 53m | Avg: 33m 57s | Max:  1h 15m | Hits: 252%/3672  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 57s | Avg: 5m 58s | Max: 9m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
      🟩 Test               Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s
    
  • 🟩 python: Pass: 100%/1 | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    


🟩 CI finished in 1h 59m: Pass: 100%/78 | Total: 2d 06h | Avg: 41m 55s | Max: 1h 12m | Hits: 247%/12720
  • 🟩 cub: Pass: 100%/38 | Total: 1d 09h | Avg: 52m 10s | Max: 1h 12m | Hits: 339%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  1d 07h | Avg: 51m 57s | Max:  1h 12m | Hits: 339%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 56m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 01m | Avg:  1h 00m | Max:  1h 11m | Hits: 339%/885   
      🟩 12.5               Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 10m
      🟩 12.6               Pass: 100%/31  | Total:  1d 01h | Avg: 49m 47s | Max:  1h 12m | Hits: 339%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 01m | Avg:  1h 00m | Max:  1h 11m | Hits: 339%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 10m
      🟩 nvcc12.6           Pass: 100%/29  | Total: 23h 41m | Avg: 49m 01s | Max:  1h 12m | Hits: 339%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m
      🟩 nvcc               Pass: 100%/36  | Total:  1d 07h | Avg: 51m 41s | Max:  1h 12m | Hits: 339%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 52m | Avg: 58m 04s | Max: 59m 55s
      🟩 Clang15            Pass: 100%/1   | Total: 58m 51s | Avg: 58m 51s | Max: 58m 51s
      🟩 Clang16            Pass: 100%/1   | Total: 53m 52s | Avg: 53m 52s | Max: 53m 52s
      🟩 Clang17            Pass: 100%/1   | Total: 54m 24s | Avg: 54m 24s | Max: 54m 24s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 31m | Avg: 47m 18s | Max:  1h 03m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 08s | Max: 55m 42s
      🟩 GCC8               Pass: 100%/1   | Total: 59m 15s | Avg: 59m 15s | Max: 59m 15s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 09s | Max: 55m 45s
      🟩 GCC10              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC11              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC12              Pass: 100%/3   | Total:  1h 44m | Avg: 34m 54s | Max: 59m 08s
      🟩 GCC13              Pass: 100%/8   | Total:  5h 26m | Avg: 40m 50s | Max: 59m 30s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits: 339%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 12m | Hits: 339%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 10m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total: 12h 10m | Avg: 52m 10s | Max:  1h 03m
      🟩 GCC                Pass: 100%/18  | Total: 13h 51m | Avg: 46m 11s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 43m | Avg:  1h 10m | Max:  1h 12m | Hits: 339%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 10m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 45m 36s | Avg: 22m 48s | Max: 26m 12s
      🟩 v100               Pass: 100%/36  | Total:  1d 08h | Avg: 53m 48s | Max:  1h 12m | Hits: 339%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  1d 05h | Avg: 57m 38s | Max:  1h 12m | Hits: 339%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 37m 16s | Avg: 37m 16s | Max: 37m 16s
      🟩 GraphCapture       Pass: 100%/1   | Total: 28m 29s | Avg: 28m 29s | Max: 28m 29s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 17m | Avg: 25m 51s | Max: 34m 55s
      🟩 TestGPU            Pass: 100%/2   | Total: 52m 43s | Avg: 26m 21s | Max: 27m 37s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 45m 36s | Avg: 22m 48s | Max: 26m 12s
      🟩 90a                Pass: 100%/1   | Total: 24m 35s | Avg: 24m 35s | Max: 24m 35s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 14h 03m | Avg:  1h 00m | Max:  1h 12m | Hits: 339%/2655  
      🟩 20                 Pass: 100%/24  | Total: 18h 58m | Avg: 47m 27s | Max:  1h 12m | Hits: 338%/885   
    
  • 🟩 thrust: Pass: 100%/37 | Total: 20h 34m | Avg: 33m 21s | Max: 1h 04m | Hits: 211%/9180

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 40m 51s | Avg: 20m 25s | Max: 28m 59s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total: 19h 32m | Avg: 33m 29s | Max:  1h 04m | Hits: 211%/9180  
      🟩 arm64              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 06s | Max: 33m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 10m | Avg: 38m 10s | Max:  1h 00m | Hits: 173%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max: 59m 04s
      🟩 12.6               Pass: 100%/30  | Total: 15h 28m | Avg: 30m 56s | Max:  1h 04m | Hits: 221%/7344  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 56m 48s | Avg: 28m 24s | Max: 28m 53s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 10m | Avg: 38m 10s | Max:  1h 00m | Hits: 173%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max: 59m 04s
      🟩 nvcc12.6           Pass: 100%/28  | Total: 14h 31m | Avg: 31m 06s | Max:  1h 04m | Hits: 221%/7344  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 48s | Avg: 28m 24s | Max: 28m 53s
      🟩 nvcc               Pass: 100%/35  | Total: 19h 37m | Avg: 33m 38s | Max:  1h 04m | Hits: 211%/9180  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 14s | Max: 33m 17s
      🟩 Clang15            Pass: 100%/1   | Total: 34m 36s | Avg: 34m 36s | Max: 34m 36s
      🟩 Clang16            Pass: 100%/1   | Total: 31m 54s | Avg: 31m 54s | Max: 31m 54s
      🟩 Clang17            Pass: 100%/1   | Total: 32m 09s | Avg: 32m 09s | Max: 32m 09s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 48m | Avg: 24m 07s | Max: 32m 12s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 05s | Max: 33m 02s
      🟩 GCC8               Pass: 100%/1   | Total: 30m 12s | Avg: 30m 12s | Max: 30m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 08s | Max: 35m 23s
      🟩 GCC10              Pass: 100%/1   | Total: 34m 51s | Avg: 34m 51s | Max: 34m 51s
      🟩 GCC11              Pass: 100%/1   | Total: 31m 54s | Avg: 31m 54s | Max: 31m 54s
      🟩 GCC12              Pass: 100%/1   | Total: 35m 53s | Avg: 35m 53s | Max: 35m 53s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 57m | Avg: 22m 10s | Max: 33m 44s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 04m | Hits: 173%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 37m | Avg: 52m 21s | Max:  1h 03m | Hits: 237%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max: 59m 04s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  6h 32m | Avg: 28m 02s | Max: 34m 36s
      🟩 GCC                Pass: 100%/16  | Total:  7h 24m | Avg: 27m 47s | Max: 35m 53s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 41m | Avg: 56m 21s | Max:  1h 04m | Hits: 211%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max: 59m 04s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total: 20h 34m | Avg: 33m 21s | Max:  1h 04m | Hits: 211%/9180  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total: 19h 06m | Avg: 36m 59s | Max:  1h 04m | Hits: 173%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 59s | Avg: 17m 19s | Max: 36m 29s | Hits: 365%/1836  
      🟩 TestGPU            Pass: 100%/3   | Total: 35m 42s | Avg: 11m 54s | Max: 12m 51s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 19m 12s | Avg: 19m 12s | Max: 19m 12s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  9h 19m | Avg: 39m 55s | Max:  1h 04m | Hits: 173%/5508  
      🟩 20                 Pass: 100%/21  | Total: 10h 34m | Avg: 30m 13s | Max:  1h 03m | Hits: 269%/3672  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 53s | Avg: 4m 56s | Max: 8m 03s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 50s | Avg:  1m 50s | Max:  1m 50s
      🟩 Test               Pass: 100%/1   | Total:  8m 03s | Avg:  8m 03s | Max:  8m 03s
    
  • 🟩 python: Pass: 100%/1 | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 78)

# Runner
53 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber
Contributor Author

Do we need to add a benchmark for thrust.test.mismatch? Some of the addresses into constant memory changed, but the instructions are identical.

@bernhardmgruber
Contributor Author

> Do we need to add a benchmark for thrust.test.mismatch? Some of the addresses into constant memory changed, but the instructions are identical.

@gevtushenko says it's fine; we can proceed without a benchmark.

@bernhardmgruber bernhardmgruber enabled auto-merge (squash) January 23, 2025 09:53
Contributor

🟩 CI finished in 7h 13m: Pass: 100%/78 | Total: 2d 05h | Avg: 41m 08s | Max: 1h 11m | Hits: 234%/12708
  • 🟩 cub: Pass: 100%/38 | Total: 1d 07h | Avg: 49m 52s | Max: 1h 09m | Hits: 336%/3528

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  1d 05h | Avg: 49m 27s | Max:  1h 09m | Hits: 336%/3528  
      🟩 arm64              Pass: 100%/2   | Total:  1h 54m | Avg: 57m 27s | Max: 58m 36s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 49m | Avg: 57m 59s | Max:  1h 01m | Hits: 335%/882   
      🟩 12.5               Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
      🟩 12.6               Pass: 100%/31  | Total:  1d 00h | Avg: 47m 28s | Max:  1h 09m | Hits: 336%/2646  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 49m | Avg: 57m 59s | Max:  1h 01m | Hits: 335%/882   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
      🟩 nvcc12.6           Pass: 100%/29  | Total: 22h 30m | Avg: 46m 34s | Max:  1h 09m | Hits: 336%/2646  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m
      🟩 nvcc               Pass: 100%/36  | Total:  1d 05h | Avg: 49m 17s | Max:  1h 09m | Hits: 336%/3528  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 39m | Avg: 54m 59s | Max: 57m 26s
      🟩 Clang15            Pass: 100%/1   | Total: 53m 05s | Avg: 53m 05s | Max: 53m 05s
      🟩 Clang16            Pass: 100%/1   | Total: 56m 28s | Avg: 56m 28s | Max: 56m 28s
      🟩 Clang17            Pass: 100%/1   | Total: 54m 57s | Avg: 54m 57s | Max: 54m 57s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 34m | Avg: 47m 47s | Max:  1h 04m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 56m | Avg: 58m 24s | Max: 59m 46s
      🟩 GCC8               Pass: 100%/1   | Total: 55m 12s | Avg: 55m 12s | Max: 55m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 51m | Avg: 55m 58s | Max: 56m 01s
      🟩 GCC10              Pass: 100%/1   | Total: 53m 30s | Avg: 53m 30s | Max: 53m 30s
      🟩 GCC11              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC12              Pass: 100%/3   | Total:  1h 39m | Avg: 33m 15s | Max: 53m 57s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 45m | Avg: 35m 38s | Max: 59m 06s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits: 335%/1764  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m | Hits: 337%/1764  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total: 11h 59m | Avg: 51m 21s | Max:  1h 04m
      🟩 GCC                Pass: 100%/18  | Total: 13h 02m | Avg: 43m 27s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 20m | Avg:  1h 05m | Max:  1h 09m | Hits: 336%/3528  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 07m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 45m 48s | Avg: 22m 54s | Max: 26m 17s
      🟩 v100               Pass: 100%/36  | Total:  1d 06h | Avg: 51m 22s | Max:  1h 09m | Hits: 336%/3528  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  1d 05h | Avg: 56m 10s | Max:  1h 09m | Hits: 336%/3528  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 19s | Avg: 24m 19s | Max: 24m 19s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 33s | Avg: 16m 33s | Max: 16m 33s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 05m | Avg: 21m 57s | Max: 24m 38s
      🟩 TestGPU            Pass: 100%/2   | Total: 47m 10s | Avg: 23m 35s | Max: 24m 38s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 45m 48s | Avg: 22m 54s | Max: 26m 17s
      🟩 90a                Pass: 100%/1   | Total: 23m 43s | Avg: 23m 43s | Max: 23m 43s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 13h 48m | Avg: 59m 12s | Max:  1h 09m | Hits: 336%/2646  
      🟩 20                 Pass: 100%/24  | Total: 17h 46m | Avg: 44m 26s | Max:  1h 07m | Hits: 336%/882   
    
  • 🟩 thrust: Pass: 100%/37 | Total: 20h 57m | Avg: 33m 58s | Max: 1h 11m | Hits: 195%/9180

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 38m 47s | Avg: 19m 23s | Max: 26m 56s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total: 19h 58m | Avg: 34m 14s | Max:  1h 11m | Hits: 195%/9180  
      🟩 arm64              Pass: 100%/2   | Total: 58m 48s | Avg: 29m 24s | Max: 30m 54s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 21m | Avg: 40m 19s | Max:  1h 03m | Hits: 119%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  1h 53m | Avg: 56m 57s | Max: 57m 46s
      🟩 12.6               Pass: 100%/30  | Total: 15h 41m | Avg: 31m 23s | Max:  1h 11m | Hits: 215%/7344  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 57m 07s | Avg: 28m 33s | Max: 28m 58s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 21m | Avg: 40m 19s | Max:  1h 03m | Hits: 119%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 53m | Avg: 56m 57s | Max: 57m 46s
      🟩 nvcc12.6           Pass: 100%/28  | Total: 14h 44m | Avg: 31m 35s | Max:  1h 11m | Hits: 215%/7344  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 57m 07s | Avg: 28m 33s | Max: 28m 58s
      🟩 nvcc               Pass: 100%/35  | Total: 19h 59m | Avg: 34m 17s | Max:  1h 11m | Hits: 195%/9180  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 10m | Avg: 32m 30s | Max: 34m 47s
      🟩 Clang15            Pass: 100%/1   | Total: 31m 24s | Avg: 31m 24s | Max: 31m 24s
      🟩 Clang16            Pass: 100%/1   | Total: 32m 19s | Avg: 32m 19s | Max: 32m 19s
      🟩 Clang17            Pass: 100%/1   | Total: 30m 53s | Avg: 30m 53s | Max: 30m 53s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 50m | Avg: 24m 21s | Max: 31m 16s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 12m | Avg: 36m 29s | Max: 39m 07s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 18s | Avg: 31m 18s | Max: 31m 18s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 55s | Max: 33m 09s
      🟩 GCC10              Pass: 100%/1   | Total: 32m 52s | Avg: 32m 52s | Max: 32m 52s
      🟩 GCC11              Pass: 100%/1   | Total: 35m 29s | Avg: 35m 29s | Max: 35m 29s
      🟩 GCC12              Pass: 100%/1   | Total: 34m 17s | Avg: 34m 17s | Max: 34m 17s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 02m | Avg: 22m 48s | Max: 36m 22s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits: 141%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 46m | Avg: 55m 23s | Max:  1h 11m | Hits: 232%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 53m | Avg: 56m 57s | Max: 57m 46s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  6h 35m | Avg: 28m 13s | Max: 34m 47s
      🟩 GCC                Pass: 100%/16  | Total:  7h 33m | Avg: 28m 19s | Max: 39m 07s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 54m | Avg: 58m 58s | Max:  1h 11m | Hits: 195%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 57s | Max: 57m 46s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total: 20h 57m | Avg: 33m 58s | Max:  1h 11m | Hits: 195%/9180  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total: 19h 22m | Avg: 37m 29s | Max:  1h 11m | Hits: 153%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 47m 45s | Avg: 15m 55s | Max: 32m 29s | Hits: 365%/1836  
      🟩 TestGPU            Pass: 100%/3   | Total: 46m 55s | Avg: 15m 38s | Max: 18m 32s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 19m 23s | Avg: 19m 23s | Max: 19m 23s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  9h 32m | Avg: 40m 53s | Max:  1h 04m | Hits: 150%/5508  
      🟩 20                 Pass: 100%/21  | Total: 10h 45m | Avg: 30m 45s | Max:  1h 11m | Hits: 264%/3672  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 19s | Avg: 4m 39s | Max: 7m 21s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 21s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 21s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 21s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 21s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 21s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 21s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 21s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
      🟩 Test               Pass: 100%/1   | Total:  7m 21s | Avg:  7m 21s | Max:  7m 21s
    
  • 🟩 python: Pass: 100%/1 | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 47m 45s | Avg: 47m 45s | Max: 47m 45s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 78)

# Runner
53 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber bernhardmgruber merged commit 35df3a9 into NVIDIA:main Jan 23, 2025
91 of 94 checks passed
Contributor

Git push to origin failed for branch/2.8.x with exitcode 128

@bernhardmgruber bernhardmgruber deleted the depr_thrust_cuda_iter branch January 23, 2025 16:16
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Jan 23, 2025
davebayer pushed a commit to davebayer/cccl that referenced this pull request Jan 29, 2025
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Jan 30, 2025
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Jan 30, 2025
@bernhardmgruber bernhardmgruber mentioned this pull request Feb 5, 2025
36 tasks
Labels
backport branch/2.8.x thrust For all items related to Thrust.