Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Run thrust::transform benchmarks with more elements #2764

Merged
merged 1 commit into from
Nov 12, 2024

Conversation

bernhardmgruber
Copy link
Contributor

Run the BabelStream thrust::transform benchmarks with 2^25 and 2^31 elements by default.

Copy link
Contributor

🟩 CI finished in 2h 40m: Pass: 100%/222 | Total: 5d 00h | Avg: 32m 28s | Max: 1h 26m | Hits: 79%/16128
  • 🟩 cub: Pass: 100%/110 | Total: 3d 16h | Avg: 48m 31s | Max: 1h 26m | Hits: 66%/2948

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total:  3d 10h | Avg: 48m 20s | Max:  1h 26m | Hits:  66%/2948  
      🟩 arm64              Pass: 100%/8   | Total:  6h 48m | Avg: 51m 00s | Max: 53m 00s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total: 10h 55m | Avg: 43m 41s | Max: 52m 54s | Hits:  66%/737   
      🟩 11.8               Pass: 100%/3   | Total:  3h 15m | Avg:  1h 05m | Max:  1h 06m
      🟩 12.5               Pass: 100%/4   | Total:  4h 07m | Avg:  1h 01m | Max:  1h 05m
      🟩 12.6               Pass: 100%/88  | Total:  2d 22h | Avg: 48m 10s | Max:  1h 26m | Hits:  66%/2211  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  3h 26m | Avg: 51m 35s | Max: 52m 28s
      🟩 nvcc11.1           Pass: 100%/15  | Total: 10h 55m | Avg: 43m 41s | Max: 52m 54s | Hits:  66%/737   
      🟩 nvcc11.8           Pass: 100%/3   | Total:  3h 15m | Avg:  1h 05m | Max:  1h 06m
      🟩 nvcc12.5           Pass: 100%/4   | Total:  4h 07m | Avg:  1h 01m | Max:  1h 05m
      🟩 nvcc12.6           Pass: 100%/84  | Total:  2d 19h | Avg: 48m 01s | Max:  1h 26m | Hits:  66%/2211  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  3h 26m | Avg: 51m 35s | Max: 52m 28s
      🟩 nvcc               Pass: 100%/106 | Total:  3d 13h | Avg: 48m 24s | Max:  1h 26m | Hits:  66%/2948  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  5h 02m | Avg: 50m 25s | Max: 57m 04s
      🟩 Clang10            Pass: 100%/3   | Total:  2h 39m | Avg: 53m 08s | Max: 57m 26s
      🟩 Clang11            Pass: 100%/4   | Total:  3h 26m | Avg: 51m 42s | Max: 56m 53s
      🟩 Clang12            Pass: 100%/4   | Total:  3h 21m | Avg: 50m 18s | Max: 51m 18s
      🟩 Clang13            Pass: 100%/4   | Total:  3h 30m | Avg: 52m 31s | Max: 55m 31s
      🟩 Clang14            Pass: 100%/4   | Total:  3h 15m | Avg: 48m 55s | Max: 49m 43s
      🟩 Clang15            Pass: 100%/4   | Total:  3h 25m | Avg: 51m 20s | Max: 56m 13s
      🟩 Clang16            Pass: 100%/4   | Total:  3h 23m | Avg: 50m 54s | Max: 55m 04s
      🟩 Clang17            Pass: 100%/4   | Total:  3h 35m | Avg: 53m 47s | Max: 55m 20s
      🟩 Clang18            Pass: 100%/11  | Total:  8h 29m | Avg: 46m 21s | Max: 53m 00s
      🟩 GCC6               Pass: 100%/2   | Total:  1h 25m | Avg: 42m 31s | Max: 43m 57s
      🟩 GCC7               Pass: 100%/6   | Total:  4h 31m | Avg: 45m 13s | Max: 51m 21s
      🟩 GCC8               Pass: 100%/6   | Total:  4h 27m | Avg: 44m 30s | Max: 51m 04s
      🟩 GCC9               Pass: 100%/6   | Total:  4h 33m | Avg: 45m 31s | Max: 51m 06s
      🟩 GCC10              Pass: 100%/4   | Total:  3h 12m | Avg: 48m 09s | Max: 51m 50s
      🟩 GCC11              Pass: 100%/7   | Total:  6h 24m | Avg: 54m 58s | Max:  1h 06m
      🟩 GCC12              Pass: 100%/4   | Total:  3h 13m | Avg: 48m 22s | Max: 49m 13s
      🟩 GCC13              Pass: 100%/16  | Total: 10h 13m | Avg: 38m 19s | Max:  1h 26m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  2h 53m | Avg: 57m 42s | Max: 58m 02s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 52m 54s | Avg: 52m 54s | Max: 52m 54s | Hits:  66%/737   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 47s | Max:  1h 01m | Hits:  66%/1474  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 57m 05s | Avg: 57m 05s | Max: 57m 05s | Hits:  66%/737   
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  4h 07m | Avg:  1h 01m | Max:  1h 05m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  1d 16h | Avg: 50m 12s | Max: 57m 26s
      🟩 GCC                Pass: 100%/51  | Total:  1d 14h | Avg: 44m 43s | Max:  1h 26m
      🟩 Intel              Pass: 100%/3   | Total:  2h 53m | Avg: 57m 42s | Max: 58m 02s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 47m | Avg: 56m 53s | Max:  1h 01m | Hits:  66%/2948  
      🟩 NVHPC              Pass: 100%/4   | Total:  4h 07m | Avg:  1h 01m | Max:  1h 05m
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total:  3d 16h | Avg: 48m 31s | Max:  1h 26m | Hits:  66%/2948  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  3d 12h | Avg: 49m 35s | Max:  1h 06m | Hits:  66%/2948  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 16m 52s | Avg: 16m 52s | Max: 16m 52s
      🟩 GraphCapture       Pass: 100%/1   | Total: 14m 42s | Avg: 14m 42s | Max: 14m 42s
      🟩 HostLaunch         Pass: 100%/3   | Total:  2h 19m | Avg: 46m 21s | Max:  1h 26m
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 49m | Avg: 36m 39s | Max: 44m 18s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total:  3h 15m | Avg:  1h 05m | Max:  1h 06m
      🟩 90a                Pass: 100%/4   | Total:  1h 19m | Avg: 19m 56s | Max: 21m 11s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  1d 00h | Avg: 48m 27s | Max:  1h 06m
      🟩 14                 Pass: 100%/29  | Total:  1d 00h | Avg: 49m 57s | Max:  1h 05m | Hits:  66%/1474  
      🟩 17                 Pass: 100%/27  | Total: 22h 18m | Avg: 49m 34s | Max:  1h 03m | Hits:  66%/737   
      🟩 20                 Pass: 100%/24  | Total: 18h 17m | Avg: 45m 43s | Max:  1h 26m | Hits:  66%/737   
    
  • 🟩 thrust: Pass: 100%/109 | Total: 1d 06h | Avg: 16m 56s | Max: 1h 20m | Hits: 82%/13180

    🟩 cpu
      🟩 amd64              Pass: 100%/101 | Total:  1d 06h | Avg: 17m 50s | Max:  1h 20m | Hits:  82%/13180 
      🟩 arm64              Pass: 100%/8   | Total: 44m 10s | Avg:  5m 31s | Max:  6m 00s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  3h 05m | Avg: 12m 23s | Max: 52m 10s | Hits:  77%/2636  
      🟩 11.8               Pass: 100%/3   | Total: 57m 40s | Avg: 19m 13s | Max: 45m 22s
      🟩 12.5               Pass: 100%/4   | Total:  3h 47m | Avg: 56m 49s | Max:  1h 02m
      🟩 12.6               Pass: 100%/87  | Total: 22h 55m | Avg: 15m 48s | Max:  1h 20m | Hits:  83%/10544 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  5m 48s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  3h 05m | Avg: 12m 23s | Max: 52m 10s | Hits:  77%/2636  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 57m 40s | Avg: 19m 13s | Max: 45m 22s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  3h 47m | Avg: 56m 49s | Max:  1h 02m
      🟩 nvcc12.6           Pass: 100%/83  | Total: 22h 34m | Avg: 16m 18s | Max:  1h 20m | Hits:  83%/10544 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  5m 48s
      🟩 nvcc               Pass: 100%/105 | Total:  1d 06h | Avg: 17m 22s | Max:  1h 20m | Hits:  82%/13180 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  3h 07m | Avg: 31m 17s | Max: 43m 09s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 37m | Avg: 32m 30s | Max: 37m 00s
      🟩 Clang11            Pass: 100%/4   | Total:  2h 14m | Avg: 33m 41s | Max: 40m 24s
      🟩 Clang12            Pass: 100%/4   | Total:  2h 11m | Avg: 32m 47s | Max: 42m 16s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 59m | Avg: 29m 52s | Max: 31m 32s
      🟩 Clang14            Pass: 100%/4   | Total: 24m 29s | Avg:  6m 07s | Max:  6m 54s
      🟩 Clang15            Pass: 100%/4   | Total: 25m 08s | Avg:  6m 17s | Max:  7m 16s
      🟩 Clang16            Pass: 100%/4   | Total: 23m 08s | Avg:  5m 47s | Max:  5m 57s
      🟩 Clang17            Pass: 100%/4   | Total: 23m 43s | Avg:  5m 55s | Max:  6m 21s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 12m | Avg:  6m 34s | Max: 15m 47s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 20s | Avg:  4m 10s | Max:  4m 17s
      🟩 GCC7               Pass: 100%/6   | Total: 29m 52s | Avg:  4m 58s | Max:  6m 02s
      🟩 GCC8               Pass: 100%/6   | Total: 31m 02s | Avg:  5m 10s | Max:  6m 21s
      🟩 GCC9               Pass: 100%/6   | Total: 31m 32s | Avg:  5m 15s | Max:  6m 06s
      🟩 GCC10              Pass: 100%/4   | Total: 22m 40s | Avg:  5m 40s | Max:  6m 05s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 22m | Avg: 11m 49s | Max: 45m 22s
      🟩 GCC12              Pass: 100%/4   | Total: 26m 45s | Avg:  6m 41s | Max:  7m 40s
      🟩 GCC13              Pass: 100%/14  | Total:  3h 10m | Avg: 13m 38s | Max:  1h 20m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 56m | Avg: 38m 46s | Max: 42m 24s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 52m 10s | Avg: 52m 10s | Max: 52m 10s | Hits:  77%/2636  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 35s | Max: 53m 49s | Hits:  77%/5272  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 20m | Avg: 40m 01s | Max: 59m 49s | Hits:  88%/5272  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  3h 47m | Avg: 56m 49s | Max:  1h 02m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total: 13h 59m | Avg: 17m 29s | Max: 43m 09s
      🟩 GCC                Pass: 100%/49  | Total:  7h 03m | Avg:  8m 39s | Max:  1h 20m
      🟩 Intel              Pass: 100%/3   | Total:  1h 56m | Avg: 38m 46s | Max: 42m 24s
      🟩 MSVC               Pass: 100%/5   | Total:  3h 59m | Avg: 47m 52s | Max: 59m 49s | Hits:  82%/13180 
      🟩 NVHPC              Pass: 100%/4   | Total:  3h 47m | Avg: 56m 49s | Max:  1h 02m
    🟩 gpu
      🟩 v100               Pass: 100%/109 | Total:  1d 06h | Avg: 16m 56s | Max:  1h 20m | Hits:  82%/13180 
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  1d 04h | Avg: 16m 33s | Max:  1h 02m | Hits:  77%/10544 
      🟩 TestCPU            Pass: 100%/4   | Total: 44m 29s | Avg: 11m 07s | Max: 20m 13s | Hits:  99%/2636  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 52m | Avg: 37m 24s | Max:  1h 20m
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 57m 40s | Avg: 19m 13s | Max: 45m 22s
      🟩 90a                Pass: 100%/4   | Total: 42m 33s | Avg: 10m 38s | Max: 27m 55s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  7h 45m | Avg: 15m 30s | Max:  1h 20m
      🟩 14                 Pass: 100%/29  | Total:  8h 38m | Avg: 17m 52s | Max: 57m 33s | Hits:  77%/5272  
      🟩 17                 Pass: 100%/27  | Total:  8h 11m | Avg: 18m 13s | Max:  1h 01m | Hits:  77%/2636  
      🟩 20                 Pass: 100%/23  | Total:  6h 10m | Avg: 16m 07s | Max:  1h 02m | Hits:  88%/5272  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 50s | Avg: 4m 25s | Max: 6m 45s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 45s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 45s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 45s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 45s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 45s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 45s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 45s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 05s | Avg:  2m 05s | Max:  2m 05s
      🟩 Test               Pass: 100%/1   | Total:  6m 45s | Avg:  6m 45s | Max:  6m 45s
    
  • 🟩 python: Pass: 100%/1 | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 56s | Avg: 14m 56s | Max: 14m 56s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 222)

# Runner
184 linux-amd64-cpu16
16 linux-arm64-cpu16
13 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber merged commit 284e104 into NVIDIA:main Nov 12, 2024
237 checks passed
@bernhardmgruber bernhardmgruber deleted the thrust_transf_bench branch November 12, 2024 10:08
fbusato pushed a commit to fbusato/cccl that referenced this pull request Nov 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

2 participants