Releases: JuliaGPU/GPUCompiler.jl

v0.26.1

21 Feb 18:15
e4b39b3

GPUCompiler v0.26.1

Diff since v0.26.0

Merged pull requests:

v0.26.0

20 Feb 12:16
a95fd25

GPUCompiler v0.26.0

Diff since v0.25.0

Merged pull requests:

Closed issues:

  • Overlay methods disabling IR interpreter breaks const-prop of GPU-incompatible code (#384)
  • GCN: Assertion failure due to stack address space (#504)
  • Optimizer regression on 1.11 (#506)
  • propagate_julia_addrsp! is not defined (#531)

v0.25.0

29 Sep 11:36
45c4325

GPUCompiler v0.25.0

Breaking change: the PTX back-end no longer replaces trap with exit. This probably only matters to CUDA.jl, which now has to emit exit in its signal handler itself, to avoid tripping over the trap that probably follows it.

Diff since v0.24.5

Merged pull requests:

  • Update manifest (#522) (@github-actions[bot])
  • PTX: Improve handling of trap (#524) (@maleadt)

v0.24.5

22 Sep 09:31
324d83d

GPUCompiler v0.24.5

Diff since v0.24.4

Merged pull requests:

  • Only warn about missed transformations when debug output is enabled. (#521) (@maleadt)

v0.24.4

19 Sep 12:00
b23fe47

GPUCompiler v0.24.4

Diff since v0.24.3

Merged pull requests:

v0.24.3

14 Sep 12:56
a607fd5

GPUCompiler v0.24.3

Diff since v0.24.2

Merged pull requests:

v0.24.2

13 Sep 12:10
947507e

GPUCompiler v0.24.2

Diff since v0.24.0

Merged pull requests:

v0.24.1

12 Sep 13:03
c916b4e

GPUCompiler v0.24.1

Diff since v0.24.0

Merged pull requests:

v0.24.0

12 Sep 08:35
f28fcdc

GPUCompiler v0.24.0

Diff since v0.23.0

Closed issues:

  • compile(:asm) returns LLVM IR without requiring a context (#420)

Merged pull requests:

v0.23.0

29 Aug 11:26
ffbedbb

GPUCompiler v0.23.0

Diff since v0.22.0

Closed issues:

  • How to set fast math for CUDA (#491)

Merged pull requests: