[HLSL] implement `lerp` intrinsic #70102

llvm-beanz · 2023-10-24T19:17:28Z

Implement HLSL lerp intrinsic:

https://learn.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl-lerp

The text was updated successfully, but these errors were encountered:

This is the start of implementing the lerp intrinsic https://learn.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl-lerp Builtins.td - defines the builtin hlsl_intrinsics.h - defines the lerp api DiagnosticSemaKinds.td - needed a new error to be inclusive for more than two operands. CGBuiltin.cpp - add the lerp intrinsic lowering SemaChecking.cpp - type checks for lerp builtin IntrinsicsDirectX.td - define the lerp intrinsic this change implements the first half of llvm#70102

This is the start of implementing the lerp intrinsic https://learn.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl-lerp Builtins.td - defines the builtin hlsl_intrinsics.h - defines the lerp api DiagnosticSemaKinds.td - needed a new error to be inclusive for more than two operands. CGBuiltin.cpp - add the lerp intrinsic lowering SemaChecking.cpp - type checks for lerp builtin IntrinsicsDirectX.td - define the lerp intrinsic this change implements the first half of #70102 Co-authored-by: Xiang Li <[email protected]>

farzonl · 2024-03-01T22:08:59Z

The remaining work here is elementwise lerps.

Unlike other elementwise operations lerp tends to do an odd grouping.

for example:
https://godbolt.org/z/ses881sx7

export float3 dot1(float3 x, float3 y, float3 z) {
    return lerp(x, y, z);
}

This groups the z vector and extracts all its elements first

  %z.i0 = extractelement <3 x float> %z, i32 0, !dbg !41 ; line:3 col:12
  %z.i1 = extractelement <3 x float> %z, i32 1, !dbg !41 ; line:3 col:12
  %z.i2 = extractelement <3 x float> %z, i32 2, !dbg !41 ; line:3 col:12
  %x.i0 = extractelement <3 x float> %x, i32 0, !dbg !41 ; line:3 col:12
  %y.i0 = extractelement <3 x float> %y, i32 0, !dbg !41 ; line:3 col:12

It also groups the multiplies and adds

  %.i01 = fmul fast float %z.i0, %.i0, !dbg !41 ; line:3 col:12
  %.i12 = fmul fast float %z.i1, %.i1, !dbg !41 ; line:3 col:12
  %.i23 = fmul fast float %z.i2, %.i2, !dbg !41 ; line:3 col:12
  %.i04 = fadd fast float %.i01, %x.i0, !dbg !41 ; line:3 col:12
  %.i15 = fadd fast float %.i12, %x.i1, !dbg !41 ; line:3 col:12
  %.i26 = fadd fast float %.i23, %x.i2, !dbg !41 ; line:3 col:12

llvm-beanz · 2024-03-01T22:25:34Z

We do not need clang to match the instruction ordering emitted by DXC.

This change implements lowering for llvm#70076, llvm#70100, llvm#70072, & llvm#70102 CGBuiltin.cpp - - simplify lerp intrinsic IntrinsicsDirectX.td - simplify lerp intrinsic SemaChecking.cpp - remove unnecessary check DXILIntrinsicExpansion.* - add intrinsic to instruction expansion cases DXILOpLowering.cpp - make sure DXILIntrinsicExpansion happens first DirectX.h - changes to support new pass DirectXTargetMachine.cpp - changes to support new pass

This change implements lowering for llvm#70076, llvm#70100, llvm#70072, & llvm#70102 `CGBuiltin.cpp` - - simplify `lerp` intrinsic `IntrinsicsDirectX.td` - simplify `lerp` intrinsic `SemaChecking.cpp` - remove unnecessary check `DXILIntrinsicExpansion.*` - add intrinsic to instruction expansion cases `DXILOpLowering.cpp` - make sure `DXILIntrinsicExpansion` happens first `DirectX.h` - changes to support new pass `DirectXTargetMachine.cpp` - changes to support new pass Why `any`, and `lerp` as instruction expansion just for DXIL? - SPIR-V there is an [OpAny](https://registry.khronos.org/SPIR-V/specs/unified1/SPIRV.html#OpAny) - SPIR-V has a GLSL lerp extension via [Fmix](https://registry.khronos.org/SPIR-V/specs/1.0/GLSL.std.450.html#FMix) Why `exp` instruction expansion? - We have an `exp2` opcode and `exp` reuses that opcode. So instruction expansion is a convenient way to do preprocessing. - Further SPIR-V has a GLSL exp extension via [Exp](https://registry.khronos.org/SPIR-V/specs/1.0/GLSL.std.450.html#Exp) and [Exp2](https://registry.khronos.org/SPIR-V/specs/1.0/GLSL.std.450.html#Exp2) Why `rcp` as instruction expansion? This one is a bit of the odd man out and might have to move to `cgbuiltins` when we better understand SPIRV requirements. However I included it because it seems like [fast math mode has an AllowRecip flag](https://registry.khronos.org/SPIR-V/specs/unified1/SPIRV.html#_fp_fast_math_mode) which lets you compute the reciprocal without performing the division. We don't have that in DXIL so thought to include it.

This change implements lowering for #70076, #70100, #70072, & #70102 `CGBuiltin.cpp` - - simplify `lerp` intrinsic `IntrinsicsDirectX.td` - simplify `lerp` intrinsic `SemaChecking.cpp` - remove unnecessary check `DXILIntrinsicExpansion.*` - add intrinsic to instruction expansion cases `DXILOpLowering.cpp` - make sure `DXILIntrinsicExpansion` happens first `DirectX.h` - changes to support new pass `DirectXTargetMachine.cpp` - changes to support new pass Why `any`, and `lerp` as instruction expansion just for DXIL? - SPIR-V there is an [OpAny](https://registry.khronos.org/SPIR-V/specs/unified1/SPIRV.html#OpAny) - SPIR-V has a GLSL lerp extension via [Fmix](https://registry.khronos.org/SPIR-V/specs/1.0/GLSL.std.450.html#FMix) Why `exp` instruction expansion? - We have an `exp2` opcode and `exp` reuses that opcode. So instruction expansion is a convenient way to do preprocessing. - Further SPIR-V has a GLSL exp extension via [Exp](https://registry.khronos.org/SPIR-V/specs/1.0/GLSL.std.450.html#Exp) and [Exp2](https://registry.khronos.org/SPIR-V/specs/1.0/GLSL.std.450.html#Exp2) Why `rcp` as instruction expansion? This one is a bit of the odd man out and might have to move to `cgbuiltins` when we better understand SPIRV requirements. However I included it because it seems like [fast math mode has an AllowRecip flag](https://registry.khronos.org/SPIR-V/specs/unified1/SPIRV.html#_fp_fast_math_mode) which lets you compute the reciprocal without performing the division. We don't have that in DXIL so thought to include it.

llvmbot · 2024-03-15T00:42:08Z

@llvm/issue-subscribers-clang-frontend

Author: Chris B (llvm-beanz)

Implement HLSL `lerp` intrinsic:

https://learn.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl-lerp

llvmbot · 2024-03-15T00:42:13Z

@llvm/issue-subscribers-clang-codegen

Author: Chris B (llvm-beanz)

Implement HLSL `lerp` intrinsic:

https://learn.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl-lerp

llvm-beanz added this to HLSL Support Oct 24, 2023

llvm-beanz converted this from a draft issue Oct 24, 2023

llvm-beanz added the HLSL HLSL Language Support label Oct 24, 2023

davidcook-msft self-assigned this Feb 26, 2024

farzonl mentioned this issue Feb 26, 2024

[HLSL] implementation of lerp intrinsic #83077

Merged

farzonl assigned farzonl and unassigned davidcook-msft Mar 1, 2024

farzonl moved this to In Progress in HLSL Support Mar 1, 2024

farzonl mentioned this issue Mar 8, 2024

[DXIL] exp, any, lerp, & rcp Intrinsic Lowering #84526

Merged

farzonl moved this from In Progress to Needs Review in HLSL Support Mar 8, 2024

damyanp moved this from Needs Review to Active in HLSL Support Mar 8, 2024

farzonl linked a pull request Mar 12, 2024 that will close this issue

[DXIL] exp, any, lerp, & rcp Intrinsic Lowering #84526

Merged

farzonl closed this as completed in #84526 Mar 15, 2024

github-project-automation bot moved this from Active to Done in HLSL Support Mar 15, 2024

EugeneZelenko added clang:frontend Language frontend issues, e.g. anything involving "Sema" clang:codegen IR generation bugs: mangling, exceptions, etc. backend:DirectX labels Mar 15, 2024

This was referenced Aug 5, 2024

[workstream] Intrinsics llvm/wg-hlsl#28

Open

Intrinsics used by particle_life.hlsl are implemented llvm/wg-hlsl#29

Closed

Intrinsics used by DML shaders are implemented llvm/wg-hlsl#30

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[HLSL] implement `lerp` intrinsic #70102

[HLSL] implement `lerp` intrinsic #70102

llvm-beanz commented Oct 24, 2023

farzonl commented Mar 1, 2024

llvm-beanz commented Mar 1, 2024

llvmbot commented Mar 15, 2024

llvmbot commented Mar 15, 2024

[HLSL] implement lerp intrinsic #70102

[HLSL] implement lerp intrinsic #70102

Comments

llvm-beanz commented Oct 24, 2023

farzonl commented Mar 1, 2024

llvm-beanz commented Mar 1, 2024

llvmbot commented Mar 15, 2024

llvmbot commented Mar 15, 2024

[HLSL] implement `lerp` intrinsic #70102

[HLSL] implement `lerp` intrinsic #70102