Implement the `fmod` HLSL Function #99118

farzonl · 2024-07-16T19:57:08Z

NOTE: SPIRV implementation is already completed.
NOTE: The change here shold be a header only changes with test case updates done to
clang/test/SemaHLSL/BuiltIns and clang/test/CodeGenHLSL/builtins/fmod.hlsl. The SPIRV tests should remain unchanged and continue to work.

fmod is currently defined as

_HLSL_16BIT_AVAILABILITY(shadermodel, 6.2)
_HLSL_BUILTIN_ALIAS(__builtin_elementwise_fmod)
half fmod(half, half);
_HLSL_16BIT_AVAILABILITY(shadermodel, 6.2)
_HLSL_BUILTIN_ALIAS(__builtin_elementwise_fmod)
half2 fmod(half2, half2);
_HLSL_16BIT_AVAILABILITY(shadermodel, 6.2)
_HLSL_BUILTIN_ALIAS(__builtin_elementwise_fmod)
half3 fmod(half3, half3);
_HLSL_16BIT_AVAILABILITY(shadermodel, 6.2)
_HLSL_BUILTIN_ALIAS(__builtin_elementwise_fmod)
half4 fmod(half4, half4);

_HLSL_BUILTIN_ALIAS(__builtin_elementwise_fmod)
float fmod(float, float);
_HLSL_BUILTIN_ALIAS(__builtin_elementwise_fmod)
float2 fmod(float2, float2);
_HLSL_BUILTIN_ALIAS(__builtin_elementwise_fmod)
float3 fmod(float3, float3);
_HLSL_BUILTIN_ALIAS(__builtin_elementwise_fmod)
float4 fmod(float4, float4);

replace it with a templatized version

_HLSL_16BIT_AVAILABILITY(shadermodel, 6.2)
const inline half fmod(half X, half Y) {
  return __detail::fmod_impl(X, Y);
}

const inline float fmod(float X, float Y) {
  return __detail::fmod_impl(X, Y);
}
template <int N, typename = std::enable_if_t<(N > 1 && N <=4)>>
_HLSL_16BIT_AVAILABILITY(shadermodel, 6.2)
const inline vector<half, N> fmod(vector<half, N> X, vector<half, N> Y) {
  return __detail::fmod_vec_impl(X, Y);
}

template <int N, typename = std::enable_if_t<(N > 1 && N <=4)>>
const inline vector<float, N> fmod(vector<float, N> X, vector<float, N> Y) {
  return __detail::fmod_vec_impl(X, Y);
}

This should call a version in clang/lib/Headers/hlsl/hlsl_detail.h
that will do the target switching for us.

template <typename T>
constexpr enable_if_t<is_same<float, T>::value || is_same<half, T>::value, T>
fmod_impl(T X, T Y) {
#if !defined(__DirectX__)
  return __builtin_elementwise_fmod(X, Y));
#else 
  return /*insert algorithmic approach here*/;
#endif
}
template <typename T, int N>
constexpr vector<T, N> fmod_vec_impl(vector<T, N> X, vector<T, N> Y) {
#if !defined(__DirectX__)
  return __builtin_elementwise_fmod(X, Y));
#else 
  return /*insert algorithmic approach here*/;
#endif
}

DirectX

DXIL Opcode	DXIL OpName	Shader Model	Shader Stages
6, 22	FAbs, Frc	6.0	()

For reference the algorithm you are expected to implement in HLSL source shold match the DXC implementation.
Please find the reference implmentation linked here: https://github.com/microsoft/DirectXShaderCompiler/blob/c2ed9ad4ee775f3de903ce757c994aecc59a5306/lib/HLSL/HLOperationLower.cpp#L2253C1-L2271C2

  %1 = fdiv fast float %p1, %p2
  %2 = fsub fast float -0.000000e+00, %1
  %3 = fcmp fast oge float %1, %2
  %FAbs = call float @dx.op.unary.f32(i32 6, float %1)
  %Frc = call float @dx.op.unary.f32(i32 22, float %FAbs)
  %4 = fsub fast float -0.000000e+00, %Frc
  %5 = select i1 %3, float %Frc, float %4
  %6 = fmul fast float %5, %p2

SPIR-V

OpFRem:

Description:

The floating-point remainder whose sign matches the sign
of Operand 1.

Result Type must be a scalar or vector of floating-point
type.

The types of Operand 1 and Operand 2 both must be the same as
Result Type.

Results are computed per component. The resulting value is undefined if
Operand 2 is 0. Otherwise, the result is the remainder
r of Operand 1 divided by Operand 2 where if r ≠ 0, the sign of
r is the same as the sign of Operand 1.

Word Count	Opcode	Results	Operands
5	140	<id> Result Type	Result <id>	<id> Operand 1	<id> Operand 2

Test Case(s)

Example 1

//dxc fmod_test.hlsl -T lib_6_8 -enable-16bit-types -O0

export float4 fn(float4 p1, float4 p2) {
    return fmod(p1, p2);
}

HLSL:

Returns the floating-point remainder of x/y.

ret fmod(x, y)

Parameters

Item	Description
x	[in] The floating-point dividend.
y	[in] The floating-point divisor.

Return Value

The floating-point remainder of the x parameter divided by the y parameter.

Remarks

The floating-point remainder is calculated such that x = i * y + f, where i is an integer, f has the same sign as x, and the absolute value of f is less than the absolute value of y.

Type Description

Name	Template Type	Component Type	Size
x	scalar, vector, or matrix	float	any
y	same as input x	float	same dimension(s) as input x
ret	same as input x	float	same dimension(s) as input x

Minimum Shader Model

This function is supported in the following shader models.

Shader Model	Supported
Shader Model 2 (DirectX HLSL) and higher shader models	yes
Shader Model 1 (DirectX HLSL)	vs_1_1

Requirements

Requirement	Value
Header	Corecrt_math.h

This change implements the frontend for llvm#99118 Builtins.td - add the fmod builtin CGBuiltin.cpp - lower the builtin to llvm FRem instruction hlsl_intrinsics.h - add the fmod api SemaHLSL.cpp - add type checks for builtin clang/docs/LanguageExtensions.rst - add the builtin in *Elementwise Builtins*

This change implements the frontend for llvm#99118 Builtins.td - add the fmod builtin CGBuiltin.cpp - lower the builtin to llvm FRem instruction hlsl_intrinsics.h - add the fmod api SemaHLSL.cpp - add type checks for builtin clang/docs/LanguageExtensions.rst - add the builtin in *Elementwise Builtins* clang/docs/ReleaseNotes.rst - announce the builtin

This change add the elementwise fmod builtin to support HLSL function 'fmod' in clang for llvm#99118 Builtins.td - add the fmod builtin CGBuiltin.cpp - lower the builtin to llvm FRem instruction hlsl_intrinsics.h - add the fmod api SemaChecking.cpp - add type checks for builtin SemaHLSL.cpp - add HLSL type checks for builtin clang/docs/LanguageExtensions.rst - add the builtin in *Elementwise Builtins* clang/docs/ReleaseNotes.rst - announce the builtin

This change add the elementwise fmod builtin to support HLSL function 'fmod' in clang for #99118 Builtins.td - add the fmod builtin CGBuiltin.cpp - lower the builtin to llvm FRem instruction hlsl_intrinsics.h - add the fmod api SemaChecking.cpp - add type checks for builtin SemaHLSL.cpp - add HLSL type checks for builtin clang/docs/LanguageExtensions.rst - add the builtin in *Elementwise Builtins* clang/docs/ReleaseNotes.rst - announce the builtin

damyanp · 2024-10-21T21:49:09Z

We want to decide llvm/wg-hlsl#86 before we continue with this.

kmpeng · 2025-02-06T18:49:38Z

I'll be taking on this task.

farzonl · 2025-02-27T05:16:14Z

@kmpeng @V-FEXrt had some concerns about vec1s. I noticed we aren't restricing vector sizes and we probably should.
My idea is template <int N, typename = std::enable_if_t<(N > 1 && N <=4)>>. The only thing I'm afraid of is what happens when we want to suport long vectors and we want to limit it to shader model 6.9.

My idea for thst is the following:

template <int N, typename = std::enable_if_t<(N > 1 && N <=4)>>
const inline vector<float, N> fmod(vector<float, N> X, vector<float, N> Y) {
  return __detail::fmod_vec_impl(X, Y);
}
_HLSL_AVAILABILITY(shadermodel, 6.9)
template <int N, typename = std::enable_if_t<(N >4)>>
const inline vector<float, N> fmod(vector<float, N> X, vector<float, N> Y) {
     return __detail::fmod_vec_impl(X, Y);
}

I filed a bug to fix up the other intrinsics here: #129003

V-FEXrt · 2025-02-27T16:45:03Z

@farzonl yeah we were talking about that yesterday. The expectation is that a vec1 would get cast/scalarized right?

farzonl · 2025-02-27T17:00:28Z

@V-FEXrt not via this ticket. We haven’t supported vec1s for any of the non templatized intrinsics. We shouldn’t be adding it as an overload for fmod or any of the ones mentioned in the above mentioned bug. If they would get cast/scalarized/whatever that would be via overload resolution rules. We are probably screwing up those resolutions by having intrinsics that can take in vec1s

V-FEXrt · 2025-02-27T18:07:32Z

Got it, makes sense

farzonl added backend:DirectX backend:SPIR-V bot:HLSL HLSL HLSL Language Support metabug Issue to collect references to a group of similar or related issues. labels Jul 16, 2024

github-project-automation bot added this to HLSL Support Jul 16, 2024

farzonl mentioned this issue Jul 16, 2024

Implement the entire HLSL API set. #99235

Open

farzonl mentioned this issue Aug 7, 2024

Intrinsics used by DML shaders are implemented llvm/wg-hlsl#30

Closed

52 tasks

damyanp assigned lizhengxing Aug 29, 2024

lizhengxing mentioned this issue Sep 16, 2024

[HLSL] Implementation of the elementwise fmod builtin #108849

Merged

pow2clk moved this to Active in HLSL Support Sep 23, 2024

farzonl moved this from Active to Planning in HLSL Support Feb 4, 2025

farzonl unassigned lizhengxing Feb 4, 2025

damyanp moved this from Planning to Ready in HLSL Support Feb 4, 2025

damyanp moved this from Ready to Active in HLSL Support Feb 6, 2025

damyanp assigned kmpeng Feb 6, 2025

kmpeng mentioned this issue Mar 7, 2025

Implement the fmod intrinsic #130320

Merged

farzonl closed this as completed in #130320 Mar 12, 2025

farzonl closed this as completed in 184f944 Mar 12, 2025

github-project-automation bot moved this from Active to Closed in HLSL Support Mar 12, 2025

EugeneZelenko added the clang:headers Headers provided by Clang, e.g. for intrinsics label Mar 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement the `fmod` HLSL Function #99118

Implement the `fmod` HLSL Function #99118

farzonl commented Jul 16, 2024 •

edited

Loading

lizhengxing commented Aug 29, 2024

damyanp commented Oct 21, 2024

kmpeng commented Feb 6, 2025

farzonl commented Feb 27, 2025 •

edited

Loading

V-FEXrt commented Feb 27, 2025

farzonl commented Feb 27, 2025 •

edited

Loading

V-FEXrt commented Feb 27, 2025

Implement the fmod HLSL Function #99118

Implement the fmod HLSL Function #99118

Comments

farzonl commented Jul 16, 2024 • edited Loading

DirectX

SPIR-V

OpFRem:

Description:

Test Case(s)

Example 1

HLSL:

Parameters

Return Value

Remarks

Type Description

Minimum Shader Model

Requirements

See also

lizhengxing commented Aug 29, 2024

damyanp commented Oct 21, 2024

kmpeng commented Feb 6, 2025

farzonl commented Feb 27, 2025 • edited Loading

V-FEXrt commented Feb 27, 2025

farzonl commented Feb 27, 2025 • edited Loading

V-FEXrt commented Feb 27, 2025

Implement the `fmod` HLSL Function #99118

Implement the `fmod` HLSL Function #99118

farzonl commented Jul 16, 2024 •

edited

Loading

farzonl commented Feb 27, 2025 •

edited

Loading

farzonl commented Feb 27, 2025 •

edited

Loading