Implement always_inline as a CompilerJob property #373
In principle yes, although in practice the runtime probably won't contain functions with non-inlined calls.
Codecov Report
Base: 73.79% // Head: 63.67% // Decreases project coverage by 10.12%
Additional details and impacted files
@@             Coverage Diff             @@
##           master     #373       +/-   ##
===========================================
- Coverage   73.79%   63.67%   -10.12%
===========================================
  Files          24       24
  Lines        2816     2676     -140
===========================================
- Hits         2078     1704     -374
- Misses        738      972     +234
I'm not a fan of adding common properties to each back-end's Target struct, which up until now only contains target-specific elements. Wouldn't it be better to make this a property of the |
aa18542 to c53979b
Is this ready to merge? This has been working excellently for AMDGPU since the latest push.
f3e03a4 to fb006a5
GCN tests are failing because the expensive function always seems to be inlined. @jpsamaroo care to take a look?
It's not actually inlined (I have |
Using |
Doesn't that indicate a deeper problem?
@maleadt is it intended that |
Maybe, but the AMDGPU target is in frequent flux, so I wouldn't be surprised if it works fine on LLVM 15/16.
Yes, I suggested above putting it in the compiler job instead, as it isn't really a property of the target, is it? Does this cause any issues maybe?
Right, so I guess the fact that CUDA.jl is trying to pass it through the target just needs to be fixed. Makes sense!
No need to duplicate tests for identical functionality.
b88edc1 to 2ede3c3
@maleadt remind me, should we put `always_inline` into `runtime_slug`? I also noticed that `PTXCompilerTarget` implements `hash`, but `GCNCompilerTarget` doesn't.