[ACL] Convolution performance regression #2588
Comments
Hi @alvoron, how did you build oneDNN 3.6.2 with ACL 24.09? The minimum ACL version required for building oneDNN 3.6.2 is 24.11.1, as per here: Line 34 in 2eb3dd1
benchdnn ?
If I build oneDNN 3.6.2 with ACL 24.11.1 and oneDNN 3.5.3 with ACL 24.09 on Apple M3 Pro, I do not get the performance regression that you do. There is a small regression with 3.6.2, but given that the runtime is less than 1 ms, I don't think it is significant. This is the result from running the first reproducer with oneDNN 3.5.3:
And this is when running with oneDNN 3.6.2:
the first
@vpirogov why did I get this: o(╥﹏╥)o aarch64, oneDNN 3.4 and ACL 24.11
@Serenagirl, oneDNN v3.4 is rather old; I'd suggest investigating issues with the current version instead. If you want to understand why
@Serenagirl The following
Could you please try this reproducer on |
Sorry, I tested on aarch64, not Apple silicon. My test results with oneDNN 3.4 alone were:
The test results with ACL 23.11 were:
When I set --stag=any --dtag=any, I got about 2.9 s:
The test results were normal, although ACL was still used with gemm:ref. I'll analyze it.
The performance issue has been reproduced on Apple M2 Pro.
Several benchdnn reproducers:
oneDNN 3.6.2 with ACL 24.09 gives 0.135 / 0.1 / 0.099 ms, respectively, on Apple M2 Pro.
oneDNN 3.6.2 with ACL 24.11 gives 0.4 / 0.22 / 0.196 ms, respectively, on Apple M2 Pro.
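To put the numbers above in relative terms, here is a minimal sketch that computes the per-reproducer slowdown from the timings quoted in this issue. The variable and function names are made up for illustration; only the six timings come from the measurements above.

```python
# Timings in milliseconds as reported above (Apple M2 Pro, oneDNN 3.6.2).
# Each list holds one value per benchdnn reproducer, in the same order.
acl_24_09_ms = [0.135, 0.1, 0.099]
acl_24_11_ms = [0.4, 0.22, 0.196]


def slowdown_factors(baseline_ms, candidate_ms):
    """Return the candidate/baseline time ratio for each reproducer."""
    return [c / b for b, c in zip(baseline_ms, candidate_ms)]


factors = slowdown_factors(acl_24_09_ms, acl_24_11_ms)
for index, factor in enumerate(factors, start=1):
    print(f"reproducer {index}: {factor:.2f}x slower with ACL 24.11")
```

With these inputs, the three reproducers come out roughly 2x to 3x slower with ACL 24.11 than with ACL 24.09.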