
Support Apple Accelerate and improve MKL integration #355

Merged · 11 commits into main from accelerate · Aug 8, 2023

Conversation

@ChrisRackauckas (Member)
No description provided.

@ai-maintainer (bot) left a comment

AI-Maintainer Review for PR - Support Apple Accelerate and improve MKL integration

Title and Description ⚠️

Title is clear but description is missing
The title of the pull request is clear and indicates the purpose of the changes. However, the description is missing. It would be beneficial to include a description that provides additional context and explains the rationale for the changes.

Scope of Changes 👍

Changes are narrowly focused
The changes in this pull request are narrowly focused on adding support for Apple Accelerate and improving the integration with MKL. The modifications are concentrated in a few specific files and do not appear to be trying to resolve multiple unrelated issues simultaneously.

Documentation ⚠️

Missing docstrings for new functions and methods
Several new functions and methods have been added without docstrings. It is recommended to add docstrings to these entities to provide clear and concise descriptions of their behavior, arguments, and return values. The following entities need docstrings:
  • AppleAccelerateLUFactorization
  • is_new_accelerate_available
  • aa_getrf!
  • default_alias_A
  • default_alias_b
  • LinearSolve.init_cacheval
  • SciMLBase.solve!

Testing ⚠️

No information about testing
The description does not provide any information about how the changes were tested. Including details about the testing methodology, such as specific test cases, frameworks, or environments used, would be beneficial to ensure that the changes have been adequately validated.

Suggested Changes

  • Please add a detailed description to the pull request explaining the rationale behind the changes and any additional context that might be helpful.
  • Add docstrings to the new functions and methods to describe their behavior, arguments, and return values.
  • Include information about how the changes were tested. If specific test cases were used, please include them in the description.


src/appleaccelerate.jl — review threads (outdated, resolved)
ChrisRackauckas and others added 4 commits August 7, 2023 12:12
Co-authored-by: Elliot Saba <staticfloat@gmail.com>
Co-authored-by: Elliot Saba <staticfloat@gmail.com>
These should always be available everywhere.
@ViralBShah (Contributor)

Would you ever pass ipiv to aa_getrf? If so, that probably comes in as an Int64 vector and probably should be downcast to an Int32 vector.

@ViralBShah (Contributor)

I believe the smarter thing to do is to use the NEWLAPACK symbols if available, falling back to the old ones if not. I'm not sure whether getrf is faster through the new interface or the same. In case the new LAPACK library has a faster LU, this is worth the effort; some benchmarking is necessary first.

cc @vpuri3
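The symbol-probe idea could be sketched roughly as follows. This is only an illustration, not the PR's code: the framework path and the `$NEWLAPACK` symbol suffix are assumptions about Apple's newer Accelerate LAPACK interface, and `is_new_accelerate_available` is the helper name the review above lists.

```julia
# Sketch: probe Accelerate for the new-interface LAPACK symbols, falling
# back to the classic names when they are absent. Framework path and the
# `$NEWLAPACK` suffix are assumptions about Apple's newer interface.
using Libdl

const LIBACCELERATE = "/System/Library/Frameworks/Accelerate.framework/Accelerate"

function is_new_accelerate_available()
    Sys.isapple() || return false
    hdl = Libdl.dlopen_e(LIBACCELERATE)   # returns C_NULL instead of throwing
    hdl == C_NULL && return false
    Libdl.dlsym_e(hdl, "dgetrf\$NEWLAPACK") != C_NULL
end

# Pick the symbol once at load time; callers then ccall through this name.
const GETRF_SYMBOL = is_new_accelerate_available() ? "dgetrf\$NEWLAPACK" : "dgetrf_"
```

Probing with `dlsym_e` at load time keeps the per-call path branch-free, which matters if the old and new interfaces coexist on some macOS versions.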

@ChrisRackauckas (Member, Author)

> Would you ever pass ipiv to aa_getrf? If so, that probably comes in as an Int64 vector and probably should be downcast to an Int32 vector.

We allocate all of the caches, so it's safe from that. It's set up so that all caches compile up front, and repeated calls then reuse the cache, even for the info ref.
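The cache-reuse pattern described here could look roughly like the sketch below. It is not the PR's actual code: `aa_getrf!` here ccalls Accelerate's classic LP64 `dgetrf_` directly, so the call only runs on macOS, and the framework path is an assumption.

```julia
# Sketch of the caching pattern: every buffer LAPACK writes into (pivots
# and the info flag) is allocated once, then reused by each factorization.
const LIBACC = "/System/Library/Frameworks/Accelerate.framework/Accelerate"

struct GETRFCache
    ipiv::Vector{Cint}         # Accelerate's LP64 getrf takes 32-bit pivots
    info::Base.RefValue{Cint}  # reused info ref, as described above
end
GETRFCache(n::Integer) = GETRFCache(Vector{Cint}(undef, n), Ref{Cint}(0))

function aa_getrf!(A::Matrix{Float64}, cache::GETRFCache)
    m, n = size(A)
    ccall((:dgetrf_, LIBACC), Cvoid,
          (Ref{Cint}, Ref{Cint}, Ptr{Float64}, Ref{Cint}, Ptr{Cint}, Ref{Cint}),
          m, n, A, max(1, m), cache.ipiv, cache.info)
    return A, cache.ipiv, cache.info[]
end

if Sys.isapple()
    cache = GETRFCache(4)          # built once, init_cacheval-style
    aa_getrf!(rand(4, 4), cache)   # repeated calls allocate nothing
end
```

Because the cache owns `ipiv` and `info`, the hot path after the first solve is allocation-free, which is the property Chris describes.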

@codecov (bot) commented Aug 7, 2023

Codecov Report

Merging #355 (6d5aeb4) into main (464156c) will increase coverage by 47.93%.
The diff coverage is 17.74%.

@@             Coverage Diff             @@
##             main     #355       +/-   ##
===========================================
+ Coverage   25.75%   73.68%   +47.93%     
===========================================
  Files          18       19        +1     
  Lines        1254     1353       +99     
===========================================
+ Hits          323      997      +674     
+ Misses        931      356      -575     
Files Changed             | Coverage         | Δ
src/LinearSolve.jl        | 90.90% <ø>       | (+15.90%) ⬆️
src/appleaccelerate.jl    | 7.27% <7.27%>    | (ø)
ext/LinearSolveMKLExt.jl  | 92.30% <100.00%> | (+92.30%) ⬆️

... and 15 files with indirect coverage changes


@ChrisRackauckas ChrisRackauckas merged commit aabe2f2 into main Aug 8, 2023
14 of 17 checks passed
@ChrisRackauckas ChrisRackauckas deleted the accelerate branch August 8, 2023 00:21
@ChrisRackauckas (Member, Author)

> I believe the smarter thing to do is to use the NEWLAPACK symbols if available, falling back to the old ones if not. I'm not sure whether getrf is faster through the new interface or the same. In case the new LAPACK library has a faster LU, this is worth the effort; some benchmarking is necessary first.

If I did it correctly, it doesn't seem that big of a deal in #358

@ViralBShah (Contributor) commented Aug 8, 2023

That seems correctly done. Unless they explicitly multi-threaded the getrf call, all the performance actually just comes from the matmul. OpenBLAS does have a natively multi-threaded getrf, but I think it doesn't have access to the fast matmul kernels on the M-series that Accelerate does: https://github.com/xianyi/OpenBLAS/blob/develop/lapack/getrf/getrf_parallel.c

I suppose the only benefit of the 64-bit version is that you don't have to convert the ipiv vector to 64-bit on return, saving one small allocation.

@ChrisRackauckas (Member, Author)

I just cached the 32-bit version, so the allocation is saved anyway. So yeah, it seems better to just support 32-bit there.
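The allocation being traded away here is just the per-solve widening of the pivot vector; a toy illustration (the values are made up):

```julia
# Toy illustration: widening 32-bit pivots to 64-bit allocates a fresh
# vector on every solve, which caching (and directly consuming) the
# Int32 buffer avoids entirely.
ipiv32 = Cint[2, 2, 3, 4]              # what Accelerate's getrf writes
ipiv64 = convert(Vector{Int}, ipiv32)  # the one small per-solve allocation
```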
