[documentation] Add an optimized linear operator for sparse matrix-vector product on Nvidia GPUs #783

amontoison · 2023-08-26T05:59:23Z

For sparse matrix-vector products on Nvidia GPUs, we need to allocate some buffers.
The current implementation allocates a new buffer at each product A * v.
We could reuse the same buffer if we implement a linear operator based on the low-level CUDA wrappers.

The text was updated successfully, but these errors were encountered:

amontoison · 2023-12-16T03:19:50Z

I did it in KrylovPreconditioners.jl.
The user just needs to do:

using KrylovPreconditioners
opA = KrylovOperator(A)

A can be a sparse COO, CSR or CSC matrices.
It also works for AMD GPUs.
I should add a section about it in Performance Tips or some remarks in the section GPU.

amontoison self-assigned this Aug 26, 2023

amontoison mentioned this issue Dec 16, 2023

[documentation] KrylovOperator -- optimized sparse products on NVIDIA and AMD GPUs #848

Merged

amontoison closed this as completed in #848 Feb 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[documentation] Add an optimized linear operator for sparse matrix-vector product on Nvidia GPUs #783

[documentation] Add an optimized linear operator for sparse matrix-vector product on Nvidia GPUs #783

amontoison commented Aug 26, 2023

amontoison commented Dec 16, 2023 •

edited

Loading

[documentation] Add an optimized linear operator for sparse matrix-vector product on Nvidia GPUs #783

[documentation] Add an optimized linear operator for sparse matrix-vector product on Nvidia GPUs #783

Comments

amontoison commented Aug 26, 2023

amontoison commented Dec 16, 2023 • edited Loading

amontoison commented Dec 16, 2023 •

edited

Loading