Sequential and parallel (GPU based) implementation of a Sparse Matrix Vector Multiplication using (Compressed Sparse Row (CSR) matrix format.
make sequential
./bin/SpMV-SEQ <iterations> <print mode [1 default, 2 view data info]> <file path>
make parallel
./bin/SpMV-CUDA <threads num> <iterations> <print mode [1 default, 2 view data info]> <file path>
-
⚠️ IMPORTANT more tests on my GPU (working on my Mac M1 rn🥲) - more performances evaluation
- CUDA workflow