nx-cugraph: faster `shortest_path` #4739

eriknw · 2024-10-24T22:32:45Z

For larger graphs, nearly all the time was spent creating the dict of lists of paths. I couldn't find a better way to create these, nor could I find an approach to compute more in PLC or cupy. So, the solution in this PR is to avoid computing until needed!

This now returns a Mapping instead of a dict. Will anybody care or notice that the return type isn't strictly a dict?

~~This is currently only for unweighted bfs. We should also do weighted sssp paths.~~ Update to do sssp too.

~~Also, this currently recurses. If we like the approach in this PR, we should update computing the paths on demand to not recurse.~~ Updated to not recurse to avoid any chance of hitting recursion limit, which would have been unlikely, but possible.

If all paths are used--and, hence, computed in PathMapping--then overall performance remains comparable. Hence, this PR speeds up performance (sometimes by a lot!) or keeps it the same, and the cost is a non-dict return types, which I think is okay for backends to do, because duck-typing is a thing in Python.

For larger graphs, nearly all the time was spent creating the dict of lists of paths. I couldn't find a better way to create these, nor could I find an approach to compute more in PLC or cupy. So, the solution in this PR is to avoid computing until needed! This now returns a `Mapping` instead of a `dict`. Will anybody care or notice that the return type isn't strictly a dict? This is currently only for unweighted bfs. We should also do weighted sssp paths. Also, this currently recurses. If we like the approach in this PR, we should update computing the paths on demand to not recurse.

ChuckHastings · 2024-10-25T17:36:45Z

If you can define the behavior you want, we could explore adding some PLC functionality to improve this.

eriknw · 2024-10-25T18:50:36Z

If you can define the behavior you want, we could explore adding some PLC functionality to improve this.

Thanks @ChuckHastings, this is what I tried to do initially, but I couldn't determine anything that would actually help, because the slow part is creating Python lists for each node. Maybe we could get a factor of 2 or so if we moved the original code to Cython (but I'm skeptical), but this just feels awkward.

ChuckHastings · 2024-10-25T19:38:46Z

Did you look at

cugraph/cpp/include/cugraph_c/traversal_algorithms.h

Line 158 in f917ae4

* @brief Extract BFS or SSSP paths from a cugraph_paths_result_t

?

This function should return a dense matrix where each row represents the path from a destination matrix back to the source from the BFS. So result[0][0] would be the first destination, result[0][1] would be the predecessor, etc. back to the source. Any vertices beyond the source on a given path would be set to invalid_vertex_id.

Would that make things faster? You wouldn't have to do the predecessor lookups, you would already have contiguous values in GPU memory.

rlratzel · 2024-10-30T21:17:26Z

Thanks, @eriknw. Please also include the before and after output from the relevant pytest benchmarks.

the cost is a non-dict return types, which I think is okay for backends to do, because duck-typing is a thing in Python.

We should discuss this at the next NetworkX dispatching meeting.

eriknw · 2024-11-06T02:03:43Z

Did you look at

cugraph/cpp/include/cugraph_c/traversal_algorithms.h

Line 158 in f917ae4

* @brief Extract BFS or SSSP paths from a cugraph_paths_result_t

Thanks for sharing that, I didn't know about it! I did experiment as if such a thing (or CSR-like path results) could be done in C, and I found it didn't help performance.

eriknw · 2024-11-06T02:08:44Z

Please also include the before and after output from the relevant pytest benchmarks.

Benchmarks before this PR:

Benchmarks with this PR:

eriknw · 2024-11-07T02:38:49Z

We should discuss this at the next NetworkX dispatching meeting.

I brought this up during the nx dispatching meeting today. All present thought this approach is perfectly fine.

eriknw added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change python nx-cugraph labels Oct 24, 2024

eriknw requested a review from a team as a code owner October 24, 2024 22:32

eriknw added 2 commits October 25, 2024 03:44

Don't recurse; also do this for sssp; bump lint.yaml

51280c0

Comment to indicate difference, since some code is repeated

b590224

eriknw added 2 commits November 5, 2024 08:42

Merge branch 'branch-24.12' into faster_shortest_paths

824e8c2

Fix benchmark; bump lint

c500985

github-actions bot added the benchmarks label Nov 6, 2024

Merge branch 'branch-24.12' into faster_shortest_paths

078947b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nx-cugraph: faster `shortest_path` #4739

nx-cugraph: faster `shortest_path` #4739

eriknw commented Oct 24, 2024 •

edited

Loading

ChuckHastings commented Oct 25, 2024

eriknw commented Oct 25, 2024

ChuckHastings commented Oct 25, 2024

rlratzel commented Oct 30, 2024

eriknw commented Nov 6, 2024

eriknw commented Nov 6, 2024

eriknw commented Nov 7, 2024

nx-cugraph: faster shortest_path #4739

Are you sure you want to change the base?

nx-cugraph: faster shortest_path #4739

Conversation

eriknw commented Oct 24, 2024 • edited Loading

ChuckHastings commented Oct 25, 2024

eriknw commented Oct 25, 2024

ChuckHastings commented Oct 25, 2024

rlratzel commented Oct 30, 2024

eriknw commented Nov 6, 2024

eriknw commented Nov 6, 2024

eriknw commented Nov 7, 2024

nx-cugraph: faster `shortest_path` #4739

nx-cugraph: faster `shortest_path` #4739

eriknw commented Oct 24, 2024 •

edited

Loading