Multi GPU #202

martindevans · 2023-10-20T12:47:58Z

Added support for multi GPU in the model config.

This is a litte rough at the moment. The IModelParams.TensorSplits array length must be <= NativeApi.llama_max_devices(). If it's too small it's padded up to size (with zeroes). If it's too large the ToLlamaModelParams method throws. I'd rather not require the user to query the native API directly, and I'd rather the error were caught at config time.

This is exposed through a TensorSplitsCollection on the IModelParams interface. The collection can be indexed into (like an array) and automatically has the correct size. There is no way to change the size, preventing incorrect usage.

Tested with multiple GPUs

…correctly setting the `tensor_splits` collection

…rter`

martindevans · 2023-10-20T13:58:00Z

Note that this PR breaks Newtonsoft.Json serialization of IModelParams. System.Text.Json still works fine.

Last time we discussed this I don't think anyone was particularly interested in supporting JSON serialization at all and I only added it because it was very easy to do.

martindevans · 2023-10-22T23:36:11Z

Some discussion about this PR over in #189.

martindevans · 2023-10-26T13:40:35Z

Tested by swiftress on Discord and confirmed working (https://discord.com/channels/1106946823282761851/1106947264938790972/1166937319664795689)

Added multi GPU support

15db194

martindevans mentioned this pull request Oct 20, 2023

Running LLamaSharp on gpu #189

Closed

martindevans added 5 commits October 20, 2023 14:10

Added a safe TensorSplitsCollection to the params which prevents in…

6a4cd50

…correctly setting the `tensor_splits` collection

Improved doc comment on tensor_split

04acbf8

Fixed default value

281e58f

Added System.Text.Json serialization for `TensorSplitsCollectionConve…

b4e7f64

…rter`

spelling

768747c

Fixed serialization

f621ec6

martindevans merged commit 321d0b5 into SciSharp:master Oct 26, 2023
4 checks passed

martindevans deleted the multi_gpu branch October 26, 2023 13:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi GPU #202

Multi GPU #202

martindevans commented Oct 20, 2023 •

edited

Loading

martindevans commented Oct 20, 2023

martindevans commented Oct 22, 2023

martindevans commented Oct 26, 2023

Multi GPU #202

Multi GPU #202

Conversation

martindevans commented Oct 20, 2023 • edited Loading

martindevans commented Oct 20, 2023

martindevans commented Oct 22, 2023

martindevans commented Oct 26, 2023

martindevans commented Oct 20, 2023 •

edited

Loading