
BatchedExecutor Save/Load #681

Conversation

martindevans (Member):
Added the ability to save and load individual conversations in a batched executor.

  • Added BatchedExecutor.Load(filepath) method
  • Added Conversation.Save(filepath) method
  • Added new (currently internal) SaveState/LoadState methods in LLamaContext which can stash some extra binary data in the header. This can be used where other state needs to be saved alongside the raw KV cache.
  • New example showing saving/loading a conversation
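To illustrate how these pieces fit together, here is a minimal usage sketch based on the methods listed above. The exact signatures and the model path are assumptions for illustration; the new example in the repository is the authoritative reference.

```csharp
using LLama;
using LLama.Batched;
using LLama.Common;

// Hypothetical sketch: load a model and create a batched executor
// ("model.gguf" is a placeholder path, not from this PR).
var parameters = new ModelParams("model.gguf");
using var model = LLamaWeights.LoadFromFile(parameters);
using var executor = new BatchedExecutor(model, parameters);

// Create a conversation, prompt it, and run inference as usual.
var conversation = executor.Create();
conversation.Prompt(executor.Context.Tokenize("The quick brown fox"));
await executor.Infer();

// Persist just this conversation (its slice of the KV cache, plus any
// extra binary data stashed in the header) to disk.
conversation.Save("conversation.state");
conversation.Dispose();

// Later, restore the conversation into the executor and continue.
var restored = executor.Load("conversation.state");
```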

Lyrcaxis (Contributor) left a comment:
Nice!

martindevans (Member, Author):
I've just added the ability to save and load an entire Conversation to an in-memory State object, instead of to file.

This was inspired by some of the scheduler stuff @AsakusaRinne has been talking about. With this a scheduler could "swap out" a Conversation object and dispose it, freeing up space in the KV cache for other conversations to be evaluated, all without touching disk.
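The "swap out" pattern described above could look roughly like the following. This is a hedged sketch of the idea, not the exact LLamaSharp API: the in-memory state type and the `Save()`/`Load(state)` overload names are assumptions based on this comment.

```csharp
// Hypothetical sketch of swapping a conversation out of the KV cache
// without touching disk. Method names are assumed, not confirmed.

// Capture the conversation's state into an in-memory object, then
// dispose it to free its space in the KV cache.
var state = conversation.Save();   // in-memory State object, no disk I/O
conversation.Dispose();

// ...the scheduler can now evaluate other conversations in the freed
// cache space...

// Later, rehydrate the conversation from the saved state and resume.
var resumed = executor.Load(state);
```

The design benefit is that a scheduler can multiplex many logical conversations over a fixed-size KV cache, paying only a memory-copy cost rather than file I/O when switching between them.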

zsogitbe (Contributor):
> I've just added the ability to save and load an entire Conversation to an in-memory State object, instead of to file.
>
> This was inspired by some of the scheduler stuff @AsakusaRinne has been talking about. With this a scheduler could "swap out" a Conversation object and dispose it, freeing up space in the KV cache for other conversations to be evaluated, all without touching disk.

Multi-model execution is an important use case. Having such a scheduler would be very useful. Great work!

@martindevans martindevans merged commit ccc49eb into SciSharp:master Apr 23, 2024
3 checks passed
@martindevans martindevans deleted the batched_executor_save_single_conversation branch April 23, 2024 14:47