Dynamic model selection with Ollama when asking #838

Maniga · 2024-10-17T08:47:41Z

Maniga
Oct 17, 2024

Hello everyone,

I'm currently working on a project where I'd like to use multiple models in Ollama for Kernel Memory. Specifically, I want to be able to select which Ollama model is used when querying the Kernel Memory with ask. I added the kernel manager as a .net core web api into my dotnet aspire solution.

Is there a standard or recommended way to implement this type of functionality? Would it be necessary to write a custom MemoryService to enable model selection, or is there an existing feature or pattern that supports this?

Any guidance or examples from those who have faced a similar requirement would be greatly appreciated!

Thank you in advance!

dluc · 2024-10-17T18:05:38Z

dluc
Oct 17, 2024
Maintainer

KM supports "Request Context" parameters, which can be used to change configurations during a request, without affecting the deployment or other concurrent requests. To implement what you suggest, you could add a new Request Context param name, and support it in the Ollama generator.

Here's some pointers:

To access params stored in the request context, you can inject IContextProvider into OllamaTextGenerator.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dynamic model selection with Ollama when asking #838

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

Dynamic model selection with Ollama when asking #838

Maniga Oct 17, 2024

Replies: 1 comment

dluc Oct 17, 2024 Maintainer

Maniga
Oct 17, 2024

dluc
Oct 17, 2024
Maintainer