
LLamaEmbedder 2.0 #902

Merged: 2 commits into SciSharp:master on Aug 31, 2024

Conversation

martindevans
Member

Totally rewrote the LLamaEmbedder, based on https://github.com/ggerganov/llama.cpp/tree/master/examples/embedding. The new embedder properly handles pooling, returning either one embedding for the whole sequence or one per token (a minimal mean-pooling sketch follows the notes below). This rewrite does not support batching; it still processes just one string at a time.

  • Added `Encode` methods to `LLamaContext`
  • Moved some native methods from `NativeApi` to `SafeLLamaContextHandle` and wrapped them properly
  • Added a `HasDecoder` property to `SafeLlamaModelHandle`. The underlying function doesn't exist in the current version of llama.cpp and will need to be hooked up in the next binary update
  • Added some normalization methods as extensions on span/array (a sketch of the idea follows this list). This required adding a dependency on `System.Numerics.Tensors`
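
As a rough illustration of the normalization extensions, here is a minimal sketch of in-place Euclidean (L2) normalization built on the `System.Numerics.Tensors` package. The class and method names are illustrative, not necessarily the ones added by this PR.

```csharp
using System;
using System.Numerics.Tensors;

// Hypothetical sketch: in-place Euclidean (L2) normalization as a Span extension,
// using the System.Numerics.Tensors package this PR takes a dependency on.
public static class SpanNormalizationExtensions
{
    public static void EuclideanNormalize(this Span<float> vector)
    {
        // Euclidean norm: sqrt of the sum of squares.
        var norm = TensorPrimitives.Norm(vector);

        // Divide every element by the norm (skip an all-zero vector to avoid NaNs).
        if (norm > 0)
            TensorPrimitives.Divide(vector, norm, vector);
    }
}
```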

DRAFT until the `HasDecoder` function exists, i.e. after the next binary update.
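
For context on what "one embedding for the whole sequence" means, here is a standalone sketch of mean pooling, i.e. averaging per-token embeddings into a single vector. In the embedder itself the pooling is performed by llama.cpp according to the context's pooling type; this example only illustrates the computation and is not code from this PR.

```csharp
using System;
using System.Collections.Generic;

public static class PoolingSketch
{
    // Collapse per-token embeddings into one sequence embedding by averaging each dimension.
    public static float[] MeanPool(IReadOnlyList<float[]> tokenEmbeddings)
    {
        if (tokenEmbeddings.Count == 0)
            throw new ArgumentException("Need at least one token embedding", nameof(tokenEmbeddings));

        var dim = tokenEmbeddings[0].Length;
        var pooled = new float[dim];

        // Sum each dimension across all tokens...
        foreach (var token in tokenEmbeddings)
            for (var i = 0; i < dim; i++)
                pooled[i] += token[i];

        // ...then divide by the token count to get the mean.
        for (var i = 0; i < dim; i++)
            pooled[i] /= tokenEmbeddings.Count;

        return pooled;
    }
}
```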

@martindevans
Member Author

Rebased onto #905

@martindevans marked this pull request as ready for review on August 26, 2024 at 14:00
Commit message (duplicating the description above, plus one addition):

 - Using `llama_set_embeddings` to toggle embedding mode on, so it no longer needs to be specified in the params (a sketch of the underlying native call follows)
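
llama.cpp exposes embedding mode as a runtime toggle via `llama_set_embeddings(struct llama_context *, bool)`. Below is a minimal P/Invoke sketch of that native call; the library name and wrapper class are assumptions for illustration, since in this PR the call is wrapped on `SafeLLamaContextHandle`.

```csharp
using System;
using System.Runtime.InteropServices;

// Hypothetical standalone binding for llama.cpp's llama_set_embeddings.
// The native library name ("llama") and this class are illustrative assumptions;
// the PR itself wraps the call on SafeLLamaContextHandle.
public static class EmbeddingToggleNative
{
    // C signature: void llama_set_embeddings(struct llama_context * ctx, bool embeddings);
    [DllImport("llama", CallingConvention = CallingConvention.Cdecl)]
    public static extern void llama_set_embeddings(IntPtr ctx, [MarshalAs(UnmanagedType.I1)] bool embeddings);
}
```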
@martindevans merged commit 6025979 into SciSharp:master on Aug 31, 2024
6 checks passed
@martindevans deleted the llama_embedder_2 branch on August 31, 2024 at 19:04