Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add containers/tei/{cpu,gpu}/1.5.0 #61

Draft
wants to merge 4 commits into
base: main
Choose a base branch
from
Draft

Add containers/tei/{cpu,gpu}/1.5.0 #61

wants to merge 4 commits into from

Conversation

alvarobartt
Copy link
Member

@alvarobartt alvarobartt commented Jul 26, 2024

Description

This PR adds a new container for TEI v1.5.0 recently released (see https://github.com/huggingface/text-embeddings-inference/releases/tag/v1.5.0).

The main features within TEI v1.5.0 are the following:

To inspect the changes required to make the TEI container work in GCP, see the diff at:

@alvarobartt
Copy link
Member Author

Note

This PR is on hold, since the CPU version requires the model to have ONNX compatible weights, and there are a bunch of models that only contain the safetensors weights.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant