Support for Logprobs Output in Triton Inference Server with vLLM Backend #7557
RafaelXokito asked this question in Q&A (unanswered)
I recently deployed Triton Inference Server with the vLLM backend. We have a use case where we need the log probabilities (logprobs) of the generated tokens alongside the regular text output.
Currently, it appears that Triton with the vLLM backend does not directly support returning logprobs as part of the response.
Is there an existing workaround, or an upcoming feature, that would allow us to retrieve logprobs from the vLLM backend through Triton?
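For reference, this is the kind of request I would expect to work if logprobs were exposed. It is only a sketch: the model name `vllm_model` is a placeholder, and it assumes the backend forwards the `sampling_parameters` JSON string to vLLM's `SamplingParams`, which does accept a `logprobs` count. As far as I can tell, the stock backend returns only `text_output`, so logprobs would not show up in the response without changes to the backend's `model.py`.

```python
# Sketch only: assumes a local Triton server with the HTTP "generate" endpoint
# and a vLLM model deployed under the placeholder name "vllm_model".
import json

import requests

url = "http://localhost:8000/v2/models/vllm_model/generate"
payload = {
    "text_input": "The capital of France is",
    "stream": False,
    # Assumed to be forwarded to vLLM's SamplingParams; "logprobs" requests
    # the top-N log probabilities for each generated token.
    "sampling_parameters": json.dumps({"max_tokens": 16, "logprobs": 5}),
}

resp = requests.post(url, json=payload)
resp.raise_for_status()
# With the stock backend this currently only contains "text_output";
# returning logprobs would require modifying model.py to add them here.
print(json.dumps(resp.json(), indent=2))
```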