Replies: 1 comment
Duplicate of issue #5572.
Hello,
I am using vLLM to run Llama models for RAG. However, I keep hitting a `Runnable` type error. This is my vLLM model initialization:

```python
from vllm import LLM

llm_vllm = LLM(
    model="Llama-2-7b-chat-hf",
    device="cuda",
)
```
When I try to create a chain with:

```python
chain = (
    {"context": retriever, "question": RunnablePassthrough()}
    | prompt
    | llm_vllm
    | StrOutputParser()
)
response = chain.invoke(user_question)
```

I get the following error:

```
TypeError: Expected a Runnable, callable or dict. Instead got an unsupported type: <class 'vllm.entrypoints.llm.LLM'>
```
Similarly, if I use:

```python
from langchain.chains.question_answering import load_qa_chain

chain = load_qa_chain(llm_vllm, chain_type="stuff")
```

I get the error: `llm` instance of `Runnable` expected.

Is there any solution for this?
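For context on why both calls fail: LangChain's `|` composition coerces plain callables and dicts into Runnables, but rejects any other object, and `vllm.entrypoints.llm.LLM` is neither a Runnable nor callable. The toy classes below are a minimal stdlib sketch of that coercion rule (they are illustrative stand-ins, not LangChain's actual implementation):

```python
# Toy model of LCEL's coercion rule: '|' accepts Runnables, callables,
# or dicts, and raises TypeError for anything else -- mirroring the
# reported error message.
class Runnable:
    def __init__(self, fn):
        self.fn = fn

    def invoke(self, x):
        return self.fn(x)

    def __or__(self, other):
        nxt = coerce(other)
        return Runnable(lambda x: nxt.invoke(self.invoke(x)))


def coerce(obj):
    if isinstance(obj, Runnable):
        return obj
    if callable(obj):          # plain functions are wrapped automatically
        return Runnable(obj)
    if isinstance(obj, dict):  # dicts become a parallel step (simplified)
        return Runnable(lambda x: {k: coerce(v).invoke(x) for k, v in obj.items()})
    raise TypeError(
        f"Expected a Runnable, callable or dict. "
        f"Instead got an unsupported type: {type(obj)}"
    )


class RawLLM:
    """Stands in for vllm.entrypoints.llm.LLM: has methods, but is not callable."""
    def generate(self, prompt):
        return "..."


chain = Runnable(str.upper) | (lambda s: s + "!")  # callables compose fine
print(chain.invoke("ok"))                          # prints OK!

try:
    Runnable(str.upper) | RawLLM()                 # not callable -> rejected
except TypeError as e:
    print("TypeError:", e)
```

In practice, the usual fix is to construct the model through LangChain's own vLLM wrapper (`from langchain_community.llms import VLLM`) rather than instantiating `vllm.entrypoints.llm.LLM` directly; the wrapper subclasses LangChain's LLM base class and is therefore a Runnable that works in LCEL chains and `load_qa_chain`. Check the wrapper's documentation for the exact constructor parameters supported by your installed version.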