Add Dockerfile for vllm Arc support #641

gavinlichn · 2024-09-09T10:15:47Z

Description

Support vllm inference on Intel ARC GPU

Issues

#629

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

n/a

Tests

n/a

Support vllm inference on Intel ARC GPU Signed-off-by: Li Gang <gang.g.li@intel.com> Co-authored-by: Chen, Hu1 <hu1.chen@intel.com>

hshen14 · 2024-09-10T03:23:38Z

comps/llms/text-generation/vllm/vllm_arc.sh

+source /opt/intel/oneapi/setvars.sh
+source /opt/intel/1ccl-wks/setvars.sh
+
+python -m ipex_llm.vllm.xpu.entrypoints.openai.api_server \


IPEX-LLM is not yet an approved Intel product, so suggest holding on the relevant PRs for ARC support, unless the position is getting more clear.

If ipex-llm not ready enough, do we have other alternate for ARC support currently?
We can try to enable ARC with other alternates if we have.

chensuyue · 2024-09-11T13:58:54Z

comps/llms/text-generation/vllm/docker/Dockerfile.arc

@@ -0,0 +1,10 @@
+# Copyright (C) 2024 Intel Corporation


let's remove the docker folder and rename Dockerfile.arc-> Dockerfile.intel_xpu

Add vllm Arc Dockerfile support

129c673

Support vllm inference on Intel ARC GPU Signed-off-by: Li Gang <gang.g.li@intel.com> Co-authored-by: Chen, Hu1 <hu1.chen@intel.com>

gavinlichn requested a review from lvliang-intel as a code owner September 9, 2024 10:15

gavinlichn mentioned this pull request Sep 9, 2024

Enable ChatQnA with vllm Arc support opea-project/GenAIExamples#771

Open

4 tasks

lvliang-intel approved these changes Sep 9, 2024

View reviewed changes

Merge branch 'main' into arc_vllm

4bf5c1c

hshen14 reviewed Sep 10, 2024

View reviewed changes

chensuyue reviewed Sep 11, 2024

View reviewed changes

lkk12014402 pushed a commit that referenced this pull request Sep 19, 2024

fix tgi xeon tag (#641)

6674832

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Dockerfile for vllm Arc support #641

Add Dockerfile for vllm Arc support #641

gavinlichn commented Sep 9, 2024

hshen14 Sep 10, 2024

gavinlichn Sep 11, 2024

chensuyue Sep 11, 2024

Add Dockerfile for vllm Arc support #641

Are you sure you want to change the base?

Add Dockerfile for vllm Arc support #641

Conversation

gavinlichn commented Sep 9, 2024

Description

Issues

Type of change

Dependencies

Tests

hshen14 Sep 10, 2024

Choose a reason for hiding this comment

gavinlichn Sep 11, 2024

Choose a reason for hiding this comment

chensuyue Sep 11, 2024

Choose a reason for hiding this comment