From 5d7364ecd96a1d0351b6d91f66615b3b2b129782 Mon Sep 17 00:00:00 2001 From: "chen, suyue" Date: Wed, 16 Oct 2024 15:53:20 +0800 Subject: [PATCH 1/3] add PINECONE_KEY_LANGCHAIN_TEST for CI test (#959) Signed-off-by: chensuyue Signed-off-by: lucasmelogithub --- .github/workflows/_run-docker-compose.yml | 1 + 1 file changed, 1 insertion(+) diff --git a/.github/workflows/_run-docker-compose.yml b/.github/workflows/_run-docker-compose.yml index ee92e611f..37142b7a1 100644 --- a/.github/workflows/_run-docker-compose.yml +++ b/.github/workflows/_run-docker-compose.yml @@ -118,6 +118,7 @@ jobs: GOOGLE_CSE_ID: ${{ secrets.GOOGLE_CSE_ID }} GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }} PINECONE_KEY: ${{ secrets.PINECONE_KEY }} + PINECONE_KEY_LANGCHAIN_TEST: ${{ secrets.PINECONE_KEY_LANGCHAIN_TEST }} IMAGE_REPO: ${{ inputs.registry }} IMAGE_TAG: ${{ inputs.tag }} example: ${{ inputs.example }} From 6e9fff1e763c96e3f049b2af83314a4e9471eb2b Mon Sep 17 00:00:00 2001 From: lucasmelogithub Date: Wed, 16 Oct 2024 11:59:23 -0500 Subject: [PATCH 2/3] ChatQnA - Add Terraform and Ansible Modules information Signed-off-by: lucasmelogithub --- ChatQnA/README.md | 22 +++++++++++++++++++++- 1 file changed, 21 insertions(+), 1 deletion(-) diff --git a/ChatQnA/README.md b/ChatQnA/README.md index b9167fa43..994d38f11 100644 --- a/ChatQnA/README.md +++ b/ChatQnA/README.md @@ -4,7 +4,27 @@ Chatbots are the most widely adopted use case for leveraging the powerful chat a RAG bridges the knowledge gap by dynamically fetching relevant information from external sources, ensuring that responses generated remain factual and current. The core of this architecture are vector databases, which are instrumental in enabling efficient and semantic retrieval of information. These databases store data as vectors, allowing RAG to swiftly access the most pertinent documents or data points based on semantic similarity. -## Deploy ChatQnA Service +## šŸ¤– Automated Terraform Deployment using IntelĀ® Optimized Cloud Modules for **Terraform** + +| Cloud Provider | Intel Architecture | Intel Optimized Cloud Module for Terraform | Comments | +|----------------|-------------------------|-------------------------------------------|----------| +| AWS | 4th Gen Intel Xeon with Intel AMX | [AWS Module](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-xeon-opea-chatqna) | Uses Intel/neural-chat-7b-v3-3 by default | +| AWS Falcon2-11B| 4th Gen Intel Xeon with Intel AMX | [AWS Module with Falcon11B](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-xeon-opea-chatqna-falcon11B) | Uses TII Falcon2-11B LLM Model | +| GCP | 5th Gen Intel Xeon with Intel AMX | [GCP Module](https://github.com/intel/terraform-intel-gcp-vm/tree/main/examples/gen-ai-xeon-opea-chatqna) | Also supports Confidential AI by using IntelĀ® TDX with 4th Gen Xeon | +| Azure | 5th Gen Intel Xeon with Intel AMX | Work-in-progress | Work-in-progress | +| Intel Tiber AI Cloud | 5th Gen Intel Xeon with Intel AMX | Work-in-progress | Work-in-progress | + + +## Automated Deployment to Ubuntu based system(if not using Terraform) using IntelĀ® Optimized Cloud Modules for **Ansible** + +To deploy to existing Xeon Ubuntu based system, use our Intel Optimized Cloud Modules for Ansible. This is the same Ansible playbook used by Terraform. +Use this if you are not using Terraform and have provisioned your system with another tool or manually including bare metal. +| Operating System | Intel Optimized Cloud Module for Ansible | +|------------------|------------------------------------------| +| Ubuntu 20.04 | [ChatQnA Ansible Module](https://github.com/intel/optimized-cloud-recipes/tree/main/recipes/ai-opea-chatqna-xeon) | +| Ubuntu 22.04 | Work-in-progress | + +## Manually Deploy ChatQnA Service The ChatQnA service can be effortlessly deployed on Intel Gaudi2, Intel Xeon Scalable Processors and Nvidia GPU. From d83e18e0a2d0d990c47d7915cc250cab55f8ba68 Mon Sep 17 00:00:00 2001 From: "pre-commit-ci[bot]" <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed, 16 Oct 2024 17:48:07 +0000 Subject: [PATCH 3/3] [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --- ChatQnA/README.md | 21 ++++++++++----------- 1 file changed, 10 insertions(+), 11 deletions(-) diff --git a/ChatQnA/README.md b/ChatQnA/README.md index 994d38f11..e613f9b3d 100644 --- a/ChatQnA/README.md +++ b/ChatQnA/README.md @@ -4,16 +4,15 @@ Chatbots are the most widely adopted use case for leveraging the powerful chat a RAG bridges the knowledge gap by dynamically fetching relevant information from external sources, ensuring that responses generated remain factual and current. The core of this architecture are vector databases, which are instrumental in enabling efficient and semantic retrieval of information. These databases store data as vectors, allowing RAG to swiftly access the most pertinent documents or data points based on semantic similarity. -## šŸ¤– Automated Terraform Deployment using IntelĀ® Optimized Cloud Modules for **Terraform** - -| Cloud Provider | Intel Architecture | Intel Optimized Cloud Module for Terraform | Comments | -|----------------|-------------------------|-------------------------------------------|----------| -| AWS | 4th Gen Intel Xeon with Intel AMX | [AWS Module](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-xeon-opea-chatqna) | Uses Intel/neural-chat-7b-v3-3 by default | -| AWS Falcon2-11B| 4th Gen Intel Xeon with Intel AMX | [AWS Module with Falcon11B](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-xeon-opea-chatqna-falcon11B) | Uses TII Falcon2-11B LLM Model | -| GCP | 5th Gen Intel Xeon with Intel AMX | [GCP Module](https://github.com/intel/terraform-intel-gcp-vm/tree/main/examples/gen-ai-xeon-opea-chatqna) | Also supports Confidential AI by using IntelĀ® TDX with 4th Gen Xeon | -| Azure | 5th Gen Intel Xeon with Intel AMX | Work-in-progress | Work-in-progress | -| Intel Tiber AI Cloud | 5th Gen Intel Xeon with Intel AMX | Work-in-progress | Work-in-progress | +## šŸ¤– Automated Terraform Deployment using IntelĀ® Optimized Cloud Modules for **Terraform** +| Cloud Provider | Intel Architecture | Intel Optimized Cloud Module for Terraform | Comments | +| -------------------- | --------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------- | +| AWS | 4th Gen Intel Xeon with Intel AMX | [AWS Module](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-xeon-opea-chatqna) | Uses Intel/neural-chat-7b-v3-3 by default | +| AWS Falcon2-11B | 4th Gen Intel Xeon with Intel AMX | [AWS Module with Falcon11B](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-xeon-opea-chatqna-falcon11B) | Uses TII Falcon2-11B LLM Model | +| GCP | 5th Gen Intel Xeon with Intel AMX | [GCP Module](https://github.com/intel/terraform-intel-gcp-vm/tree/main/examples/gen-ai-xeon-opea-chatqna) | Also supports Confidential AI by using IntelĀ® TDX with 4th Gen Xeon | +| Azure | 5th Gen Intel Xeon with Intel AMX | Work-in-progress | Work-in-progress | +| Intel Tiber AI Cloud | 5th Gen Intel Xeon with Intel AMX | Work-in-progress | Work-in-progress | ## Automated Deployment to Ubuntu based system(if not using Terraform) using IntelĀ® Optimized Cloud Modules for **Ansible** @@ -21,8 +20,8 @@ To deploy to existing Xeon Ubuntu based system, use our Intel Optimized Cloud Mo Use this if you are not using Terraform and have provisioned your system with another tool or manually including bare metal. | Operating System | Intel Optimized Cloud Module for Ansible | |------------------|------------------------------------------| -| Ubuntu 20.04 | [ChatQnA Ansible Module](https://github.com/intel/optimized-cloud-recipes/tree/main/recipes/ai-opea-chatqna-xeon) | -| Ubuntu 22.04 | Work-in-progress | +| Ubuntu 20.04 | [ChatQnA Ansible Module](https://github.com/intel/optimized-cloud-recipes/tree/main/recipes/ai-opea-chatqna-xeon) | +| Ubuntu 22.04 | Work-in-progress | ## Manually Deploy ChatQnA Service