add k8s docs for getting started and helm #179

devpramod · 2024-09-25T16:00:48Z

This PR contains the following docs:

Getting Started for k8s - Installation, basic introduction to k8s and has a section for helm. As more k8s deployment modes are added, corresponding sections will be created in this doc
Deploy using helm charts, a doc that follows the xeon.md template as much as possible to deploy ChatQnA on k8s

Signed-off-by: devpramod <pramod.pai@intel.com>

dbkinder

some suggested edits

Also, when you add new documents, they need to be linked into the table of contents structure. There's an index.rst file in this folder you can edit to add these two documents.

I'd suggest you add an edit to the index.rst doc in this deploy folder, and replace the existing Kubernetes section with this:

Kubernetes
**********

.. toctree::
   :maxdepth: 1

   k8s_getting_started
   TGI on Xeon with Helm Charts <k8s_helm>

* Xeon & Gaudi with GMC
* Xeon & Gaudi without GMC

examples/ChatQnA/deploy/k8s_getting_started.md

examples/ChatQnA/deploy/k8s_helm.md

examples/ChatQnA/deploy/k8s_getting_started.md

examples/ChatQnA/deploy/k8s_helm.md

Signed-off-by: devpramod <pramod.pai@intel.com> Signed-off-by: devpramod <pramod.pai@intel.com>

examples/ChatQnA/deploy/index.rst

examples/ChatQnA/deploy/k8s_helm.md

Signed-off-by: devpramod <pramod.pai@intel.com>

dbkinder

LGTM, thanks!

tylertitsworth

Some of the stuff I see in the docs, is just a tutorial on things that already have docs. Like TGI/TEI, Helm, and Kubernetes. It feels a lot like we're overexplaining a concept that can be answered by a link to the source docs of another tool and a command for how it's relevant to use with ChatQnA.

For reference, this is the most handholding I would do in the case of deploying TGI:

Configure Model Server

Before we deploy a model, we need to configure the model server with information like, what model to use and how many max tokens to use. We will be using the tgi-on-intel helm chart. This chart uses XPU to the serve model normally, but we are going to configure it to use gaudi2 instead.

First, look at the configuration files in the tgi directory and add/remove any configuration options relevant to your workflow:

cd tgi
# Create a new configmap for your model server to use
kubectl apply -f cm.yaml

Tip

Here is the reference to the Huggingface Launcher Environment Variables and the TGI-Gaudi Environment Variables.

Deploy Model Server

Now that we have configured the model server, we can deploy it to Kubernetes. Using the provided config.yaml file in the tgi directory, we can deploy the model server.

Modify any values like resources or replicas in the config.yaml file to suit your needs. Then, deploy the model server:

# Encode HF Token for secret.encodedToken
echo -n '<token>' | base64
# Install Chart
git clone https://github.com/intel/ai-containers
helm install model-server -f config.yaml ai-containers/workflows/charts/tgi
# Check the pod status
kubectl get pod
kubectl logs -f <pod-name>

Please use a tool like markdownlint to ensure consistent styling.

examples/ChatQnA/deploy/k8s_helm.md

examples/ChatQnA/deploy/k8s_getting_started.md

tylertitsworth · 2024-09-27T22:53:53Z

examples/ChatQnA/deploy/k8s_getting_started.md

+
+### Kubernetes Cluster and Development Environment
+
+**Setting Up the Kubernetes Cluster:** Before beginning deployment for the ChatQnA application, ensure that a Kubernetes cluster is ready. For guidance on setting up your Kubernetes cluster, please refer to the comprehensive setup instructions available on the [Opea Project deployment guide](https://opea-project.github.io/latest/deploy/index.html).


There is a very subtle difference between bolded and non-bolded text that is confusing to read. I would rather see an h4 here (####) than bolded text + a colon.

@tylertitsworth
Do you suggest switching to an h4 only for the section "Kubernetes Cluster and Development Environment"?
reason I ask is, the document has many items that are bolded and may not be suitable for h4
like kubectl, pods, Using Helm Charts, Using Manifests etc

examples/ChatQnA/deploy/k8s_getting_started.md

examples/ChatQnA/deploy/k8s_helm.md

tylertitsworth · 2024-09-30T15:56:46Z

examples/ChatQnA/deploy/k8s_helm.md

+NAMESPACE: chatqa
+STATUS: deployed
+REVISION: 1
+


No installation notes is a bit strange, this could do a LOT of heavy lifting for the docs here.

@tylertitsworth The installation notes are under Validate microservice --> Check the pod status which are the following sections
Do you have any suggestions to make it better? or add something?

examples/ChatQnA/deploy/k8s_helm.md

dbkinder · 2024-09-30T16:15:30Z

I've got a script in docs/scripts/checkmd.sh that uses pymarkdown (lint) to scan markdown files, with a bunch of checks disabled. Alas, if I wasn't retiring today, including a markdown linter was on my list to add to the CI checks. :)

Signed-off-by: devpramod <pramod.pai@intel.com>

add k8s docs for getting started and helm

44d9c28

Signed-off-by: devpramod <pramod.pai@intel.com>

devpramod requested review from dbkinder, chensuyue, ftian1, mkbhanda, preethivenkatesh, chickenrae and tomlenth as code owners September 25, 2024 16:00

dbkinder suggested changes Sep 26, 2024

View reviewed changes

fix formatting issues

8e7f138

Signed-off-by: devpramod <pramod.pai@intel.com> Signed-off-by: devpramod <pramod.pai@intel.com>

devpramod force-pushed the main branch from 6399e6d to 8e7f138 Compare September 27, 2024 15:42

dbkinder suggested changes Sep 27, 2024

View reviewed changes

examples/ChatQnA/deploy/index.rst Outdated Show resolved Hide resolved

examples/ChatQnA/deploy/k8s_helm.md Show resolved Hide resolved

update toctree

fcf8851

Signed-off-by: devpramod <pramod.pai@intel.com>

dbkinder approved these changes Sep 27, 2024

View reviewed changes

tylertitsworth suggested changes Sep 30, 2024

View reviewed changes

upddate both docs

12e2e4f

Signed-off-by: devpramod <pramod.pai@intel.com>

ftian1 approved these changes Oct 16, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add k8s docs for getting started and helm #179

add k8s docs for getting started and helm #179

devpramod commented Sep 25, 2024

dbkinder left a comment •

edited

Loading

dbkinder left a comment

tylertitsworth left a comment •

edited

Loading

tylertitsworth Sep 27, 2024

devpramod Oct 16, 2024

tylertitsworth Sep 30, 2024

devpramod Oct 16, 2024

dbkinder commented Sep 30, 2024


		### Kubernetes Cluster and Development Environment

		Setting Up the Kubernetes Cluster: Before beginning deployment for the ChatQnA application, ensure that a Kubernetes cluster is ready. For guidance on setting up your Kubernetes cluster, please refer to the comprehensive setup instructions available on the [Opea Project deployment guide](https://opea-project.github.io/latest/deploy/index.html).

add k8s docs for getting started and helm #179

Are you sure you want to change the base?

add k8s docs for getting started and helm #179

Conversation

devpramod commented Sep 25, 2024

dbkinder left a comment • edited Loading

Choose a reason for hiding this comment

dbkinder left a comment

Choose a reason for hiding this comment

tylertitsworth left a comment • edited Loading

Choose a reason for hiding this comment

Configure Model Server

Deploy Model Server

tylertitsworth Sep 27, 2024

Choose a reason for hiding this comment

devpramod Oct 16, 2024

Choose a reason for hiding this comment

tylertitsworth Sep 30, 2024

Choose a reason for hiding this comment

devpramod Oct 16, 2024

Choose a reason for hiding this comment

dbkinder commented Sep 30, 2024

dbkinder left a comment •

edited

Loading

tylertitsworth left a comment •

edited

Loading