[Feature] Implement Health Check Endpoint for Delayed Service Startup #764

isaacncz · 2024-10-07T08:35:26Z

OS type
Ubuntu

Description
When running the example Translation using Docker Compose, one of the images takes additional time to pull a model from the Huggingface upon startup. During this period, the service is unresponsive to HTTP requests, resulting in HTTP 500 errors.

To improve reliability, would like to propose adding a health check endpoint that can verify when the service is ready to handle requests. This will allow other services and users to know when the service is up and running, avoiding unnecessary errors and improving the user experience.

Expected Behavior:

Docker Compose starts all services.
The service in question takes some time to pull the model.
A health check endpoint will be available to verify when the model has finished loading and the service is ready.
Proposed Solution:

Add a /health endpoint that returns:
200 OK when the service is fully operational.
503 Service Unavailable or similar status when the service is still initializing or loading the model.
Optionally, provide a message or a status code that indicates the estimated time remaining for startup.

louie-tsai · 2024-10-07T21:58:47Z

v1/health_check endpoint might work for you.
https://github.com/opea-project/GenAIComps/tree/main/comps/embeddings/tei/langchain#3-consume-embedding-service
https://github.com/opea-project/GenAIComps/tree/main/comps/llms/text-generation#31-check-service-status

it should be implemented as a cores feature.
https://github.com/opea-project/GenAIComps/blob/main/comps/cores/mega/http_service.py#L68

could you try whether v1/health_check work for your case?

louie-tsai · 2024-10-08T17:01:39Z

@isaacncz
I put one of the example for health check below.
health check
curl http://localhost:3007/v1/health_check -X GET -H 'Content-Type: application/json'
response from microservice
{"Service Title":"opea_service@llm_tgi/MicroService","Service Description":"OPEA Microservice Infrastructure"}

…pea-project#764

…pea-project#764 Signed-off-by: Foong, Khang Sheong <khang.sheong.foong@intel.com>

louie-tsai · 2024-10-15T14:41:25Z

@isaacncz
Do you have further questions?
if not, we will close the ticket.

isaacncz · 2024-10-15T22:54:27Z

@louie-tsai i have tested the health check, it worked. However, for llm microservice, i will not be able to check whether the model is already downloaded completely.

louie-tsai · 2024-10-16T16:50:03Z

@isaacncz
no check for model download completion yet indeed.
@kevinintel
There is need to show the LLM model download completion status via health_check or statistics.
please help to evaluate the feature.

kevinintel · 2024-10-17T06:13:47Z

It depends on serving framework, we only know the service ready or not

louie-tsai · 2024-10-29T00:10:28Z

@kevinintel
will let you handle this feature request which is asking about serving framework readiness.

louie-tsai added the enhancement New feature or request label Oct 7, 2024

preethivenkatesh assigned louie-tsai Oct 9, 2024

preethivenkatesh added the aitce label Oct 9, 2024

Khangf added a commit to Khangf/GenAIComps that referenced this issue Oct 9, 2024

feature: enable delayed microservices spawning based on dependencies o…

1d437cc

…pea-project#764

Khangf mentioned this issue Oct 9, 2024

Enable delayed microservices spawning based on dependencies #772

Open

1 task

Khangf added a commit to Khangf/GenAIComps that referenced this issue Oct 9, 2024

feature: enable delayed microservices spawning based on dependencies o…

aff7572

…pea-project#764 Signed-off-by: Foong, Khang Sheong <khang.sheong.foong@intel.com>

louie-tsai assigned kevinintel Oct 16, 2024

louie-tsai added the DEV features label Oct 16, 2024

louie-tsai removed their assignment Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Implement Health Check Endpoint for Delayed Service Startup #764

[Feature] Implement Health Check Endpoint for Delayed Service Startup #764

isaacncz commented Oct 7, 2024

louie-tsai commented Oct 7, 2024

louie-tsai commented Oct 8, 2024

louie-tsai commented Oct 15, 2024

isaacncz commented Oct 15, 2024

louie-tsai commented Oct 16, 2024

kevinintel commented Oct 17, 2024

louie-tsai commented Oct 29, 2024

[Feature] Implement Health Check Endpoint for Delayed Service Startup #764

[Feature] Implement Health Check Endpoint for Delayed Service Startup #764

Comments

isaacncz commented Oct 7, 2024

louie-tsai commented Oct 7, 2024

louie-tsai commented Oct 8, 2024

louie-tsai commented Oct 15, 2024

isaacncz commented Oct 15, 2024

louie-tsai commented Oct 16, 2024

kevinintel commented Oct 17, 2024

louie-tsai commented Oct 29, 2024