Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pull] master from kserve:master #173

Closed
wants to merge 10 commits into from
Closed

Conversation

pull[bot]
Copy link

@pull pull bot commented Jan 18, 2024

See Commits and Changes for more details.


Created by pull[bot]

Can you help keep this open source service alive? 💖 Please sponsor : )

Copy link

openshift-ci bot commented Jan 18, 2024

Hi @pull[bot]. Thanks for your PR.

I'm waiting for a opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

terrytangyuan and others added 2 commits January 21, 2024 18:33
* generate docs

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
1. graph poetry project was not included in previous checks.
    2. As poetry lock --check is deprecated, used `poetry check --lock`.

Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>
@spolti
Copy link
Member

spolti commented Jan 24, 2024

/approve
/lgtm

Copy link

openshift-ci bot commented Jan 24, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: pull[bot], spolti

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

* initial commit for graph raw deployment

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* addd hpa support as well for inference graph raw deployment

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Just for local

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Just local

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Fix local setup

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Sleep change

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Fix self-signed-ca installation (#3165)

Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>
Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* refactored

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Fix logging message

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* adding unit tests

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* adding unit tests

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* removed temporary dev env changes

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Rawdeployment mode type

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* restoring from master branch

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Remove affinity test

Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* add with resource test

Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* added controller test for graph

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* addressed code review comments

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* Added copy right statement for new files

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* Removed dependency for InferenceGraph kind on componentExtensionSpec which is specific to inference service spec

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* restored Makefile from master branch

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* fixed codescan issue for AtoI function

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* fixed test failures

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* Added unit test for hpa reconciler

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* Added Ready status when Deployment is available in raw mode

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* Removed unused code

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* e2e test for ISVC in raw mode

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* improved e2e tests for inference graph raw deployment

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* fixed python lint errors

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* added annotations for ig spec that triggers raw deployment

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* made names unique b/w runs

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* Add test

Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* Adding unit test

Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>

* fixed e2e test failure for ig.  IG is not moving to ready state because of not handling raw deployment mode condition correctly

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* changed marker for graph test

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* corrected test validation for raw deployment mode knative resources

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* correct rebase errors

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* hpa field moved from annotations to inferencegraphspec as fields

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

* commiting make generate output due to InferenceGraphSpec changes

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>

---------

Signed-off-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Signed-off-by: Tanvi Thakur <tanvi_thakur@intuit.com>
Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>
Co-authored-by: Mopuri, Bharath <bharath_mopuri@intuit.com>
Co-authored-by: Tanvi Thakur <tanvi_thakur@intuit.com>
Co-authored-by: Sivanantham <90966311+sivanantha321@users.noreply.github.com>
@openshift-ci openshift-ci bot removed the lgtm label Jan 24, 2024
Copy link

openshift-ci bot commented Jan 24, 2024

New changes are detected. LGTM label has been removed.

When Istio is installed with its CNI plugin, KServe inference services are not capable to start. This is because the storage initializer is an init-container and the network is not available when the CNI plugin is enabled.

The typical recommendation to fix the issue is to remove init-containers and move any initialization code to a regular container. This approach would not work well with KServe, because the serving runtimes assume the model is already present on the filesystem and moving the storage initializer as a regular container will cause race conditions (the runtime will succeed loading only if the storage initializer manages to pull the model before the runtime starts).

There are alternative approaches documented in https://istio.io/latest/docs/setup/additional-setup/cni/#compatibility-with-application-init-containers. All alternatives have the downside that the traffic won't be captured by Istio and won't benefit from Istio features, which should be OK for KServe storage-initializer case.

These changes use the approach for running the storage initializer using the same UserID as the Istio sidecar. The UID is copied from the sidecar container to cover Istio derivatives, like Maistra.

Signed-off-by: Edgar Hernández <23639005+israel-hdez@users.noreply.github.com>
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@pull pull bot added merge-conflict Resolve conflicts manually and removed approved needs-rebase labels Jan 25, 2024
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@pull pull bot removed the needs-rebase label Jan 26, 2024
* Pass missing infer parameters

Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>

* Add tests

Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>

* Fix warnings

Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>

* Add equal magic method for infer types

Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>

* Avoid inplace modification of parameters in conversion

Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>

---------

Signed-off-by: Sivanantham Chinnaiyan <sivanantham.chinnaiyan@ideas2it.com>
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

* feat: Support S3 transfer acceleration

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* Add missing field in ISVC yaml and storage.py

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix test

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

* fix default

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>

---------

Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

* Initial huggingface server implementation

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Implement preprocess and predict for huggingface server

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add postprocess

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Update lock file

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Update func signature

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Update encoding

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add token classification test

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add comments

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Fix comments

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add generate endpoint

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Remove vllm async engine file

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Handle optional vllm

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add async generator

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Fix lint

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Generate response streaming support

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* rename async generate stream file

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Call generate endpoint directly

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Remove the tritonclient dependency

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add huggingface dockerfile

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Test for triton bert case

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add huggingface test

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add Makefile

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Support gpu device

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Add gnereate parameters

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Fix generate tests

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

* Run huggingface test

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

---------

Signed-off-by: Dan Sun <dsun20@bloomberg.net>
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

…elcars (#3384)

Modelcar: Add mount of shared volume to transformer container

Adding the missing mount of the emptyDir volume that holds the symbolic link to the model data in the model sidecar to a transformer container, if it is configured.

Signed-off-by: Roland Huß <rhuss@redhat.com>
chore:  The cloudevents/sdk-go was migrated to v2, however, the v1 is still present in the
        go.mod. It was being used bu the pkg/logger/worker.go file to set a field that was
        not being in use anymore, thus it was removed

Signed-off-by: Spolti <fspolti@redhat.com>
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@israel-hdez
Copy link

Closing, because in #179 there was a code sync. The bot would open a new PR with some additional commits that went in upstream after #179 got ready.

@israel-hdez
Copy link

/close

Copy link

openshift-ci bot commented Jan 29, 2024

@israel-hdez: Closed this PR.

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot closed this Jan 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
⤵️ pull merge-conflict Resolve conflicts manually
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

10 participants