[pull] master from kserve:master #173

Closed
wants to merge 10 commits into from

Conversation

pull[bot]
@pull pull bot commented Jan 18, 2024

See Commits and Changes for more details.


Created by pull[bot]


openshift-ci bot commented Jan 18, 2024

Hi @pull[bot]. Thanks for your PR.

I'm waiting for an opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

terrytangyuan and others added 2 commits January 21, 2024 18:33
* generate docs

Signed-off-by: Yuan Tang <[email protected]>

* fix

Signed-off-by: Yuan Tang <[email protected]>

* fix

Signed-off-by: Yuan Tang <[email protected]>

---------

Signed-off-by: Yuan Tang <[email protected]>
1. The graph poetry project was not included in previous checks.
2. As `poetry lock --check` is deprecated, used `poetry check --lock`.

Signed-off-by: Andrews Arokiam <[email protected]>
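
The lock check described in this commit can be sketched as a small script. This is only an illustration, not the repository's actual CI step: the function names and the directory discovery are assumptions, and the only Poetry command used is `poetry check --lock`, which the commit message names as the replacement for the deprecated `poetry lock --check`.

```python
import subprocess
from pathlib import Path


def find_poetry_projects(root: Path) -> list[Path]:
    """Return directories under root that hold both pyproject.toml and poetry.lock."""
    return sorted(
        lock.parent
        for lock in root.rglob("poetry.lock")
        if (lock.parent / "pyproject.toml").is_file()
    )


def check_locks(root: Path) -> list[Path]:
    """Run `poetry check --lock` in every project and return the ones that fail."""
    failed = []
    for project in find_poetry_projects(root):
        # A non-zero exit code means the check failed for this project,
        # for example because poetry.lock is out of date.
        result = subprocess.run(["poetry", "check", "--lock"], cwd=project)
        if result.returncode != 0:
            failed.append(project)
    return failed
```

Discovering projects by walking the tree, rather than listing them by hand, is one way to avoid missing a project such as graph; whether the real check does this is not stated in the commit.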
@spolti
Member

spolti commented Jan 24, 2024

/approve
/lgtm

openshift-ci bot commented Jan 24, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: pull[bot], spolti

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

* initial commit for graph raw deployment

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* added HPA support as well for inference graph raw deployment

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* Just for local

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* Just local

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* Fix local setup

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* Sleep change

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* Fix self-signed-ca installation (#3165)

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* refactored

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* Fix logging message

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* adding unit tests

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* adding unit tests

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* removed temporary dev env changes

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* Rawdeployment mode type

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* restoring from master branch

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>

* Remove affinity test

Signed-off-by: Tanvi Thakur <[email protected]>

* add with resource test

Signed-off-by: Tanvi Thakur <[email protected]>

* added controller test for graph

Signed-off-by: Mopuri, Bharath <[email protected]>

* addressed code review comments

Signed-off-by: Mopuri, Bharath <[email protected]>

* Added copyright statement for new files

Signed-off-by: Mopuri, Bharath <[email protected]>

* Removed dependency for InferenceGraph kind on componentExtensionSpec which is specific to inference service spec

Signed-off-by: Mopuri, Bharath <[email protected]>

* restored Makefile from master branch

Signed-off-by: Mopuri, Bharath <[email protected]>

* fixed codescan issue for AtoI function

Signed-off-by: Mopuri, Bharath <[email protected]>

* fixed test failures

Signed-off-by: Mopuri, Bharath <[email protected]>

* Added unit test for hpa reconciler

Signed-off-by: Mopuri, Bharath <[email protected]>

* Added Ready status when Deployment is available in raw mode

Signed-off-by: Mopuri, Bharath <[email protected]>

* Removed unused code

Signed-off-by: Mopuri, Bharath <[email protected]>

* e2e test for ISVC in raw mode

Signed-off-by: Mopuri, Bharath <[email protected]>

* improved e2e tests for inference graph raw deployment

Signed-off-by: Mopuri, Bharath <[email protected]>

* fixed python lint errors

Signed-off-by: Mopuri, Bharath <[email protected]>

* added annotations for ig spec that triggers raw deployment

Signed-off-by: Mopuri, Bharath <[email protected]>

* made names unique b/w runs

Signed-off-by: Mopuri, Bharath <[email protected]>

* Add test

Signed-off-by: Tanvi Thakur <[email protected]>

* Adding unit test

Signed-off-by: Tanvi Thakur <[email protected]>

* fixed e2e test failure for ig.  IG is not moving to ready state because of not handling raw deployment mode condition correctly

Signed-off-by: Mopuri, Bharath <[email protected]>

* changed marker for graph test

Signed-off-by: Mopuri, Bharath <[email protected]>

* corrected test validation for raw deployment mode knative resources

Signed-off-by: Mopuri, Bharath <[email protected]>

* correct rebase errors

Signed-off-by: Mopuri, Bharath <[email protected]>

* hpa field moved from annotations to inferencegraphspec as fields

Signed-off-by: Mopuri, Bharath <[email protected]>

* committing make generate output due to InferenceGraphSpec changes

Signed-off-by: Mopuri, Bharath <[email protected]>

---------

Signed-off-by: Mopuri, Bharath <[email protected]>
Signed-off-by: Tanvi Thakur <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Co-authored-by: Mopuri, Bharath <[email protected]>
Co-authored-by: Tanvi Thakur <[email protected]>
Co-authored-by: Sivanantham <[email protected]>
@openshift-ci openshift-ci bot removed the lgtm label Jan 24, 2024
openshift-ci bot commented Jan 24, 2024

New changes are detected. LGTM label has been removed.

When Istio is installed with its CNI plugin, KServe inference services are unable to start. This is because the storage initializer is an init-container, and the network is not available to init-containers when the CNI plugin is enabled.

The typical recommendation to fix the issue is to remove init-containers and move any initialization code to a regular container. This approach would not work well with KServe, because the serving runtimes assume the model is already present on the filesystem, and running the storage initializer as a regular container would cause race conditions (the runtime would load successfully only if the storage initializer manages to pull the model before the runtime starts).

There are alternative approaches documented in https://istio.io/latest/docs/setup/additional-setup/cni/#compatibility-with-application-init-containers. All alternatives have the downside that the init-container's traffic won't be captured by Istio and won't benefit from Istio features, which should be acceptable for the KServe storage-initializer case.

These changes use the approach of running the storage initializer with the same UserID as the Istio sidecar. The UID is copied from the sidecar container to cover Istio derivatives, like Maistra.

Signed-off-by: Edgar Hernández <[email protected]>
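
The UID-copying idea can be sketched on a pod spec represented as a plain dictionary. This is a minimal illustration, not the code from this change (which lives in the Go controller): the container names `istio-proxy` and `storage-initializer` and the function name are assumptions made for the example.

```python
import copy

# Assumed container names for illustration only.
ISTIO_SIDECAR_NAME = "istio-proxy"
STORAGE_INITIALIZER_NAME = "storage-initializer"


def match_sidecar_uid(pod_spec: dict) -> dict:
    """Return a copy of pod_spec whose storage initializer runs as the sidecar's UID."""
    spec = copy.deepcopy(pod_spec)
    sidecar = next(
        (c for c in spec.get("containers", []) if c.get("name") == ISTIO_SIDECAR_NAME),
        None,
    )
    if sidecar is None:
        return spec  # no sidecar injected, nothing to copy
    # Read the UID from the sidecar instead of hard-coding it, so that
    # Istio derivatives using a different UID are covered as well.
    uid = (sidecar.get("securityContext") or {}).get("runAsUser")
    if uid is None:
        return spec
    for init in spec.get("initContainers", []):
        if init.get("name") == STORAGE_INITIALIZER_NAME:
            security_context = init.get("securityContext") or {}
            security_context["runAsUser"] = uid
            init["securityContext"] = security_context
    return spec
```

Because traffic from the sidecar's own UID is excluded from Istio's redirection, an init-container running with that UID can reach the network before the sidecar is up, at the cost of bypassing Istio features, as described above.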
@openshift-merge-robot
PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@pull pull bot added merge-conflict Resolve conflicts manually and removed approved needs-rebase labels Jan 25, 2024
@pull pull bot removed the needs-rebase label Jan 26, 2024
* Pass missing infer parameters

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Add tests

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Fix warnings

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Add equal magic method for infer types

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Avoid inplace modification of parameters in conversion

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

---------

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
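
Two of the commits above, adding an equality method for infer types and avoiding in-place modification of parameters during conversion, can be illustrated with a small sketch. The class and function below are hypothetical stand-ins, not the KServe implementation; the `bool_param`, `int64_param` and `string_param` tags are borrowed from the inference protocol's parameter encoding purely as an example target format.

```python
class InferInput:
    """Hypothetical stand-in for an inference input type (not the KServe class)."""

    def __init__(self, name, data, parameters=None):
        self.name = name
        self.data = data
        self.parameters = parameters or {}

    def __eq__(self, other):
        # Compare by value so that tests can assert on whole objects.
        if not isinstance(other, InferInput):
            return NotImplemented
        return (self.name, self.data, self.parameters) == (
            other.name,
            other.data,
            other.parameters,
        )


def to_tagged_parameters(parameters: dict) -> dict:
    """Build a new dict of tagged parameters, leaving the caller's dict untouched."""
    tagged = {}
    for key, value in parameters.items():
        if isinstance(value, bool):  # check bool first: bool is a subclass of int
            tagged[key] = {"bool_param": value}
        elif isinstance(value, int):
            tagged[key] = {"int64_param": value}
        else:
            tagged[key] = {"string_param": str(value)}
    return tagged
```

Returning a fresh dictionary means the same request parameters can be converted more than once without the first conversion corrupting the second. Note that defining `__eq__` without `__hash__` makes instances unhashable in Python, which is usually acceptable for mutable request objects.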
* feat: Support S3 transfer acceleration

Signed-off-by: Yuan Tang <[email protected]>

* Add missing field in ISVC yaml and storage.py

Signed-off-by: Yuan Tang <[email protected]>

* fix test

Signed-off-by: Yuan Tang <[email protected]>

* fix default

Signed-off-by: Yuan Tang <[email protected]>

---------

Signed-off-by: Yuan Tang <[email protected]>
* Initial huggingface server implementation

Signed-off-by: Dan Sun <[email protected]>

* Implement preprocess and predict for huggingface server

Signed-off-by: Dan Sun <[email protected]>

* Add postprocess

Signed-off-by: Dan Sun <[email protected]>

* Update lock file

Signed-off-by: Dan Sun <[email protected]>

* Update func signature

Signed-off-by: Dan Sun <[email protected]>

* Update encoding

Signed-off-by: Dan Sun <[email protected]>

* Add token classification test

Signed-off-by: Dan Sun <[email protected]>

* Add comments

Signed-off-by: Dan Sun <[email protected]>

* Fix comments

Signed-off-by: Dan Sun <[email protected]>

* Add generate endpoint

Signed-off-by: Dan Sun <[email protected]>

* Remove vllm async engine file

Signed-off-by: Dan Sun <[email protected]>

* Handle optional vllm

Signed-off-by: Dan Sun <[email protected]>

* Add async generator

Signed-off-by: Dan Sun <[email protected]>

* Fix lint

Signed-off-by: Dan Sun <[email protected]>

* Generate response streaming support

Signed-off-by: Dan Sun <[email protected]>

* rename async generate stream file

Signed-off-by: Dan Sun <[email protected]>

* Call generate endpoint directly

Signed-off-by: Dan Sun <[email protected]>

* Remove the tritonclient dependency

Signed-off-by: Dan Sun <[email protected]>

* Add huggingface dockerfile

Signed-off-by: Dan Sun <[email protected]>

* Test for triton bert case

Signed-off-by: Dan Sun <[email protected]>

* Add huggingface test

Signed-off-by: Dan Sun <[email protected]>

* Add Makefile

Signed-off-by: Dan Sun <[email protected]>

* Support gpu device

Signed-off-by: Dan Sun <[email protected]>

* Add generate parameters

Signed-off-by: Dan Sun <[email protected]>

* Fix generate tests

Signed-off-by: Dan Sun <[email protected]>

* Run huggingface test

Signed-off-by: Dan Sun <[email protected]>

---------

Signed-off-by: Dan Sun <[email protected]>
…elcars (#3384)

Modelcar: Add mount of shared volume to transformer container

Adding the missing mount of the emptyDir volume that holds the symbolic link to the model data in the model sidecar to a transformer container, if it is configured.

Signed-off-by: Roland Huß <[email protected]>
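
The fix described here, mounting the shared volume into the transformer container only when one is configured, can be sketched on a pod spec held as a plain dictionary. This is an illustration under assumptions, not the actual Go change: the function name is hypothetical, and the container name, volume name and mount path are supplied by the caller rather than taken from KServe.

```python
import copy


def add_shared_mount(pod_spec: dict, container_name: str, volume_name: str, mount_path: str) -> dict:
    """Return a copy of pod_spec with the shared volume mounted in the named container.

    If the container does not exist (no transformer configured), the spec is
    returned unchanged. An existing mount of the same volume is not duplicated.
    """
    spec = copy.deepcopy(pod_spec)
    for container in spec.get("containers", []):
        if container.get("name") != container_name:
            continue
        mounts = container.get("volumeMounts") or []
        if not any(m.get("name") == volume_name for m in mounts):
            mounts.append({"name": volume_name, "mountPath": mount_path})
        container["volumeMounts"] = mounts
    return spec
```

Making the helper idempotent keeps repeated passes over the same pod from producing duplicate volume mounts.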
chore: cloudevents/sdk-go was migrated to v2; however, v1 was still present in
       go.mod. It was being used by the pkg/logger/worker.go file to set a field that was
       no longer in use, so it was removed.

Signed-off-by: Spolti <[email protected]>
@israel-hdez
Closing, because in #179 there was a code sync. The bot would open a new PR with some additional commits that went in upstream after #179 got ready.

@israel-hdez
/close

openshift-ci bot commented Jan 29, 2024

@israel-hdez: Closed this PR.

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot closed this Jan 29, 2024
Labels
⤵️ pull merge-conflict Resolve conflicts manually
Projects
Status: Done
10 participants