-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[pull] master from kserve:master #194
Conversation
* Pass missing infer parameters Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix warnings Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add equal magic method for infer types Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Avoid inplace modification of parameters in conversion Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* feat: Support S3 transfer acceleration Signed-off-by: Yuan Tang <[email protected]> * Add missing field in ISVC yaml and storage.py Signed-off-by: Yuan Tang <[email protected]> * fix test Signed-off-by: Yuan Tang <[email protected]> * fix default Signed-off-by: Yuan Tang <[email protected]> --------- Signed-off-by: Yuan Tang <[email protected]>
* Initial huggingface server implementation Signed-off-by: Dan Sun <[email protected]> * Implement preprocess and predict for huggingface server Signed-off-by: Dan Sun <[email protected]> * Add postprocess Signed-off-by: Dan Sun <[email protected]> * Update lock file Signed-off-by: Dan Sun <[email protected]> * Update func signature Signed-off-by: Dan Sun <[email protected]> * Update encoding Signed-off-by: Dan Sun <[email protected]> * Add token classification test Signed-off-by: Dan Sun <[email protected]> * Add comments Signed-off-by: Dan Sun <[email protected]> * Fix comments Signed-off-by: Dan Sun <[email protected]> * Add generate endpoint Signed-off-by: Dan Sun <[email protected]> * Remove vllm async engine file Signed-off-by: Dan Sun <[email protected]> * Handle optional vllm Signed-off-by: Dan Sun <[email protected]> * Add async generator Signed-off-by: Dan Sun <[email protected]> * Fix lint Signed-off-by: Dan Sun <[email protected]> * Generate response streaming support Signed-off-by: Dan Sun <[email protected]> * rename async generate stream file Signed-off-by: Dan Sun <[email protected]> * Call generate endpoint directly Signed-off-by: Dan Sun <[email protected]> * Remove the tritonclient dependency Signed-off-by: Dan Sun <[email protected]> * Add huggingface dockerfile Signed-off-by: Dan Sun <[email protected]> * Test for triton bert case Signed-off-by: Dan Sun <[email protected]> * Add huggingface test Signed-off-by: Dan Sun <[email protected]> * Add Makefile Signed-off-by: Dan Sun <[email protected]> * Support gpu device Signed-off-by: Dan Sun <[email protected]> * Add gnereate parameters Signed-off-by: Dan Sun <[email protected]> * Fix generate tests Signed-off-by: Dan Sun <[email protected]> * Run huggingface test Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>
…elcars (#3384) Modelcar: Add mount of shared volume to transformer container Adding the missing mount of the emptyDir volume that holds the symbolic link to the model data in the model sidecar to a transformer container, if it is configured. Signed-off-by: Roland Huß <[email protected]>
chore: The cloudevents/sdk-go was migrated to v2, however, the v1 is still present in the go.mod. It was being used bu the pkg/logger/worker.go file to set a field that was not being in use anymore, thus it was removed Signed-off-by: Spolti <[email protected]>
…dency to >=2.8.1, <3.0.0 (#3389) * Bump google cloud storage to fix warnings Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix poetry deprecated field warning Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Restrict ray dependency to ^2.8.1 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Hi @pull[bot]. Thanks for your PR. I'm waiting for a opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
* telepresence-setup.sh Signed-off-by: Roland Huß <[email protected]> * Add hint for downloading telepresence Signed-off-by: Roland Huß <[email protected]> --------- Signed-off-by: Roland Huß <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Yuan Tang <[email protected]>
* Change docker data directory to /mnt/docker Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Change huggingface test virtual env location Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Use symlink to move docker data dir Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * skip huggingface tests for python 3.8 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
…tom handler (#3405) Add exception handler for model server and Add ability to specify custom exception handler Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Updated python-release script. Signed-off-by: Andrews Arokiam <[email protected]>
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: pull[bot] The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
* feat: Add HuggingFace runtime out-of-the-box support Signed-off-by: Yuan Tang <[email protected]> * Update violation list Signed-off-by: Yuan Tang <[email protected]> * Add to charts Signed-off-by: Yuan Tang <[email protected]> * Regenerate everything Signed-off-by: Yuan Tang <[email protected]> * fix lint Signed-off-by: Yuan Tang <[email protected]> * fix lint Signed-off-by: Yuan Tang <[email protected]> * Empty-Commit Signed-off-by: Yuan Tang <[email protected]> * Empty-Commit Signed-off-by: Yuan Tang <[email protected]> --------- Signed-off-by: Yuan Tang <[email protected]>
Update link to Slack instructions Signed-off-by: Yuan (Terry) Tang <[email protected]>
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
* Fix parameters value type conversion Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix datatype check in infer type Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix datatype check in infer type Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This is a small fix to the test_raw_logger.py/test_kserve_logger E2E test. This E2E test was creating a `message-dumper-raw` that was dependant on a Serverless setup, leading to the E2E to fail on an environment where Istio nor KNative are available. The fix is to add the right annotation to `message-dumper-raw` to deploy it in Raw mode, and adjust the logger URL of the `isvc-logger-raw` to point to the Kubernetes Service, rather than the KNative service. Signed-off-by: Edgar Hernández <[email protected]>
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
2665ac5
into
opendatahub-io:master
See Commits and Changes for more details.
Created by
pull[bot]
Can you help keep this open source service alive? 💖 Please sponsor : )