Code sync for upstream v0.14.0 #421

israel-hdez · 2024-10-22T20:59:58Z

No description provided.

* propagate trc bool across vllm init Signed-off-by: Calvin Woo <[email protected]> Signed-off-by: calvin d. woo <[email protected]> * use args directly to avoid undefined var Signed-off-by: Calvin Woo <[email protected]> Signed-off-by: calvin d. woo <[email protected]> * Remove trailing space Signed-off-by: Dan Sun <[email protected]> Signed-off-by: calvin d. woo <[email protected]> * move params to newline Signed-off-by: calvin d. woo <[email protected]> --------- Signed-off-by: Calvin Woo <[email protected]> Signed-off-by: calvin d. woo <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>

The KServe Python SDK README.md uses relative URLs that work well on GitHub but return a 404 error when visited on PyPI. This change updates the README.md to use absolute URLs that work well on both GitHub and PyPI. Signed-off-by: kevinbazira <[email protected]>

check empty model final. Signed-off-by: HAO <[email protected]> Co-authored-by: koshino17 <[email protected]>

* Fix No model ready error in multi model serving - Fixes the regression introduced by kserve#3275 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Mark transformer model ready in init method Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Initial implementation of inference client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Use Inference client for e2e tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Upgrade pytest-asyncio to 0.23.4 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix mutable object initialization in default parameters Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix graph e2e tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix pmml test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add explain, support dict response, use inference client for internal requests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix inference graph test and grpc headers Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Remove v1 datamodels Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Introduce protocol in client config Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Support inference graph Signed-off-by: Sivanantham Chinnaiyan <[email protected]> remove logging configs Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Update default timeout to 60 seconds Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add retry config for grpc client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix infer model_name parameter Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add tests for graph endpoint Signed-off-by: Sivanantham Chinnaiyan <[email protected]> debug Signed-off-by: Sivanantham Chinnaiyan <[email protected]> fix http client param mismatch Signed-off-by: Sivanantham Chinnaiyan <[email protected]> skip graph test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> fix timeout in grpc client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix url construction Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix explain Signed-off-by: Sivanantham 
Chinnaiyan <[email protected]> * configure logger for e2e tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix grpc retry config Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Increase request timeout Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * configure logger for e2e tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Fix grpc retry config Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Increase request timeout Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Use fixtures for rest client Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Fix model name not properly parsed by inference graph Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Handle single string arg with excess whitespace Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Handle duplicate arguments Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>

empty commit Signed-off-by: Spolti <[email protected]>

Use add_generation_prompt for chat template Signed-off-by: Dattu Sharma <[email protected]>

* Deduplicate the names for the additional domain names Signed-off-by: Vincent Hou <[email protected]> * Refactoring the functions Signed-off-by: Vincent Hou <[email protected]> --------- Signed-off-by: Vincent Hou <[email protected]>

virtual service case insensitive Signed-off-by: Andrews Arokiam <[email protected]>

* Install packages needed for model load Signed-off-by: Gavrish Prabhu <[email protected]> * make all apt get into a single line Signed-off-by: Gavrish Prabhu <[email protected]> --------- Signed-off-by: Gavrish Prabhu <[email protected]>

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

…serve#3789) * Add readiness probe for mlserver in CI Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Increase memory limit for pmml test to prevent OOMKilled and read timeout error Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Fix logprobs Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix a scenario where stream completion fails if echo is true and logprobs is nil Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix a scenario where completion fails if the prompt is token_ids and echo is set to true Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Respect tokenizer revision Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add workaround for adding None to token_logprobs and top_logprobs Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

agent watcher unit test is always flaky so increase timeout to make it stable Signed-off-by: jooho lee <[email protected]>

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Add tests for vLLM Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * resolve comments Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Uncomment tests for fixed bugs Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

….3 (kserve#3812) * Upgrade serving runtime python version to 3.11 and debian to bookworm Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Upgrade poetry to 1.8.3 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Upgrade openjdk to 17 for pmml Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix 'AS' casing warning Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix pmml server Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Bump vLLM to 0.5.3.post1 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update makefile Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * approx probability comparison Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Set multiprocessing method to spawn Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

…se 'spawn' for multiprocessing (kserve#3757) * Refactor model server to let uvicorn handle multiple workers - Refactored the ModelServer to let uvicorn handle multiple workers. This will remove the bottleneck of using 'fork' for multiprocessing - Make FastAPI app instance easily accessible across the project so that users can easily add middlewares and custom exception handlers for custom models. - Use uvloop eventpolicy Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add middleware example Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add e2e test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove nest_asyncio in art explainer Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove uvloop Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix python tests Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * revert art explainer Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove monkeypatch Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove redundant future exception logging Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Signed-off-by: Spolti <[email protected]>

* Make ray serve an optional dependency Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Unify the log configuration using kserve logger (kserve#3577) * Configure logging for serving runtimes Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add pyyaml dependency Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * black format Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * fix pyproject.toml Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * cleanup logger for e2e Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Modify logger format to include func name Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Log model download time. Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Allow disabling logger configuration and deprecate logger related arg in model server Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase master Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Resolve comments Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * pyyaml=^6.0.0 to fix build failure Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove logger related parameters from model server Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * import model_server Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix lint Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix linting Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase, minor fixes and add e2e test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: Sivanantham 
Chinnaiyan <[email protected]> Co-authored-by: Curtis Maddalozzo <[email protected]> Co-authored-by: Dan Sun <[email protected]>

* Update aif example chore: Update aif explainer example. - Bump KServe to 0.13.0, which brings some library updates and fixes a few security alerts in this example. - update the scikit-learn package name Signed-off-by: Spolti <[email protected]> * move the local instructions to the README Signed-off-by: Spolti <[email protected]> * empty commit Signed-off-by: Spolti <[email protected]> --------- Signed-off-by: Spolti <[email protected]>

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

…ve#3737) These changes introduce the possibility to configure KServe with its own Istio local gateway, to partially decouple KServe from the Knative local gateway. Typically, it is OK to re-use the already configured Knative local gateway for KServe (as long as the configs do not conflict). However, there are cases where having a dedicated local gateway for KServe is beneficial. Some examples: * To have the ability to use strict mTLS in Istio * To reduce some pressure on the Knative local gateway by having a dedicated gateway deployment (requests would still hit the Knative gateway, but only once rather than twice) * To be able to configure TLS on cluster-local hostnames (Knative support is still experimental) To have a dedicated Gateway in KServe, configurations similar to Knative's need to be done. At the very least, and if not having a dedicated gateway deployment, a v1/Service and an Istio Gateway resource need to be created for KServe. These resources would need to be configured in _localGateway_ and _localGatewayService_. KServe still needs to rely on Knative routing for the KSVCs it creates. Thus, after handling an incoming request and resolving its target, the request needs to be forwarded to Knative. This is the reason for introducing a new `knativeLocalGatewayService` in the ConfigMap. The removed `ingressService` seems to be unused. Apparently, it became unused when the v1alpha1 API of the InferenceServices was deprecated and removed. Signed-off-by: Edgar Hernández <[email protected]>
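
As a rough illustration of the three settings this commit message names, the sketch below models the gateway part of the ConfigMap as a Python dictionary and checks that the keys are present. Only the key names `localGateway`, `localGatewayService`, and `knativeLocalGatewayService` come from the commit message; the values, the namespace, and the helper `missing_gateway_keys` are hypothetical and not taken from KServe.

```python
import json

# Key names come from the commit message; everything else is illustrative.
REQUIRED_GATEWAY_KEYS = (
    "localGateway",
    "localGatewayService",
    "knativeLocalGatewayService",
)


def missing_gateway_keys(ingress_config: dict) -> list:
    """Return the gateway keys that are absent or empty in the config."""
    return [key for key in REQUIRED_GATEWAY_KEYS if not ingress_config.get(key)]


# Hypothetical values: a dedicated KServe gateway plus the Knative one
# that requests are forwarded to after KServe resolves the target.
example_ingress = {
    "localGateway": "example-namespace/kserve-local-gateway",
    "localGatewayService": "kserve-local-gateway.example-namespace.svc.cluster.local",
    "knativeLocalGatewayService": "knative-local-gateway.example-namespace.svc.cluster.local",
}

if __name__ == "__main__":
    print(json.dumps(example_ingress, indent=2))
    print("missing:", missing_gateway_keys(example_ingress))
```

A check like this only confirms that the keys are set; it does not verify that the referenced Service and Gateway resources exist in the cluster.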

* Add support for Azure DNS zone endpoints Signed-off-by: tjandy98 <[email protected]> * Add test cases for Azure Blob and File Share URI pattern matching Signed-off-by: tjandy98 <[email protected]> * flake8 Signed-off-by: tjandy98 <[email protected]> * black Signed-off-by: tjandy98 <[email protected]> --------- Signed-off-by: tjandy98 <[email protected]>
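
A minimal sketch of what URI pattern matching for both endpoint styles could look like, assuming DNS zone endpoints take the form `https://<account>.z<NN>.blob.storage.azure.net/...`. The patterns and the function `parse_blob_uri` are illustrative and are not the ones used in the KServe storage module.

```python
import re

# Classic endpoint: <account>.blob.core.windows.net
# DNS zone endpoint (assumed form): <account>.z<NN>.blob.storage.azure.net
BLOB_PATTERNS = [
    re.compile(
        r"^https://(?P<account>[^./]+)\.blob\.core\.windows\.net/(?P<path>.+)$"
    ),
    re.compile(
        r"^https://(?P<account>[^./]+)\.z\d{1,2}\.blob\.storage\.azure\.net/(?P<path>.+)$"
    ),
]


def parse_blob_uri(uri: str):
    """Return (account, path) if the URI matches a known blob pattern, else None."""
    for pattern in BLOB_PATTERNS:
        match = pattern.match(uri)
        if match:
            return match.group("account"), match.group("path")
    return None


if __name__ == "__main__":
    # Hypothetical account and container names.
    print(parse_blob_uri("https://acct.blob.core.windows.net/container/model"))
    print(parse_blob_uri("https://acct.z13.blob.storage.azure.net/container/model"))
```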

Signed-off-by: Dan Sun <[email protected]>

* Add logging request feature for vLLM Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add log request feature for huggingface Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Signed-off-by: Dan Sun <[email protected]>

* Consolidate into one commit Signed-off-by: Jin Dong <[email protected]> * Fix configmap format Signed-off-by: Jin Dong <[email protected]> * Fix configmap Signed-off-by: Jin Dong <[email protected]> * Log configmap read error Signed-off-by: Jin Dong <[email protected]> * fix naming Signed-off-by: Dan Sun <[email protected]> * Update comments Signed-off-by: Jin Dong <[email protected]> * Add enabled flag to configmap and avoid cluster resource check in isvc defaulter Signed-off-by: Jin Dong <[email protected]> * move client into the local model block Signed-off-by: Dan Sun <[email protected]> * Fix lint Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>

* Sync helm chart with kustomize Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update manifest generation script to sync helm charts Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Make kserve-addressable-resolver role optional Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Prepare for 0.14.0-rc1 release Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update release process Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Comment out crd sync script in make Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix helm template syntax Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* add a new API for multi-node/multi-gpu Signed-off-by: jooho lee <[email protected]> * fix gitaction Signed-off-by: jooho lee <[email protected]> * fix merging conflict Signed-off-by: jooho lee <[email protected]> * fix gitaction fail Signed-off-by: jooho lee <[email protected]> * regenerate codegen/manifests Signed-off-by: jooho lee <[email protected]> * Apply suggestions from code review Co-authored-by: Dan Sun <[email protected]> Signed-off-by: Jooho Lee <[email protected]> * remove unnecessary comment Signed-off-by: jooho lee <[email protected]> * change the type of workerSpec in isvc to PodSpec Signed-off-by: jooho lee <[email protected]> * update controller-gen version Signed-off-by: jooho lee <[email protected]> * remove replicas from workerSpec Signed-off-by: jooho lee <[email protected]> * fix conflict merging Signed-off-by: jooho lee <[email protected]> * added size(replicas) for workerSpec again Signed-off-by: jooho lee <[email protected]> * add WorkerSpec to inferenceService Signed-off-by: jooho lee <[email protected]> * fix go linter Signed-off-by: jooho lee <[email protected]> --------- Signed-off-by: jooho lee <[email protected]> Signed-off-by: Jooho Lee <[email protected]> Signed-off-by: Jooho Lee <[email protected]> Co-authored-by: Dan Sun <[email protected]>

…#3924) * fix openapigen.sh that can be executed from kserve dir Signed-off-by: jooho lee <[email protected]> * regenerate codegen/manifests Signed-off-by: jooho lee <[email protected]> * Update go.sum Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: jooho lee <[email protected]> Signed-off-by: Dan Sun <[email protected]> Co-authored-by: Dan Sun <[email protected]>

* Support python 3.12 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update dependencies Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Update deps to support 3.12 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove python 3.8 support Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Remove skip for infer client test Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix port forward Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix sklearn pandas dep Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * skip pydantic v1 test for py 3.12 Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Add setuptools dep for pmml Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Fix lgb Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Include setuptools for paddle Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Include setuptools for huggingface Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase Signed-off-by: Sivanantham Chinnaiyan <[email protected]> * Rebase Signed-off-by: Sivanantham Chinnaiyan <[email protected]> --------- Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Bump version to 0.13.0-rc0 (kserve#3665) Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: jordanyono <[email protected]> * fixing docs Signed-off-by: jordanyono <[email protected]> * fix spelling mistake Signed-off-by: jordanyono <[email protected]> --------- Signed-off-by: Curtis Maddalozzo <[email protected]> Signed-off-by: jordanyono <[email protected]> Co-authored-by: Curtis Maddalozzo <[email protected]>

* Fix local testing Signed-off-by: Dan Sun <[email protected]> * Fix codegen Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Add a flag for automount serviceaccount Signed-off-by: Jin Dong <[email protected]> * Set default to false Signed-off-by: Jin Dong <[email protected]> * Default to true Signed-off-by: Jin Dong <[email protected]> * Fix test error Signed-off-by: Jin Dong <[email protected]> * Update openapi generated.go Signed-off-by: Jin Dong <[email protected]> * Fix python lint Signed-off-by: Jin Dong <[email protected]> * Fix config loading Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>

…ainer (kserve#3985) * Do not set security context on the storage initializer from user container Signed-off-by: Jin Dong <[email protected]> * Add securityContext to the default storage container in the helm chart Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>

This adds the model container as an init-container to mitigate a race condition that would happen if the model container is not present on the cluster-node. The race condition happens if the cluster is able to fetch and start the runtime container before the modelcar is fetched. This would cause the runtime to terminate with an error. By configuring the model container as an init-container, the runtime won't start until the modelcar is fetched. Although there is still the risk of a race condition when the cluster schedules the runtime container first, the pod should stabilize after a few restarts of the runtime container and should either prevent a CrashLoopBackOff event on the pod, or the crash event would finish quickly. This improves compatibility with the runtimes, which can now stay agnostic to the modelcar implementation until better techniques (like native sidecars and OCI volume mounts) become mature. Signed-off-by: Edgar Hernández <[email protected]>
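
The idea described in this commit message can be sketched as a small transformation on a pod spec: the modelcar image is also listed as an init container, so the image is pulled before the runtime container starts. This is a sketch under stated assumptions, not the controller's code: the function `add_modelcar_prefetch`, the container names, the images, and the no-op command are all hypothetical.

```python
import copy


def add_modelcar_prefetch(pod_spec: dict, modelcar_name: str) -> dict:
    """Return a copy of pod_spec with the modelcar image also run as an init container."""
    spec = copy.deepcopy(pod_spec)
    modelcar = next(
        (c for c in spec["containers"] if c["name"] == modelcar_name), None
    )
    if modelcar is None:
        raise ValueError(f"container {modelcar_name!r} not found in pod spec")
    init_container = {
        "name": f"{modelcar_name}-init",
        "image": modelcar["image"],
        # Assumption: a trivial command that exits at once, so this step
        # only forces the image to be fetched onto the node.
        "command": ["sh", "-c", "true"],
    }
    spec.setdefault("initContainers", []).append(init_container)
    return spec


# Hypothetical pod spec with a runtime container and a modelcar sidecar.
example_pod_spec = {
    "containers": [
        {"name": "runtime", "image": "example.com/runtime:latest"},
        {"name": "modelcar", "image": "example.com/models/demo:1"},
    ]
}

if __name__ == "__main__":
    patched = add_modelcar_prefetch(example_pod_spec, "modelcar")
    print(patched["initContainers"])
```

Because init containers must finish before the regular containers start, the runtime is only launched once the modelcar image is already on the node.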

* Initial commit for headers passing issue Signed-off-by: Andrews Arokiam <[email protected]> * modifying the e2e test for rebase conflict Signed-off-by: Andrews Arokiam <[email protected]> * bug fix on unittest Signed-off-by: Andrews Arokiam <[email protected]> * review changes Signed-off-by: Andrews Arokiam <[email protected]> * fix for test failure Signed-off-by: Andrews Arokiam <[email protected]> * bug fix on e2e test Signed-off-by: Andrews Arokiam <[email protected]> * overriding the entrypoint of custom model images Signed-off-by: Andrews Arokiam <[email protected]> * custom response header Signed-off-by: Andrews Arokiam <[email protected]> * fix for unittest failure Signed-off-by: Andrews Arokiam <[email protected]> * added custom response headers in post process Signed-off-by: Andrews Arokiam <[email protected]> * added predict time latency in example response header Signed-off-by: Andrews Arokiam <[email protected]> * fix OOM --------- Signed-off-by: Andrews Arokiam <[email protected]> Co-authored-by: Dan Sun <[email protected]>

* security update Signed-off-by: udai <[email protected]> * adding sign off Signed-off-by: udai <[email protected]> --------- Signed-off-by: udai <[email protected]>

Signed-off-by: Jin Dong <[email protected]>

* temp commit Signed-off-by: Jin Dong <[email protected]> * python-release.sh Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>

…14-upgrade Code sync with upstream, up to v0.14. Signed-off-by: Edgar Hernández <[email protected]>

Signed-off-by: Edgar Hernández <[email protected]>

openshift-ci · 2024-10-25T19:33:16Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: israel-hdez, spolti

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [israel-hdez,spolti]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci · 2024-10-28T19:01:31Z

@israel-hdez: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/prow/e2e-raw | `c9453bf` | link | true | `/test e2e-raw` |
| ci/prow/e2e-slow | `c9453bf` | link | true | `/test e2e-slow` |
| ci/prow/e2e-fast | `c9453bf` | link | true | `/test e2e-fast` |

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

calwoo and others added 30 commits June 21, 2024 06:18

Fix model is ready even if there is no model (kserve#3275)

d19e310

check empty model final. Signed-off-by: HAO <[email protected]> Co-authored-by: koshino17 <[email protected]>

Fix logprobs for vLLM (kserve#3738)

d4349ea

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

pillow - Buffer Overflow (kserve#3598)

ca691f7

empty commit Signed-off-by: Spolti <[email protected]>

Use add_generation_prompt while creating chat template (kserve#3775)

164ea83

Use add_generation_prompt for chat template Signed-off-by: Dattu Sharma <[email protected]>

Make Virtual Service case-insensitive (kserve#3779)

29731be

virtual service case insensitive Signed-off-by: Andrews Arokiam <[email protected]>

Make gRPC max message length configurable (kserve#3741)

31d2395

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Increase timeout to make unit test stable (kserve#3808)

de96bd3

agent watcher unit test is always flaky so increase timeout to make it stable Signed-off-by: jooho lee <[email protected]>

Upgrade CI deps (kserve#3822)

32d3fc0

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Update golang for docs/Dockerfile to 1.21 (kserve#3761)

d73f1ca

Signed-off-by: Spolti <[email protected]>

Use helm for quick installation (kserve#3813)

f9c73c2

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Fix failed build for knativeLocalGatewayService (kserve#3866)

a16f2d7

Signed-off-by: Dan Sun <[email protected]>

Bump vLLM to 0.5.4 (kserve#3874)

656f65e

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

yuzisun and others added 17 commits September 30, 2024 08:20

Update OWNERS file (kserve#3966)

b344a0a

Signed-off-by: Dan Sun <[email protected]>

Fix openssl vulnerability CWE-1395 (kserve#3975)

17725be

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Fix kserve local testing env (kserve#3981)

70e12ff

* Fix local testing Signed-off-by: Dan Sun <[email protected]> * Fix codegen Signed-off-by: Dan Sun <[email protected]> --------- Signed-off-by: Dan Sun <[email protected]>

Fix streaming response not working properly with logger (kserve#3847)

dec8c14

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

Torchserve security update (kserve#3774)

dbceef3

* security update Signed-off-by: udai <[email protected]> * adding sign off Signed-off-by: udai <[email protected]> --------- Signed-off-by: udai <[email protected]>

Pin ubuntu 22.04 for minikube setup action (kserve#3994)

82234c0

Signed-off-by: Jin Dong <[email protected]>

KServe 0.14 Release (kserve#3988)

7e43642

* temp commit Signed-off-by: Jin Dong <[email protected]> * python-release.sh Signed-off-by: Jin Dong <[email protected]> --------- Signed-off-by: Jin Dong <[email protected]>

openshift-ci bot requested review from Jooho and spolti October 22, 2024 21:00

openshift-ci bot added the approved label Oct 22, 2024

israel-hdez added 3 commits October 25, 2024 11:27

Merge remote-tracking branch 'kserve/release-0.14' into j9436-kserve0…

befa4d6

…14-upgrade Code sync with upstream, up to v0.14. Signed-off-by: Edgar Hernández <[email protected]>

Fix poetry lockfiles after v0.14.0 code sync

53de246

Signed-off-by: Edgar Hernández <[email protected]>

Restore compatibility with go v1.21

b7a868f

Signed-off-by: Edgar Hernández <[email protected]>

israel-hdez force-pushed the j9436-kserve014-upgrade branch from 18c0f1b to b7a868f Compare October 25, 2024 17:40

spolti approved these changes Oct 25, 2024

Merge branch 'odh-master' into j9436-kserve014-upgrade

c9453bf

israel-hdez merged commit 733c1c3 into opendatahub-io:master Oct 28, 2024
24 of 29 checks passed

israel-hdez deleted the j9436-kserve014-upgrade branch October 28, 2024 19:54
