Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pull] master from kserve:master #290

Closed
wants to merge 66 commits into from
Closed

Conversation

pull[bot]
Copy link

@pull pull bot commented Apr 7, 2024

See Commits and Changes for more details.


Created by pull[bot]

Can you help keep this open source service alive? 💖 Please sponsor : )

spolti and others added 16 commits March 23, 2024 20:31
chore:	fixes the GH [Alert](https://github.com/kserve/kserve/security/code-scanning/12080).
	filepath.Clean sanitizes the directory path and remove any unnecessary components (such as . and ..)

Signed-off-by: Spolti <[email protected]>
Due to changes in 39b8a67 which added `reinvocationPolicy: IfNeeded` to the WebHook configuration, the injection called can (and will be) called multiple times, and needs to be idempotent (which is a good thing anyway).

This commit fixes the array field handling and adding volumes, volumemounts and containers only if they not already had been added.

Fixes #3506

Signed-off-by: Roland Huß <[email protected]>
Remove redundant helm chart affinity

labels: 
- app.kubernetes.io/managed-by 
- app.kubernetes.io/instance
- app.kubernetes.io/name: 

with value modelmesh-controller cause affinity to non existent helm chart

Signed-off-by: Ondrej Trojan <[email protected]>
update codeQL to v3

chore:	Update CodeQL to V3 to get rid of this warning:
	`Warning: CodeQL Action v2 will be deprecated on December 5th, 2024`
	Plus, attenpt to fix the Snyk Container scan failures due errors when trying to
	upload the SERIF file:
	`Processing sarif files: ["application/storage-initializer/docker.snyk.sarif"]
	  Uploading results
	  Successfully uploaded results
	Waiting for processing to finish
	Error: Code Scanning could not process the submitted SARIF file:
	could not convert rules: invalid security severity value, is not a number: null
	ConfigurationError: Code Scanning could not process the submitted SARIF file:
	could not convert rules: invalid security severity value, is not a number: null
	    at run (/home/runner/work/_actions/github/codeql-action/v2/lib/upload-sarif-action.js:65:15)`

Signed-off-by: Spolti <[email protected]>
* switch e2e test inference graph to raw mode

Signed-off-by: Andrews Arokiam <[email protected]>

* download xgb server image

Signed-off-by: Andrews Arokiam <[email protected]>

---------

Signed-off-by: Andrews Arokiam <[email protected]>
Pad left for decode-only architecture models.

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* CVE-2024-24762 - update fastapi to 0.109.1

chore:	Fix [CVE-2024-24762](https://www.cve.org/CVERecord?id=CVE-2024-24762) - fastapi Regular Expression Denial of Service (ReDoS)
	Plus, update Ray to 2.10 to allow updating fastapi. On previous versions of Ray
	the fastapi version was pinned, which was preventing the fastapi version update.

use the new handle api:

From Ray Serve docs:
Ray 2.7 introduces a new {mod}`DeploymentHandle <ray.serve.handle.DeploymentHandle>` API that will replace the existing `RayServeHandle` and `RayServeSyncHandle` APIs.

Signed-off-by: Spolti <[email protected]>

* add link to about the RayServeHandle deprecation

Signed-off-by: Spolti <[email protected]>

---------

Signed-off-by: Spolti <[email protected]>
* wip

Signed-off-by: Yuan Tang <[email protected]>

* comment out

Signed-off-by: Yuan Tang <[email protected]>

* fix wf

Signed-off-by: Yuan Tang <[email protected]>

* helm test

Signed-off-by: Yuan Tang <[email protected]>

* remove mlserver relate tests

Signed-off-by: Yuan Tang <[email protected]>

* fix lint

Signed-off-by: Yuan Tang <[email protected]>

* sklearnserver runtime

Signed-off-by: Yuan Tang <[email protected]>

* Fix test

Signed-off-by: Yuan Tang <[email protected]>

* fix

Signed-off-by: Yuan Tang <[email protected]>

* disable check

Signed-off-by: Yuan Tang <[email protected]>

* reunused imports

Signed-off-by: Yuan Tang <[email protected]>

* Add back mlserver

Signed-off-by: Yuan Tang <[email protected]>

* pre-commit fix

Signed-off-by: Yuan Tang <[email protected]>

* update storage url

Signed-off-by: Yuan Tang <[email protected]>

* fix build

Signed-off-by: Yuan Tang <[email protected]>

* fix codegen

Signed-off-by: Yuan Tang <[email protected]>

* revert uri

Signed-off-by: Yuan Tang <[email protected]>

* int_contents

Signed-off-by: Yuan Tang <[email protected]>

* Remove unused script

Signed-off-by: Yuan Tang <[email protected]>

* remove dockerfile

Signed-off-by: Yuan Tang <[email protected]>

* Empty-Commit

Signed-off-by: Yuan Tang <[email protected]>

* Empty-Commit

Signed-off-by: Yuan Tang <[email protected]>

* Empty-Commit

Signed-off-by: Yuan Tang <[email protected]>

---------

Signed-off-by: Yuan Tang <[email protected]>
* Auto-format all Python files

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Use black for linting

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Don't run poetry check on root pyproject.toml

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Re-add flake8 linting

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Fix linting errors

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Add python path

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Fix linting

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Fix circular dependency

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Fix circular dependency

Signed-off-by: Curtis Maddalozzo <[email protected]>

---------

Signed-off-by: Curtis Maddalozzo <[email protected]>
…3558)

* support model revision and tokenizer revision

Signed-off-by: Lize Cai <[email protected]>

* point to specified commit in test case

Signed-off-by: Lize Cai <[email protected]>

* format code

Signed-off-by: Lize Cai <[email protected]>

---------

Signed-off-by: Lize Cai <[email protected]>
Signed-off-by: Dan Sun <[email protected]>
Co-authored-by: Dan Sun <[email protected]>
* OpenAI data models and endpoints from vLLM

Signed-off-by: Tessa Pham <[email protected]>

* more components for OpenAI endpoints

Signed-off-by: Tessa Pham <[email protected]>

* add OpenAI endpoints to router

Signed-off-by: Tessa Pham <[email protected]>

* modify generate() in data plane

Signed-off-by: Tessa Pham <[email protected]>

* class OpenAIModel

Signed-off-by: Tessa Pham <[email protected]>

* delete and rename files

Signed-off-by: Tessa Pham <[email protected]>

* add create_chat_completion() to OpenAIModel

Signed-off-by: Tessa Pham <[email protected]>

* update routers and lint

Signed-off-by: Tessa Pham <[email protected]>

* Implement streaming

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Add tests for OpenAI data conversion

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Register OpenAI endpoints when appropriate

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Add comments

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Add tests for create_completion and create_chat_completion

Signed-off-by: Curtis Maddalozzo <[email protected]>

* Remove completion types from dataplane methods

Signed-off-by: Curtis Maddalozzo <[email protected]>

* WIP

Signed-off-by: Curtis Maddalozzo <[email protected]>

* fix lint errors

Signed-off-by: Tessa Pham <[email protected]>

* update poetry.lock

Signed-off-by: Tessa Pham <[email protected]>

* update poetry.lock files

Signed-off-by: Tessa Pham <[email protected]>

* add dependency

Signed-off-by: Tessa Pham <[email protected]>

* fix test

Signed-off-by: Tessa Pham <[email protected]>

* revert poetry.lock files

Signed-off-by: Tessa Pham <[email protected]>

* add .itermconfig to .gitignore

Signed-off-by: Tessa Pham <[email protected]>

* add docker-compose.yml to .gitignore

Signed-off-by: Tessa Pham <[email protected]>

* fix build error

Signed-off-by: Tessa Pham <[email protected]>

* fix function descriptions

Signed-off-by: Tessa Pham <[email protected]>

* increase limit for model decompression size

Signed-off-by: Tessa Pham <[email protected]>

* add license & autoformat

Signed-off-by: Tessa Pham <[email protected]>

* make openai dependency mandatory

Signed-off-by: Tessa Pham <[email protected]>

* openai dependency back to optional

Signed-off-by: Tessa Pham <[email protected]>

* fix openai module import error

Signed-off-by: Tessa Pham <[email protected]>

* fix JSON unmarshalling of headers

Signed-off-by: Tessa Pham <[email protected]>

* drop formatting changes in unrelated files

Signed-off-by: Tessa Pham <[email protected]>

* fix openai_is_available()

Signed-off-by: Tessa Pham <[email protected]>

* black reformat

Signed-off-by: Tessa Pham <[email protected]>

---------

Signed-off-by: Tessa Pham <[email protected]>
Signed-off-by: Curtis Maddalozzo <[email protected]>
Co-authored-by: Curtis Maddalozzo <[email protected]>
* Fix python linter

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Format e2e tests

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Format hack, docs

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Add py-fmt target

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Update python lint workflow

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

* Rebase master

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>

---------

Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Copy link

openshift-ci bot commented Apr 7, 2024

Hi @pull[bot]. Thanks for your PR.

I'm waiting for a opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

* docs: Move Alibi explainer to docs

Signed-off-by: Yuan Tang <[email protected]>

* Empty-Commit

Signed-off-by: Yuan Tang <[email protected]>

* fix test

Signed-off-by: Yuan Tang <[email protected]>

* Empty-Commit

Signed-off-by: Yuan Tang <[email protected]>

---------

Signed-off-by: Yuan Tang <[email protected]>
@pull pull bot added merge-conflict Resolve conflicts manually and removed needs-rebase labels Apr 7, 2024
* build: Add flake8 and black to pre-commit hooks

Signed-off-by: Yuan Tang <[email protected]>

* fix path

Signed-off-by: Yuan Tang <[email protected]>

* pass config

Signed-off-by: Yuan Tang <[email protected]>

* fix flake8

Signed-off-by: Yuan Tang <[email protected]>

---------

Signed-off-by: Yuan Tang <[email protected]>
alexagriffith and others added 2 commits May 11, 2024 09:09
* update wording for huggingface README

small update to make readme easier to understand

Signed-off-by: Alexa Griffith  <[email protected]>

* Update README.md

Signed-off-by: Alexa Griffith [email protected]

* Update python/huggingfaceserver/README.md

Co-authored-by: Filippe Spolti <[email protected]>
Signed-off-by: Alexa Griffith  <[email protected]>

* update vllm

Signed-off-by: alexagriffith <[email protected]>

* Update README.md

---------

Signed-off-by: Alexa Griffith  <[email protected]>
Signed-off-by: Alexa Griffith [email protected]
Signed-off-by: alexagriffith <[email protected]>
Signed-off-by: Dan Sun <[email protected]>
Co-authored-by: Filippe Spolti <[email protected]>
Co-authored-by: Dan Sun <[email protected]>
* fix: HPA equality check should include annotations

Signed-off-by: Yuan Tang <[email protected]>

* Only watch related autoscalerclass annotation

Signed-off-by: Yuan Tang <[email protected]>

* simplify

Signed-off-by: Yuan Tang <[email protected]>

* Add missing delete action

Signed-off-by: Yuan Tang <[email protected]>

* fix logic

Signed-off-by: Yuan Tang <[email protected]>
---------

Signed-off-by: Yuan Tang <[email protected]>
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@pull pull bot removed the needs-rebase label May 13, 2024
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

fix huggingface runtime in chart

Signed-off-by: Dan Sun <[email protected]>
@pull pull bot removed the needs-rebase label May 13, 2024
* fix huggingface runtime in chart

Signed-off-by: Dan Sun <[email protected]>

* Allow model_dir to be specified on template

Signed-off-by: Dan Sun <[email protected]>

* Default model_dir to /mnt/models for HF

Signed-off-by: Dan Sun <[email protected]>

* Lint format

Signed-off-by: Dan Sun <[email protected]>

---------

Signed-off-by: Dan Sun <[email protected]>
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@pull pull bot removed the needs-rebase label May 14, 2024
* Fix:vLLM Model Supported check throwing circular dependency

Signed-off-by: Gavrish Prabhu <[email protected]>

* remove unwanted comments

Signed-off-by: Gavrish Prabhu <[email protected]>

* remove unwanted comments

Signed-off-by: Gavrish Prabhu <[email protected]>

* fix return case

Signed-off-by: Gavrish Prabhu <[email protected]>

* fix to check all arch in model config forr vllm support

Signed-off-by: Gavrish Prabhu <[email protected]>

* fixlint

Signed-off-by: Gavrish Prabhu <[email protected]>

---------

Signed-off-by: Gavrish Prabhu <[email protected]>
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Fix: allow null in Finish reason

Signed-off-by: Gavrish Prabhu <[email protected]>
@pull pull bot removed the needs-rebase label May 15, 2024
@openshift-merge-robot
Copy link

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@spolti
Copy link
Member

spolti commented May 15, 2024

closed in favor of #356

@spolti spolti closed this May 15, 2024
VedantMahabaleshwarkar pushed a commit to VedantMahabaleshwarkar/kserve that referenced this pull request May 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.