[pull] master from kserve:master #472

Status: Open
Wants to merge 75 commits into base: master

Commits (75)
b836fc8
Remove redundant namespace yaml (#4148)
greenmoon55 Dec 23, 2024
cb38637
Fix Localmodel agent build (#4150)
greenmoon55 Dec 23, 2024
41a1fd7
Fix model server fails to gracefully shutdown (#4116)
sivanantha321 Dec 24, 2024
07edb56
Ensure root model directory exists and add protection for jobs create…
yuzisun Dec 24, 2024
61b0746
Enable transformer deeper readiness check tests (#4121)
sivanantha321 Dec 26, 2024
5e5bff8
Update HuggingFace server dependencies versions (#4147)
AyushSawant18588 Dec 26, 2024
874a4a4
Add workflow for verifying go mod (#4137)
sivanantha321 Dec 26, 2024
ff13ef8
Fix for CVE-2024-52304 - aiohttp upgrade (#4113)
andyi2it Dec 27, 2024
7bec6fb
Allow other engine builders other than docker (#3906)
spolti Dec 30, 2024
8f79c44
Add localmodelnode crd to helm chart (#4161)
greenmoon55 Jan 2, 2025
5b718bc
Fixes Non-linear parsing of case-insensitive content (#4158)
spolti Jan 3, 2025
3e842b8
Helm chart - option to run daemonset as root (#4164)
greenmoon55 Jan 3, 2025
7b2681f
Replace nodeGroup with nodeGroups in charts/kserve-crd (#4166)
ritzdevp Jan 5, 2025
3597fa7
Add affinity and tolerations to localmodel daemonset (#4173)
greenmoon55 Jan 7, 2025
cb8a657
Fix s3 download PermanentRedirectError for legacy s3 endpoint (#4157)
bentohset Jan 8, 2025
654d314
Make label and annotation propagation configurable (#4030)
spolti Jan 8, 2025
b7155b3
Add ModelCache e2e test (#4136)
sivanantha321 Jan 11, 2025
717c6ae
Update vllm to 0.6.6 (#4176)
rajatvig Jan 12, 2025
1b64da9
[bugfix] fix s3 storage download filename bug (#4162)
anencore94 Jan 13, 2025
8a32b6f
Add hf to storageuri prefix list (#4184)
tjandy98 Jan 16, 2025
990e412
Add Support for OpenAI-compatible Embeddings API (#4129)
FabianScheidt Jan 16, 2025
d377350
fix: typo in _construct_http_status_error method (#4190)
Mgla96 Jan 16, 2025
a1ad62b
Fix raw logger e2e test (#4185)
sivanantha321 Jan 19, 2025
b164ea0
Feat: Support configuring isvc resource defaults (#4032)
andyi2it Jan 21, 2025
6653b89
keep replicas when autoscaler set external (#4196)
Jooho Jan 22, 2025
6f842d6
Increase kserve controller readiness probe time period (#4200)
sivanantha321 Jan 22, 2025
1d0afe1
Fix golangci-lint binary path selection based on GOBIN (#4198)
Jooho Jan 22, 2025
96f43de
Add option to disable volume management in localModel config (#4186)
ritzdevp Jan 22, 2025
ea1f055
set MaxUnavailable(0%)/MaxSurge(100%) for rollingUpdate in multinode …
Jooho Jan 24, 2025
549ab62
Gracefully shutdown the router server (#3367)
sivanantha321 Jan 25, 2025
5aedc95
Add workflow for manual huggingface vLLM image publish (#4092)
sivanantha321 Jan 25, 2025
7f4cad9
Gateway API Migration - Raw Deployment (#3952)
sivanantha321 Jan 25, 2025
b80c906
add make goal to build huggingface cpu image (#4202)
spolti Jan 26, 2025
c7b6ad0
Cleanup the filepath in createNewFile to avoid path traversal issue (…
hdefazio Jan 26, 2025
4c940e1
Enhance multinode health_check python and manifests (#4197)
Jooho Jan 26, 2025
41a0904
Publish 0.15-rc0 release (#4213)
yuzisun Jan 27, 2025
736cc48
Fix Gateway API flaky test (#4214)
sivanantha321 Jan 30, 2025
a2296cc
Remove linux/arm64/v8 as platform option to fix build errors (#4217)
gavrissh Jan 31, 2025
c418bea
Fix: isvc resource config name in inferenceservice configmap (#4215)
sukumargaonkar Feb 1, 2025
7f2e443
Fix CI not using localmodelnode agent dev image (#4221)
sivanantha321 Feb 3, 2025
0c42913
Fix model download path (#4112)
hakuro95 Feb 4, 2025
198c835
Support Multiple NodeGroups In LocalModelCache (#4170)
HotsauceLee Feb 7, 2025
895f9fd
Inference Graph: use plain text HTTP when part of Istio Mesh (#4031)
israel-hdez Feb 7, 2025
cbbf41a
Better compatibility with in-place upgrades when auth proxy disabled …
israel-hdez Feb 12, 2025
0b49aad
Increase request timeout seconds for art explainer (#4241)
sivanantha321 Feb 12, 2025
0ec3460
fix: add trainedmodels custom resource to kubeflow-kserve clusterrole…
gigabyte132 Feb 12, 2025
974ac97
Fix CVE-2025-24357 and Bump vLLM to 0.7.2 (#4223)
sivanantha321 Feb 13, 2025
6d71f39
Bump Go to 1.23 (#4239)
rajatvig Feb 13, 2025
e83a197
Remove modelmesh installation from helm chart (#4243)
sivanantha321 Feb 16, 2025
e5b7919
Bump golang-lint to 1.63 and fix all linter errors (#3967)
sivanantha321 Feb 17, 2025
2a8d5fd
Fix request metadata headers parsing in logger (#4249)
tylerhyang Feb 20, 2025
952711a
Add predictor healthcheck to OpenAIProxyModel (#4250)
greenmoon55 Feb 21, 2025
20c5a52
Expose podSpec fields for Inferencegraph (#4091)
sivanantha321 Feb 25, 2025
8f2409b
Fix localmodel test (#4268)
greenmoon55 Feb 26, 2025
ea8bb6d
Force symlink for ModelCar (#4274)
pmtk Feb 28, 2025
1cee348
Refactor vLLM + Embed support (#4177)
gavrissh Mar 1, 2025
2925c91
Fix triton health check (#4277)
greenmoon55 Mar 3, 2025
0a24d86
Upgrade vLLM version to 0.7.3 (#4281)
gavrissh Mar 4, 2025
8b744a9
0.15.0-rc1 release (#4285)
greenmoon55 Mar 4, 2025
88c98d9
Add model_version field to InferRequest (#4287)
greenmoon55 Mar 5, 2025
153189a
quick_install.sh failed to uninstall incomplete installation and has …
zozowell Mar 5, 2025
41f771d
update openshift guide (#4210)
spolti Mar 6, 2025
9716122
Collocation transformer and predictor spec (#4255)
sivanantha321 Mar 6, 2025
7fe14e0
Move arguments from 'args' to 'command' for huggingface server multno…
Jooho Mar 6, 2025
3f84f2b
Include reasoning parser option in vLLM for reasoning models (#4282)
gavrissh Mar 6, 2025
d78fd25
KServe Keda Integration (#3652)
andyi2it Mar 7, 2025
4c13709
add huggingfaceserver-multinode to helm chart (#4293)
Jooho Mar 7, 2025
d5704af
Add missing CRDs for Keda (#4296)
andyi2it Mar 7, 2025
4612cd6
Add well-known label to InferenceGraphs (#4236)
israel-hdez Mar 8, 2025
0188b63
Docs update for speculative_decoding in huggingfaceserver (#4272)
nickaggarwal Mar 10, 2025
3a8ec64
Merging kfp into kserve docs. (#4270)
mholder6 Mar 12, 2025
3d6a77c
fix conflict isvc name for keda e2e (#4305)
Jooho Mar 12, 2025
d0b7f31
update golangci-lint verstion to 1.64 (#4302)
Jooho Mar 13, 2025
47f3014
fix: add huggingfaceserver gpu images to kustomization.yaml and use i…
Jooho Mar 14, 2025
cfba881
Fix watch for k8s service events (#4260)
sivanantha321 Mar 16, 2025
7 changes: 4 additions & 3 deletions .github/PULL_REQUEST_TEMPLATE.md
@@ -1,8 +1,9 @@
 <!-- Thanks for sending a pull request! Here are some tips for you:
 1. If this is your first time, read our contributor guidelines https://www.kubeflow.org/docs/about/contributing/ and developer guide https://github.com/kserve/kserve/blob/master/docs/DEVELOPER_GUIDE.md
-2. If you want *faster* PR reviews, read how: https://git.k8s.io/community/contributors/guide/pull-requests.md#best-practices-for-faster-reviews
-3. Follow the instructions for writing a release note: https://git.k8s.io/community/contributors/guide/release-notes.md
-4. If the PR is unfinished, see how to mark it: https://git.k8s.io/community/contributors/guide/pull-requests.md#marking-unfinished-pull-requests
+2. Before raising a PR, please run `make go-lint` and `make py-fmt` to check the code style.
+3. If you want *faster* PR reviews, read how: https://git.k8s.io/community/contributors/guide/pull-requests.md#best-practices-for-faster-reviews
+4. Follow the instructions for writing a release note: https://git.k8s.io/community/contributors/guide/release-notes.md
+5. If the PR is unfinished, see how to mark it: https://git.k8s.io/community/contributors/guide/pull-requests.md#marking-unfinished-pull-requests
 -->

 **What this PR does / why we need it**:
22 changes: 9 additions & 13 deletions .github/actions/kserve-dep-setup/action.yml
@@ -9,6 +9,10 @@ inputs:
     description: 'Kserve deployment mode. Supported values are serverless and raw'
     required: false
     default: 'serverless'
+  enable-keda:
+    description: 'Enable KEDA for autoscaling'
+    required: false
+    default: 'false'
 runs:
   using: "composite"
   steps:
@@ -17,22 +21,14 @@ runs:
       run: |
         shopt -s nocasematch

-        if [[ "${{ inputs.network-layer }}" == "istio" ]]; then
-          echo "Selected network layer ${{ inputs.network-layer }}"
-          ./test/scripts/gh-actions/setup-deps.sh ${{ inputs.deployment-mode }}
-
-          kubectl get pods -n istio-system
-          kubectl describe svc -n istio-system istio-ingressgateway
-        elif [[ "${{ inputs.network-layer }}" == "kourier" ]]; then
-          echo "Selected network layer ${{ inputs.network-layer }}"
-          ./test/scripts/gh-actions/setup-kourier.sh
+        if [[ "${{ inputs.network-layer }}" == "kourier" ]]; then
+          echo "Selected network layer ${{ inputs.network-layer }}"
+          ./test/scripts/gh-actions/setup-kourier.sh
         else
-          echo "Unsupported network layer"
+          echo "Selected network layer ${{ inputs.network-layer }}"
+          ./test/scripts/gh-actions/setup-deps.sh ${{ inputs.deployment-mode }} "${{ inputs.network-layer }}" "${{ inputs.enable-keda }}"
         fi

-        kubectl get pods -n knative-serving
-        kubectl get pods -n cert-manager
-
     - name: Update test overlays
       shell: bash
       run: |
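The branching in the updated `kserve-dep-setup` step can be sketched in plain shell. This is a minimal illustration, not the action's actual script: `select_setup_command` is a hypothetical helper that only prints the command the step would run, and it lowercases the input with `tr` where the action itself relies on `shopt -s nocasematch`.

```shell
# Hypothetical helper (not part of the PR): prints the setup command the
# step would run for a given network layer, deployment mode and KEDA flag.
select_setup_command() {
  network_layer="$1"
  deployment_mode="$2"
  enable_keda="$3"

  # Lowercase the network layer so the comparison is case-insensitive,
  # standing in for `shopt -s nocasematch` in the action.
  layer_lc=$(printf '%s' "$network_layer" | tr '[:upper:]' '[:lower:]')

  case "$layer_lc" in
    kourier)
      # Kourier keeps its dedicated setup script.
      echo "./test/scripts/gh-actions/setup-kourier.sh"
      ;;
    *)
      # Every other layer is forwarded to setup-deps.sh, which now also
      # receives the network layer and the enable-keda flag.
      echo "./test/scripts/gh-actions/setup-deps.sh $deployment_mode $network_layer $enable_keda"
      ;;
  esac
}

select_setup_command Kourier raw false
select_setup_command istio serverless true
```

Note that the `else` branch no longer rejects unknown layers; any value other than `kourier` is passed through to `setup-deps.sh`.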
24 changes: 20 additions & 4 deletions .github/actions/minikube-setup/action.yml
@@ -1,22 +1,38 @@
 name: 'Minikube setup action'
 description: 'Sets up minikube on the github runner'

+inputs:
+  nodes:
+    description: 'Number of nodes to start minikube with'
+    required: false
+    default: '1'
+  driver:
+    description: 'Driver to use for minikube'
+    required: false
+    default: 'none'
+  start-args:
+    description: 'Additional arguments to pass to minikube start'
+    required: false
+    default: ''
+
 runs:
   using: "composite"
   steps:
     - name: Install kubectl
       uses: azure/[email protected]
       with:
-        version: 'v1.29.7'
+        version: 'v1.30.7'

     - name: Setup Minikube
       uses: medyagh/setup-minikube@latest
       with:
         minikube-version: '1.33.1'
-        kubernetes-version: 'v1.29.7'
-        driver: 'none'
+        kubernetes-version: 'v1.30.7'
+        driver: ${{ inputs.driver }}
         wait: 'all'
-        start-args: --wait-timeout=6m0s
+        cpus: 'max'
+        memory: 'max'
+        start-args: --wait-timeout=6m0s --nodes=${{ inputs.nodes }} ${{ inputs.start-args }}

     - name: Check Kubernetes pods
       shell: bash
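How the `start-args` value is assembled from the new inputs can be sketched as follows, assuming the declared defaults (`nodes` is `1`, `start-args` is empty). `build_start_args` is a hypothetical helper, not part of the action, and the extra flag in the second call is only an illustrative value. Unlike the template expression, the sketch leaves out the trailing space when no extra arguments are given.

```shell
# Hypothetical helper (not part of the PR): composes the argument string
# that the action passes to `minikube start`.
build_start_args() {
  nodes="${1:-1}"   # mirrors the `nodes` input default of '1'
  extra="${2:-}"    # mirrors the `start-args` input default of ''

  args="--wait-timeout=6m0s --nodes=${nodes}"
  if [ -n "$extra" ]; then
    args="$args $extra"
  fi
  printf '%s\n' "$args"
}

# Defaults only: single node, no extra arguments.
build_start_args
# Three nodes plus one illustrative extra flag.
build_start_args 3 "--container-runtime=containerd"
```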
4 changes: 2 additions & 2 deletions .github/workflows/artexplainer-docker-publish.yml
@@ -39,7 +39,7 @@ jobs:
       - name: Run tests
         uses: docker/build-push-action@v5
         with:
-          platforms: linux/amd64, linux/arm64/v8
+          platforms: linux/amd64
           context: python
           file: python/artexplainer.Dockerfile
           push: false
@@ -93,7 +93,7 @@ jobs:
       - name: Build and push
         uses: docker/build-push-action@v5
         with:
-          platforms: linux/amd64, linux/arm64/v8
+          platforms: linux/amd64
           context: python
           file: python/artexplainer.Dockerfile
           push: true
18 changes: 9 additions & 9 deletions .github/workflows/automated-release.yml
@@ -3,27 +3,27 @@ on:
   workflow_dispatch:
     inputs:
       releaseBranch:
-        description: 'The existing branch name to release from, e.g. release-0.12'
+        description: "The existing branch name to release from, e.g. release-0.12"
         required: true
       releaseTag:
-        description: 'The release tag, e.g. v0.12.0-rc1'
+        description: "The release tag, e.g. v0.12.0-rc1"
         required: true

 jobs:
   prepare-release:
     runs-on: ubuntu-latest
     steps:
-      - name: Setup Go
-        uses: actions/setup-go@v5
-        with:
-          go-version: 1.22
-
       - name: Checkout source code
         uses: actions/checkout@v4
         with:
           fetch-depth: 0
           ref: ${{ inputs.releaseBranch }}

+      - name: Setup Go
+        uses: actions/setup-go@v5
+        with:
+          go-version-file: go.mod
+
       - name: Install dependencies
         run: |
           go mod download
@@ -39,12 +39,12 @@ jobs:
cd $KSERVE_PATH
export RELEASE_BRANCH=${{ inputs.releaseBranch }}
export RELEASE_TAG=${{ inputs.releaseTag }}

# Bump Versions
make bump-version
./hack/generate-install.sh $RELEASE_TAG
./hack/python-release.sh

# Update Release Branch and Push Tag
git diff
git config --global user.email "[email protected]"
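In the `automated-release.yml` change above, `Setup Go` moves after the checkout step, presumably because `go-version-file: go.mod` can only be resolved once the repository is in the workspace. As a rough illustration of what that setting reads, the sketch below extracts the `go` directive from a `go.mod` file. `go_version_from_mod` and the module path `example.com/demo` are hypothetical; `actions/setup-go` has its own resolution logic, which this does not reproduce.

```shell
# Hypothetical helper (not part of the PR): prints the version named by
# the `go` directive of a go.mod file.
go_version_from_mod() {
  mod_file="${1:-go.mod}"
  # The directive is a line whose first field is exactly "go".
  awk '$1 == "go" { print $2; exit }' "$mod_file"
}

# Usage against a throwaway go.mod written to a temporary directory.
tmp_dir=$(mktemp -d)
printf 'module example.com/demo\n\ngo 1.23\n' > "$tmp_dir/go.mod"
go_version_from_mod "$tmp_dir/go.mod"
rm -rf "$tmp_dir"
```

Reading the version from `go.mod` keeps the workflow in step with the module, rather than needing a separate edit to a hard-coded `go-version` whenever Go is bumped.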