What's Changed
- Make storage initializer image configurable by @yuzisun in #3145
- chore: Add design doc template links to feature request template by @ckadner in #3155
- Increase pytest workers for kourier e2e test by @sivanantha321 in #3151
- Upgrade istio Api and migrate to v1beta1 Api version by @sivanantha321 in #3150
- Unpack archive files for hdfs by @sivanantha321 in #3093
- Removing update op from pod-mutator webhook by @rachitchauhan43 in #3163
- Fix quick install script by @dtrifiro in #3164
- Fix self-signed-ca installation by @sivanantha321 in #3165
- Generate client-go for StorageContainer CR by @sivanantha321 in #3152
- Add S3_VERIFY_SSL to storage.py for S3 by @Jooho in #3172
- Allow disabling creation of the HPA in raw deployment mode by @andyi2it in #3086
- Restrict workflow concurrency by @vignesh-murugani2i in #3167
- Refractor v1 vs. v2 endpoint unit tests in kserve/test/test_server.py… by @guohaoyu110 in #3158
- Fix runtime not found for triton due to wrong default protocolVersion by @sivanantha321 in #3177
- Make ModelServer to stop correctly when using more than 1 worker by @andyi2it in #3174
- Added custom request timeout for inferencegraph. by @andyi2it in #3173
- Fix serving runtime webhook cert namespace for kubeflow installation by @sivanantha321 in #3188
- Add go security scan for PRs and set it up to run on a regular schedule by @sivanantha321 in #3170
- RawDeployment support for Inference Graph by @bmopuri in #3194
- Revert "RawDeployment support for Inference Graph" by @rachitchauhan43 in #3198
- Verify codegen in CI by @sivanantha321 in #3189
- Fix knative config-defaults values overrided by kserve by @sivanantha321 in #3130
- Fix qpext metrics port by @yuzisun in #3209
- docs: fix some typos by @daixiang0 in #3214
- chore: Add new PR reviewers and approvers by @ckadner in #3213
- Added async with postprocess method. by @andyi2it in #3204
- Remove the redundant python lint check in CI environment by @nilakshi104 in #3184
- Move pmml worker validation to runtime by @sivanantha321 in #3182
- Bump torchserve version to 0.9.0 by @gavrishp in #3217
- CVE-2023-44487 - qpext by @spolti in #3203
- Allow ray >=2.7,<3 by @ddelange in #3075
- Fix lightgbm model input conversion when input is list of lists by @sivanantha321 in #3226
- CVE-2023-44487 by @spolti in #3202
- Sanitize a command line argument in agent by @israel-hdez in #3245
- Validation added for ensuring same model format has same priority for runtime by @andyi2it in #3181
- Fix: Unexpected Panic in Inference graph when it fails to create http request by @HAO2167 in #3079
- Add default clusterstoragecontainer cr into resources by @homily707 in #3219
- Support verify variable with storage-config json style (fix-3263) by @Jooho in #3267
- Update qpext docs on image patch by @sivanantha321 in #3266
- Added 3.11 support for paddle in workflow. by @andyi2it in #3246
- Torchserve grpc v2 by @andyi2it in #3247
- Bump istio version to 1.19.4 by @sivanantha321 in #3258
- image_patch_dev.sh: set pipefail by @dtrifiro in #3274
- s3 storage initializer: only set environment variables if variables are set in storage secret json by @dtrifiro in #3259
- feat: CA bundle mount options for storage initializer by @Jooho in #3250
- Fix tensorflow e2e test fails due to OOM error by @sivanantha321 in #3293
- Update Istio-Dex docs by @sivanantha321 in #3260
- chore: Upgrade Go to v1.21 by @terrytangyuan in #3296
- fix: Properly handle the creation and closure of success file in DownloadModel() by @terrytangyuan in #3295
- Updated ray to 2.8.0 and removed detached flag to avoid deprecation error in future by @andyi2it in #3272
- fix: Surface errors when writing graphHandler response by @terrytangyuan in #3308
- Fix qpext hangs during shutdown by @sivanantha321 in #3268
- chore: Upgrade to XGBoost v2.0.2. Fixes #3310 by @terrytangyuan in #3309
- fix: Check if HPA has the same scaleTargetRef and behavior by @terrytangyuan in #3294
- Updated quick_install script to temporarily fix 0.11.2 release install by @andyi2it in #3311
- Remove deprecated protobuf packages by @sivanantha321 in #3328
- Add health check for controller manager by @sivanantha321 in #3289
- Introduce retry on resource conflict by @sivanantha321 in #3240
- Updated Kserve version file path in pyproject.toml. by @andyi2it in #3225
- docs: Add link to OpenShift Container Platform instructions by @terrytangyuan in #3322
- Fix inference request fails when sending with less number of features than the total model features on lightgbm by @sivanantha321 in #3313
- Add a CI_USE_ISVC_HOST for testing with the ISVC hostname by @israel-hdez in #3324
- Upgraded poetry version to 1.7.1 by @andyi2it in #3271
- ci: publish helm chart to ghcr by @davidspek in #3319
- Fix raw deployment service points to predictor container port instead of transformer container port in transformer collocation by @sivanantha321 in #3318
- Upgrade cloudevent to v2 by @homily707 in #3255
- Restrict storage uri to predictor only in collocation of transformer and predictor by @sivanantha321 in #3280
- Add support for modelcars by @rhuss in #3110
- Add regex support for propagating IG headers by @sivanantha321 in #3178
- chore: Prepare v0.12.0-rc0 release by @terrytangyuan in #3325
- docs: Corrections and edits on release process document by @terrytangyuan in #3326
- build: Switch to use kustomize in kubectl to simplify build process. Fixes #3314 by @terrytangyuan in #3315
- feat: Expose defaults for several batcher handler parameters by @terrytangyuan in #3301
- fix: Properly close resources and handle errors in agent and storage. Fixes #3323 by @terrytangyuan in #3321
- Add model arguments to API and update BERT inference example by @yuzisun in #3332
- chore: Update generated APIs and check generated manifests by @terrytangyuan in #3335
- Update python model serving runtime API docstring by @yuzisun in #3338
- Handles s3 download for object name starts with folder name. by @andyi2it in #3205
- chore: Remove unused timeout annotation and flag in batcher by @terrytangyuan in #3341
- ci: Automate release process by @terrytangyuan in #3345
- fixes critical vulnerabilities on ray by @spolti in #3285
- chore: Bump versions to prepare v0.12.0-rc1 release by @terrytangyuan in #3352
- Change version for helm charts in README by @gawsoftpl in #3353
- Fixes CVE-2023-48795 by @spolti in #3354
- Fix Stack-based Buffer Overflow on protobuf by @spolti in #3358
- Update knative-serving by @spolti in #3362
- Fixes vulnerabilities on the otelhttp dependency by @spolti in #3361
- Change the default value for enableDirectPvcVolumeMount to true by @Jooho in #3371
- feat: Automatically generate Helm Chart docs. Fixes #3356 by @terrytangyuan in #3363
- Modified script for include all kserve poetry projects. by @andyi2it in #3350
- RawDeployment support for Inference Graph by @bmopuri in #3199
- Add compatibility for Istio CNI plugin by @israel-hdez in #3316
- Pass missing infer parameters during conversion by @sivanantha321 in #3368
- feat: Support S3 transfer acceleration by @terrytangyuan in #3305
- Implement HuggingFace model server by @yuzisun in #3334
- fix: Add missing volume mount to transformer container when using modelcars by @rhuss in #3384
- align cloudevents/sdk-go dependency by @spolti in #3387
- Update google-cloud-storage dependecy to >=2.3.0,<3.0.0 and ray dependency to >=2.8.1, <3.0.0 by @sivanantha321 in #3389
- Adding script for setting up Telepresence for local development by @rhuss in #3104
- Fix v2 model ready url in kserve client by @sivanantha321 in #3403
- chore: Remove unused version update script for Python SDK by @terrytangyuan in #3394
- Fix no space left error in CI environment by @sivanantha321 in #3411
- Add exception handler for model server and Add ability to specify custom handler by @sivanantha321 in #3405
- Streamline python release update script to scan multi-depth by @andyi2it in #3351
- feat: Add HuggingFace runtime out-of-the-box support by @terrytangyuan in #3395
- docs: Update link to Slack instructions by @terrytangyuan in #3421
- Fix no space left error on huggingface docker publisher workflow by @sivanantha321 in #3431
- Fix parameters value type conversion by pydantic by @sivanantha321 in #3430
- Fix Raw Logger E2E by @israel-hdez in #3434
- Go vulnerability fixes by @sivanantha321 in #3442
- Reduce E2Es dependency on CI environment by @israel-hdez in #3435
- Implement support for vllm as alternative backend by @gavrishp in #3415
- Expose qpext aggregate metrics port on container by @sivanantha321 in #3291
- Python vulnerability fixes by @sivanantha321 in #3441
- Fix dup metrics aggr port by @yuzisun in #3447
- feat: Allow to disable ingress creation for raw deployment mode by @terrytangyuan in #3436
- build: Skip already downloaded Istio in quick_install.sh by @terrytangyuan in #3364
- fix: HuggingFace predictor should not be recognized as multi-model server by @terrytangyuan in #3449
- Fix: bugs for huggingface runtime template by @yuzisun in #3448
- Fix: Add padding and truncation in huggingface tokenizer by @kevinmingtarja in #3450
- Fix: vllm backend does not work with model_dir for huggingface runtime by @yuzisun in #3456
- Fix azure workload identity federation by excluding azure client secret by @robbertvdg in #3390
- Publish v0.12.0 release by @sivanantha321 in #3458
- Change
certificate
toca_bundle
in json style of s3 storageSecret by @Jooho in #3463 - fix: Add 'model_version' to InferResponse in python library by @ajstewart in #3466
New Contributors
- @dtrifiro made their first contribution in #3164
- @Jooho made their first contribution in #3172
- @vignesh-murugani2i made their first contribution in #3167
- @guohaoyu110 made their first contribution in #3158
- @bmopuri made their first contribution in #3194
- @daixiang0 made their first contribution in #3214
- @nilakshi104 made their first contribution in #3184
- @gavrishp made their first contribution in #3217
- @spolti made their first contribution in #3203
- @HAO2167 made their first contribution in #3079
- @homily707 made their first contribution in #3219
- @rhuss made their first contribution in #3110
- @gawsoftpl made their first contribution in #3353
- @kevinmingtarja made their first contribution in #3450
- @robbertvdg made their first contribution in #3390
- @ajstewart made their first contribution in #3466
Full Changelog: v0.11.1...v0.12.0