github kserve/kserve v0.5.0

latest releases: v0.13.0-rc0, v0.12.1, v0.12.0...
3 years ago

InferenceService V1Beta1

🚢 KFServing 0.5 promotes the core InferenceService from v1alpha2 to v1beta1!

The minimum required versions are Kubernetes 1.16 and Istio 1.3.1/Knative 0.14.3. Conversion webhook is installed to automatically convert v1alpha2 inference service to v1beta1.

🆕 What's new ?

  • You can now specify container fields on ML Framework spec such as env variable, liveness/readiness probes etc.
  • You can now specify pod template fields on component spec such as NodeAffinity etc.
  • Allow specifying timeouts on component spec
  • Tensorflow Serving gRPC support.
  • Triton Inference server V2 inference REST/gRPC protocol support, see examples
  • TorchServe predict integration, see examples
  • SKLearn/XGBoost V2 inference REST/gRPC protocol support with MLServer, see SKLearn and XGBoost examples
  • PMMLServer support, see examples
  • LightGBM support, see examples
  • Simplified canary rollout, traffic split at knative revisions level instead of services level, see examples
  • Transformer to predictor call is now using AsyncIO by default

⚠️ What's gone ?

  • Default/Canary level is removed, canaryTrafficPercent is moved to the component level
  • rollout_canary and promote_canary API is deprecated on KFServing SDK
  • Parallelism field is renamed to containerConcurrency
  • Custom keyword is removed and container field is changed to be an array

⬆️ What actions are needed to take to upgrade?

  • Make sure canary traffic is all rolled out before upgrade as v1alpha2 canary spec is deprecated, please use v1beta1 spec for canary rollout feature.
  • Although KFServing automatically converts the InferenceService to v1beta1, we recommend rewriting all your spec with V1beta1 API as we plan to drop the support for v1alpha2 in later versions.

Contribution list

Multi Model Serving V1Alpha1

🌈 KFServing 0.5 introduces Multi Model Serving with V1Alpha1 TrainedModel CR, this is currently for experiment only and we are looking for your feedbacks!

Checkout sklearn, triton MMS examples.

Explanation

Documentation

Developer Experience

Don't miss a new kserve release

NewReleases is sending notifications on new releases.