github kserve/kserve v0.8.0

latest releases: v0.14.0, v0.14.0-rc1, v0.14.0-rc0...
2 years ago

🌈 What's New?

This release introduces two new CRDs ServingRuntimes and ClusterServingRuntimes with the only difference between these two is that one is namespace-scoped and one is cluster-scoped. A ServingRuntime defines the templates for Pods that can serve one or more particular model formats. Each ServingRuntime defines key information such as the container image of the runtime and a list of the model formats that the runtime supports.

In previous versions of KServe, supported predictor formats and container images were defined in a config map in the control plane namespace. The ServingRuntime CRD should allow for improved flexibility and extensibility for defining or customizing runtimes to how you see fit without having to modify any controller code or any resources in the controller namespace.

Several out-of-the-box ClusterServingRuntimes are provided with KServe so that users can continue to use KServe how they did before without having to define the runtimes themselves.

⚠️ What's Changed

  • Rename KF prefixed PythonSDK classes by @markwinter in #1951
    KFModel -> Model
    KFServer -> ModelServer
    KFModelRepository -> ModelRepository
  • KServe's pytorchserver has been deprecated, for PyTorch model KServe now defaults to use TorchServe serving runtime.
  • ONNX runtime server has been deprecated, for ONNX model KServe now defaults to use Triton Inference Server.

⬆️ Version upgrades

🐞 Fixes

Full Changelog: v0.7.0...v0.8.0

Don't miss a new kserve release

NewReleases is sending notifications on new releases.