github kserve/kserve v0.4.0
KFServing 0.4 release

latest releases: v0.14.0-rc1, v0.14.0-rc0, v0.13.1...
4 years ago

Action Required

  • KFServing has added object selector on pod mutator webhook configuration which requires minimally Kubernetes 1.15 to take effect.
  • The generated KFServing InferenceService openAPI schema validation now includes markers like x-kubernetes-list-map-keys and x-kubernetes-map-type which requires minimally Kubernetes 1.16, if you are on kubernetes 1.15 or lower version please install KFServing with --validate=false flag.
  • Tensorrt inference server has been renamed to Triton inference server, if you are using tensorrt predictor on inference service yaml please rename to triton.
  • KFserving has removed the default percentage based queue proxy resource limit due to #844, please set queue proxy requests/limits in the knative config-deployment.yaml config map which is introduced in knative 0.16 or add the queue proxy resource limit annotation if you are on lower version and your cluster has resource quota turned on, we highly recommend upgrading linux kernel if you are hitting the same cpu throttling issue.
  • The default S3 credential name has been updated to follow the convention from awsAccessKeyID and awsSecretAccessKey to AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY, if you have secrets configured with the old way please update accordingly.
  • KFServing has stopped maintaining the model server image versions in the configmap, user now can set the corresponding model server version on runtimeVersion field if you need the version different from the default.

New features

Bug Fixes

Documentations

Developer Experience

Don't miss a new kserve release

NewReleases is sending notifications on new releases.