github kserve/kserve v0.3.0
v0.3 "Stability"

pre-release, 4 years ago

Features

  • PyTorch model server with GPU inference #540
  • Support internal mesh routing to an inference service, e.g., routing from a Kafka event source #583
  • Add storage URI for transformer #643
  • Add parallelism field to allow setting the autoscaling target concurrency and the number of Tornado workers #637
  • SKLearn model server now supports pickled models #560
  • Add extra information for Logger #699
  • Default min replica to 1 instead of 0 #655
  • Upgrade Knative API from v1alpha1 to v1 for KFServing #585
  • Upgrade KFServing Kubernetes dependency to 1.15 and Knative dependency to 0.11 #630
  • Upgrade openapi-gen #600
  • Expose containerPort so that Knative listens on the logger port; support the logger for custom specs #592
  • Self-signed certs generation script #650
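
As a rough illustration of the parallelism field (#637) and the new min-replica default (#655), the sketch below builds an InferenceService manifest as a plain Python dict. The field names `parallelism` and `minReplicas`, the `serving.kubeflow.org/v1alpha2` API version, the `build_inference_service` helper, and the storage URI are all assumptions for illustration and are not taken from these notes; check them against the CRD shipped with this release.

```python
import json


def build_inference_service(name, storage_uri, parallelism=None, min_replicas=None):
    """Return an InferenceService manifest as a dict (assumed v1alpha2 layout)."""
    predictor = {"sklearn": {"storageUri": storage_uri}}
    if parallelism is not None:
        # Assumed field: autoscaling target concurrency / number of workers.
        predictor["parallelism"] = parallelism
    if min_replicas is not None:
        # Assumed field: set explicitly to override the default of 1.
        predictor["minReplicas"] = min_replicas
    return {
        "apiVersion": "serving.kubeflow.org/v1alpha2",  # assumption
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {"default": {"predictor": predictor}},
    }


# Hypothetical model name and bucket, for illustration only.
manifest = build_inference_service(
    "sklearn-iris",
    "gs://example-bucket/models/iris",
    parallelism=4,
    min_replicas=0,
)
print(json.dumps(manifest, indent=2))
```

Leaving `min_replicas` unset omits the field, so the service would fall back to whatever default the controller applies (1 as of this release, per #655).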

Bug Fixes

  • Fix default queue-proxy container resource limit, which was too low #608
  • Allow configuring the max buffer size for the Tornado server #665
  • Relax data plane "instances" key validation #705
  • Return application/json in response header #615
  • Fix top level virtual service for HTTPS #726
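
For context on the data plane fixes (#705, #615), here is a minimal sketch of a predict request that uses the "instances" key and of a client that expects a JSON response. The `/v1/models/<name>:predict` path and the "predictions" response key follow the TensorFlow-Serving-style protocol and are assumptions here, as are the helper names and the model name; the exact validation that #705 relaxes is not described in these notes.

```python
import json


def build_predict_request(model_name, instances):
    """Return (path, headers, body) for a TensorFlow-Serving-style predict call."""
    if not isinstance(instances, list):
        raise TypeError("instances must be a list")
    path = f"/v1/models/{model_name}:predict"  # assumed URL layout
    headers = {"Content-Type": "application/json"}
    body = json.dumps({"instances": instances})
    return path, headers, body


def parse_predict_response(content_type, raw_body):
    """Check the response is JSON and return its 'predictions' list."""
    if not content_type.lower().startswith("application/json"):
        raise ValueError(f"unexpected content type: {content_type}")
    return json.loads(raw_body)["predictions"]  # assumed response key


# Hypothetical model name and a single four-feature instance.
path, headers, body = build_predict_request("sklearn-iris", [[6.8, 2.8, 4.8, 1.4]])
print(path)
print(body)
```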

Developer Experience, Tools & Testing, Examples

  • Enable local development for model servers, explainer and storage initializer #591
  • Add SDK API to wait for an inference service #610
  • Add custom examples #678 #698
  • Add canary rollout examples #691
  • Add e2e tests for canary rollout #658
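
The wait API added to the SDK (#610) is not spelled out in these notes. The sketch below only shows the general polling pattern such a call might wrap: `wait_until_ready`, its parameters, and the fake status function are hypothetical names for illustration, not the SDK's actual interface.

```python
import time


def wait_until_ready(get_status, timeout_seconds=600, polling_interval=10):
    """Poll get_status() until it returns True or the timeout elapses."""
    deadline = time.monotonic() + timeout_seconds
    while True:
        if get_status():
            return True
        if time.monotonic() >= deadline:
            raise TimeoutError("inference service did not become ready in time")
        time.sleep(polling_interval)


# Stand-in for a real readiness check: not ready twice, then ready.
statuses = iter([False, False, True])
ready = wait_until_ready(lambda: next(statuses), timeout_seconds=5, polling_interval=0)
print(ready)
```

In real use, `get_status` would query the cluster for the service's readiness condition, and the polling interval would be several seconds rather than zero.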
