Features
- PyTorch model server with GPU inference #540
- Support internal mesh routing to the inference service, e.g. routing from a Kafka event source #583
- Add storage URI for transformer #643
- Add parallelism field to set the autoscaling target concurrency and the number of Tornado workers #637
- SKLearn model server now supports pickled models #560
- Add extra information for Logger #699
- Default min replica to 1 instead of 0 #655
- Upgrade Knative API from v1alpha1 to v1 for KFServing #585
- Upgrade KFServing Kubernetes dependency to 1.15 and Knative dependency to 0.11 #630
- Upgrade openapi-gen #600
- Expose containerPort so Knative listens on the logger port; support logger for custom spec #592
- Self-signed certs generation script #650
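
To illustrate how the parallelism field (#637) and the new minimum replica default (#655) might appear together, here is a minimal sketch that builds an InferenceService manifest as a plain Python dict. The API version, field placement under `spec.default.predictor`, the helper name `build_inference_service`, and the `gs://example-bucket/...` URI are assumptions for illustration, not taken from these notes; check the CRD shipped with this release before relying on them.

```python
import json


def build_inference_service(name, storage_uri, parallelism=4, min_replicas=1):
    """Return a hypothetical InferenceService manifest as a plain dict.

    Field names and their placement are assumptions, not confirmed by the
    release notes.
    """
    return {
        "apiVersion": "serving.kubeflow.org/v1alpha2",  # assumed API version
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "default": {
                "predictor": {
                    # Stated explicitly here; 1 is the new default per #655.
                    "minReplicas": min_replicas,
                    # Assumed to drive both target concurrency and the
                    # number of Tornado workers, per #637.
                    "parallelism": parallelism,
                    "sklearn": {"storageUri": storage_uri},
                }
            }
        },
    }


# Placeholder model location, for illustration only.
manifest = build_inference_service(
    "sklearn-demo", "gs://example-bucket/models/sklearn"
)
print(json.dumps(manifest, indent=2))
```

The printed JSON could be saved to a file and applied with `kubectl apply -f`, since Kubernetes accepts JSON manifests as well as YAML.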
Bug Fixes
- Fix default queue proxy container resource limit which was too low #608
- Allow configuring the max buffer size for the Tornado server #665
- Relax data plane "instances" key validation #705
- Return application/json as the Content-Type response header #615
- Fix top level virtual service for HTTPS #726
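
The two data plane fixes above (#705 and #615) can be pictured with a small standalone sketch. It assumes the relaxed validation means that the "instances" key is optional but must be a list when present; that reading, and the helper names `validate_request` and `make_response`, are illustrative assumptions rather than the model server's actual code.

```python
import json


def validate_request(raw_body):
    """Parse a JSON request body with relaxed "instances" validation.

    Assumption: a missing "instances" key is accepted, but a present
    one must be a list.
    """
    try:
        body = json.loads(raw_body)
    except json.JSONDecodeError as exc:
        raise ValueError(f"Unrecognized request format: {exc}") from exc
    if (
        isinstance(body, dict)
        and "instances" in body
        and not isinstance(body["instances"], list)
    ):
        raise ValueError('Expected "instances" to be a list')
    return body


def make_response(result):
    """Serialize a result and label it as JSON in the response headers."""
    headers = {"Content-Type": "application/json"}
    return headers, json.dumps(result)


# A body with a list under "instances" and a body without the key both pass.
print(validate_request('{"instances": [[1.0, 2.0]]}'))
print(validate_request('{"inputs": [1, 2, 3]}'))
print(make_response({"predictions": [0]}))
```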