TensorFlow Serving using TensorFlow 1.13.1
Major Features and Improvements
- Support for TensorRT 5.0 (GPU docker image built against CUDA 10 and TensorRT 5.0)
- Support for listening for gRPC over a UNIX socket; see the client sketch after this list. (commit: a25b0da)
- New GPU version of the TensorFlow Serving API PIP package. This depends on the `tensorflow-gpu` PIP package instead of `tensorflow`, but is otherwise identical. (commit: 525c1af)
- TF Serving end-to-end colab! Training with Keras, serving with TF Serving and the REST API; see the REST client sketch after this list. (commit: 1ff8aad)
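For the new UNIX-socket listener, here is a minimal client-side sketch in Python, assuming the model server's gRPC endpoint has been bound to a socket path; the socket path and model name are hypothetical placeholders:

```python
import grpc
from tensorflow_serving.apis import get_model_metadata_pb2, prediction_service_pb2_grpc

# gRPC accepts "unix:<path>" targets, so the client dials the socket the
# server listens on instead of a TCP host:port. The path and model name
# are hypothetical placeholders for this sketch.
channel = grpc.insecure_channel("unix:/tmp/tf_serving.sock")
stub = prediction_service_pb2_grpc.PredictionServiceStub(channel)

# Fetch the model's signature metadata as a simple connectivity check.
request = get_model_metadata_pb2.GetModelMetadataRequest()
request.model_spec.name = "my_model"
request.metadata_field.append("signature_def")
print(stub.GetModelMetadata(request, timeout=5.0))
```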
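In the spirit of the colab, a short sketch of calling the REST predict endpoint from Python; the port, model name, and dummy input shape are hypothetical and not taken from the colab itself:

```python
import json
import requests  # third-party HTTP client

# The REST predict endpoint has the form
#   http://<host>:<rest_api_port>/v1/models/<model_name>:predict
# 8501, "my_model", and the 28x28 dummy input are placeholders.
url = "http://localhost:8501/v1/models/my_model:predict"
payload = {"instances": [[[0.0] * 28] * 28]}  # a single all-zeros 28x28 example

response = requests.post(url, data=json.dumps(payload))
response.raise_for_status()
print(response.json()["predictions"][0])
```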
Breaking Changes
- No breaking changes.
Bug Fixes and Other Changes
- Make the error message for input size mismatch in the `Predict` call even more actionable. (commit: 7237fb5)
- Document how to use the version policy to pin a specific version, or serve multiple versions, of a model; see the version-selection sketch after this list. (commit: 2724bfe)
- Document config reloading and model version labels. (commit: f4890af)
- Fix the compile error on ARM-32 in net_http/server. (commit: 5446fd9)
- Add `ModelSpec` to `SessionRunResponse`. (commit: 58a2263)
- Add MKL support (commit: 8f79253)
- Fix default path of Prometheus metrics endpoint (commit: 9d05b0c)
- Add monitoring metrics for saved model (export_dir) warm up latency. (commit: de0935b)
- Add more details/clarification to model version labels documentation. (commit: f9e6ac4)
- Split `--tensorflow_session_parallelism` flag into two new flags: `--tensorflow_intra_op_parallelism` and `--tensorflow_inter_op_parallelism` (commit: 71092e4)
- Update CPU Docker images to Ubuntu 18.04 (commit: 8023fba)
- Upgrade to Bazel 0.20.0 (commit: fc0b75f)
- Update Python 2 scripts to be compatible with both Python 2 and 3 (commit: 846d443)
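For the version-policy and version-label documentation above, a brief sketch of how a client might select a version on a Predict request, assuming the server's model config already maps a (hypothetical) label "stable" to a loaded version; the model name, input tensor, and port are placeholders:

```python
import grpc
import tensorflow as tf
from tensorflow_serving.apis import predict_pb2, prediction_service_pb2_grpc

channel = grpc.insecure_channel("localhost:8500")  # conventional gRPC port, used here as a placeholder
stub = prediction_service_pb2_grpc.PredictionServiceStub(channel)

request = predict_pb2.PredictRequest()
request.model_spec.name = "my_model"           # hypothetical model name
request.model_spec.version_label = "stable"    # label defined in the server's model config
# Alternatively, pin an exact version number instead of a label:
# request.model_spec.version.value = 42
request.model_spec.signature_name = "serving_default"
request.inputs["x"].CopyFrom(tf.make_tensor_proto([[1.0, 2.0]]))  # hypothetical input tensor

print(stub.Predict(request, timeout=5.0))
```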
Thanks to our Contributors
This release contains contributions from many people at Google, as well as:
Daniel Shi, Karthik Vadla, lapolonio, robert, Shintaro Murakami, Siju, Tom Forbes, Ville Törhönen