github tensorflow/serving 2.18.0-rc0

pre-release16 hours ago

Major Features and Improvements

  • No major features or improvements.

Breaking Changes

  • No breaking changes.

Bug Fixes and Other Changes

  • Extend GbmcChannel interface to implement redfish channel for TPUs (commit: 683cb64)
  • Add tests to validate monitoring states. (commit: fab5c05)
  • Disable xnn_enable_avx256vnnigfni (commit: 19f9ccf)
  • Reduce duplicate code using a test class (commit: 51cf3a7)
  • Define an option to specify different IFRT client. (commit: aca5cfa)
  • Add release notes for tf-serving 2.17.0 (commit: b72a86e)
  • avoid SetNumLoadThreads stall the server by forcing reset ThreadPool (commit: 6b9cf7c)
  • Add max_enqueued_batches option for model servers (commit: 7c99259)
  • Remove gpr_set_log_verbosity from grpc_client.cc (commit: 6e05a38)
  • Add option to stop retrying on permanent loading errors. (commit: 9ba72fa)
  • Add the batch_padding_policy attribute the tensorflow serving api. (commit: ea02141)
  • Improve handling of large JSON objects. (commit: 6cb0131)
  • Silence warnings from external code (commit: 010d61a)
  • Migration of the histogram header and cc code for TSL. Move tsl/lib/histogram to compiler/tsl/lib/histogram and update users. (commit: ab33df4)
  • Add hermetic CUDA repository rule calls to TF serving project. (commit: 787c85f)
  • Update users of status_test_util to use the new location in xla/tsl (commit: 22b2b1e)
  • Bump Bazel version from 6.4.0 to 6.5.0. (commit: 82e532f)
  • provide an option to customize the sort order among servable names (commit: 32a85a8)
  • Remove cc_api_version stage 4: deletion where cc_api_version = 2 (commit: 7e0c196)
  • Remove cc_api_version stage 4: deletion where cc_api_version = 2 (commit: 48e0f56)
  • This is a noop comment update for streaming inputs. (commit: cfac240)
  • Add a resource kind for number of LoRA models. (commit: 6b7ba27)
  • Disable more warnings to make logs cleaner (commit: 4a830ca)
  • Add bool return_single_response field to PredictStreamedOptions. (commit: 648c9ee)
  • Use gcc-10 to avoid build issues while building XLA on CI (commit: 8bd1fda)
  • Create separate kokoro config (commit: dbc7681)
  • Remove top-level .bazelrc settings now that scripts use --config=kokoro (commit: f920b98)
  • Update Dockerfile.devel to build with gcc-10 (commit: f9c0262)
  • Move tsl/lib/monitoring to xla/tsl/lib/monitoring (commit: cb934df)
  • Delete 'enable_lazy_split', since the flag is not used anywhere. The code paths for the above flag being false are retained and true are eliminated. This will ensure that improving batching will be easier. (commit: 873993f)
  • BUILD rule fix. (commit: d89b272)
  • Automated Code Change (commit: 4decd0a)
  • Automated Code Change (commit: 0b05e86)
  • Fix build error (commit: d341c34)
  • Added capability to use XLA on a GPU. (commit: e5e795f)
  • Update version for 2.18.0-rc0 release. (#2258) (commit: d6d4022)
  • Mark Tensorflow compatible with Protobuf v26+. (#2261) (commit: 424dba4)
  • Update version for 2.18.0-rc0 release. (#2262) (commit: 67f4ee8)
  • This release is based on TF version 2.18.0-rc2.

Don't miss a new serving release

NewReleases is sending notifications on new releases.