github SeldonIO/seldon-core v2.8.4

16 hours ago

Overview

This release introduces a number of features that help users run core 2 with dynamic loads. We added support for HPA model & server autoscaling based on custom metrics in the single-model-serving case. Documentation showing an example of scaling based on model RPS is described here.

This release also reduces the risk of data plane outage in the case of control plane issues (specifically when seldon-scheduler is restarting). This is done by synchronising the process of the scheduler (re)start with the rest of the system.

A number of features have been added, most notably users can now upgrade a model to a new version that is part of an ongoing experiment.

We are migrating docs to a new site here (experimental).

(Main) Features:

  • Add gitbook support (experimental) #5943
  • Add metadata object to a seldonconfig component #5918
  • seldon-scheduler startup synchronisation #5930
  • Envoy Grafana Dashbaord #5894
  • Upgrade models in experiment #5874
  • Add pipeline and model name validation #5872
  • Add retry parameters #5875

(Main) Bugs Fixed:

  • Skip out of order control messages in seldon-agent #5969
  • Use StatefulSet.Spec.Replicas as reference for Server status #5945
  • Cleaning up models that fail to load #5857 and #5830

Changelog

All notable changes to this project will be documented in this file. Dates are displayed in UTC.

Generated by auto-changelog.

v2.8.4

11 October 2024

  • fix(ci): Changes from v2 for release 2.8.4 (5) #5970
  • fix(docs): revert changes for changelog.md #5971
  • fix(agent): Skip agent out of order control messages #5969
  • Update README.md #5955
  • fix(docs): changed the name from core v2 to Core 2 #5963
  • fix(ci): Changes from v2 for release 2.8.4 (4) #5968
  • increase drain grace period wait #5967
  • refactor: error checks and nil pointer check #5964
  • fix(ci): changes from v2 for release 1.8.4 (3) #5966
  • fix(scheduler): Add grace period for model unload #5965
  • Update README.md #5962
  • fix(ci): Changes from v2 for release 2.8.4 (2) #5956
  • fix(controller): Adjust controller -> scheduler state recreation upon scheduler disconnect. #5944
  • feat(docs): improve HPA autoscaling docs #5953
  • fix(docs): fix HPA-autoscaling formula in old docs #5950
  • feat: bump k8s kind to 1.31 #5954
  • Merge GitBook Docs with V2 master #5943
  • feat: adding -ldflag and -trimpath to go build #5940
  • fix(k8s/yaml): add timing environment variables to triton agent #5949
  • fix(operator): Use StatefulSet.Spec.Replicas as reference for Server status #5945
  • Bump ubi9/ubi-minimal in /scheduler #5941
  • Bump rclone/rclone from 1.68.0 to 1.68.1 in /scheduler #5942
  • fix(ci): Release 2.8.4 #5934
  • feat(charts): adding metadata object to a seldonconfig component #5918
  • feat(operator): add Model selector for scale subresource to enable HPA-based scaling #5932
  • fix(scheduler): Scheduler wait on server connect #5930
  • Bump envoyproxy/envoy from v1.31.1 to v1.31.2 in /scheduler #5923
  • fix headers key for http k6 test #5920
  • feat(chart): exposed retry constants as helm chart #5889
  • create a CDS cluster per model #5916
  • feat(o11y): Envoy Grafana Dashbaord #5894
  • Bump ubi9/ubi-minimal from 9.4-1227 to 9.4-1227.1725849298 in /scheduler #5913
  • Bump rclone/rclone from 1.67.0 to 1.68.0 in /scheduler #5914
  • Bump envoyproxy/envoy from v1.31.0 to v1.31.1 in /scheduler #5915
  • fix(scheduler): Controller to update the list of servers to scheduler on reconnect #5893
  • fix(bug): time unit fix #5892
  • feat(env): parameters exposed as env variables #5875
  • Bump ubi9/ubi-micro from 9.4-13 to 9.4-15 in /scheduler #5877
  • Bump ubi9/ubi-minimal from 9.4-1194 to 9.4-1227 in /scheduler #5878
  • Bump ubi9/ubi-micro from 9.4-13 to 9.4-15 in /hodometer #5879
  • Bump github.com/signalfx/splunk-otel-go/instrumentation/github.com/confluentinc/confluent-kafka-go/v2/kafka/splunkkafka #5881
  • Bump ubi9/ubi-micro from 9.4-13 to 9.4-15 in /operator #5885
  • feat: pipeline and model name validation #5872
  • fix(envoy): upgrade models in experiment #5874
  • feat(grafana): add performance dashboard #5852
  • fix(golangci-lint): version bump and location fix #5853
  • Re-generate license info #5871
  • Re-generate license info #5869
  • Bump github.com/envoyproxy/go-control-plane in /scheduler #5837
  • Bump github.com/spf13/cobra from 1.8.0 to 1.8.1 in /operator #5841
  • Bump grafana/grafana from 11.1.3 to 11.2.0 in /scheduler #5858
  • Bump google.golang.org/grpc from 1.65.0 to 1.66.0 in /hodometer #5861
  • fix(model-gateway): Create consumer if model can serve traffic #5865
  • fix(agent): cleaning up models that fail to load #5857
  • Bump google.golang.org/protobuf from 1.34.1 to 1.34.2 in /operator #5820
  • Bump grafana/grafana from 11.1.1 to 11.1.3 in /scheduler #5823
  • fix(agent): cleanup rclone directory on errors #5830
  • feat(scheduler): Add max elapsed duration for model load/unload #5819
  • Bump github.com/onsi/gomega from 1.33.1 to 1.34.0 in /scheduler #5799
  • fix: Add experiments pipeline tests for envoy processor #5810
  • Re-generate license info #5813
  • Re-generate license info #5811
  • Bump ubi9/ubi-minimal from 9.4-1134 to 9.4-1194 in /scheduler #5800
  • Bump grafana/grafana from 11.1.0 to 11.1.1 in /scheduler #5801
  • Bump ubi9/ubi-micro from 9.4-9 to 9.4-13 in /hodometer #5795
  • Bump ubi9/ubi-micro from 9.4-9 to 9.4-13 in /operator #5798
  • Bump ubi9/ubi-micro from 9.4-9 to 9.4-13 in /scheduler #5802
  • Bump envoyproxy/envoy from v1.30.4 to v1.31.0 in /scheduler #5780
  • Re-generate license info #5792
  • fix(ci): add additional licenses for transitive dependencies #5791
  • build(grpc): move to latest versions of protobuf/grpc codegen tools #5789
  • Re-generate license info #5790
  • build(deps): bump google.golang.org/grpc to 1.65.0 #5748
  • feat(helm-charts): allow pod spec overrides via values #5771
  • feat(ansible): allow local (host) mounts in kind #5770
  • Re-generate license info #5769
  • Bump github.com/confluentinc/confluent-kafka-go/v2 in /scheduler #5755
  • Bump go.opentelemetry.io/otel/sdk from 1.27.0 to 1.28.0 in /scheduler #5756
  • Bump go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp #5757
  • Bump github.com/confluentinc/confluent-kafka-go/v2 in /operator #5759
  • Bump go.opentelemetry.io/contrib/instrumentation/github.com/gorilla/mux/otelmux #5758
  • Update changelog.yml #5764
  • fix(ci): Update changelog.yml #5763
  • Generating changelog for v2.8.4 e7b41bb
  • Generating changelog for v2.8.4-rc1 5c0d866
  • Setting version for yaml manifests e148660
  • Setting version for helm charts 54980bb
  • Setting version for helm charts 2d27dae

Don't miss a new seldon-core release

NewReleases is sending notifications on new releases.