Changes since v0.17.4:
Actions Required Before Upgrading
(No, really, you MUST read this before you upgrade)
- Minor releases: Review the .0 release notes for each new minor version you cross; see: v0.16.0, v0.17.0.
- Patch releases: Review the patch release notes leading up to this version, but only within this minor release line; see: v0.17.1, v0.17.2, v0.17.3, v0.17.4.
Changes by Kind
Bug or Regression
- DRA: Fixed a bug where the kueue-controller-manager startup fails when DRA v1 APIs are not available (#11813, @tenzen-y)
- DRA: Fixed hot reconcile loops for inadmissible Workloads with deterministic DRA resolution failures. Kueue now avoids requeueing permanent DRA spec or configuration errors while still retrying transient failures with backoff. (#12094, @thc1006)
- ElasticJobsViaWorkloadSlices: Fixed a bug where regular (non-elastic) workloads with required/preferred topology were rejected when the ElasticJobsViaWorkloadSlicesWithTAS feature is enabled. (#12043, @yaroslava-serdiuk)
- Fixed LocalQueue status updates being rejected ("status.flavorsReservation: Too many: ... must have at most 16 items") when the referenced ClusterQueue has more than 16 flavors, by raising the LocalQueue status flavor limits to 64 to match the ClusterQueue limits. (#12089, @AsherWright)
- Kueue-populator: Fixed events.k8s.io RBAC permissions for event recording. (#12032, @weizhoublue)
- KueueViz: Fixed a bug where the dashboard briefly displayed zero counts for all metrics on page load before the WebSocket connection finished loading. (#12040, @YadavAkhileshh)
- KueueViz: Fixed a layout-bleed bug where switching directly between detail pages briefly rendered stale queue data from the previously visited resource. (#12018, @YadavAkhileshh)
- Observability: Fix ClusterQueue Borrowing Limit metric to display infinity if the limit is unset. (#12106, @mszadkow)
- Observability: Fixed a misleading
kueue_cluster_queue_lending_limitmetric value for ClusterQueues with unsetlendingLimit. Kueue now reports+Inf, matching the actual unconstrained lending behavior instead of reporting 0. (#12171, @weizhoublue) - Observability: add a safeguard check truncating the event messages to make sure the events can be successfully recorded in the API server. (#12090, @olekzabl)
- TAS: Fix a bug where TAS ignores excluded or transformed resources in node capacity tracking. (#12035, @wafrelka)
- TAS: Fixed error handling for TAS topology assignments so Workloads are not considered Fit when topology assignment fails. Kueue now treats such assignment errors as NoFit instead of allowing the Workload to reserve quota. (#12188, @mimowo)
- VisibilityOnDemand: Fixed forbidden list/watch errors caused by unused MutatingAdmissionPolicy informers in the visibility server. (#11875, @kimminw00)