github Altinity/clickhouse-operator release-0.26.0

5 hours ago

IMPORTANT: Due to the upstream ClickHouse regression ClickHouse/ClickHouse#89693, DDL queries may not work on newly created ClickHouse pods. The regression affects Kubernetes deployments only, and only some newer ClickHouse versions (25.8.10 and above). The workaround is to restart the ClickHouse pods. The problem is fixed by ClickHouse/ClickHouse#92339; see the backports for the different release branches. The fix is also backported to Altinity Stable 25.8.16.10001.
Closes #1883 and #1913

Added

  • Added an option to abort reconcile if a StatefulSet (STS) needs to be recreated. It can be configured in the operator configuration or in the CHI.
# Reconcile StatefulSet scenario
reconcile:
  statefulSet:
    recreate:
      # What should the operator do when it needs to recreate a StatefulSet?
      # Possible options:
      # 1. abort - abort the process, do nothing with the problematic StatefulSet, leave it as it is,
      #    do not try to fix or delete or update it, just abort reconcile cycle.
      #    Do not proceed to the next StatefulSet(s) and wait for an admin to assist.
      # 2. recreate - proceed and recreate StatefulSet.
      # Triggered when StatefulSet update fails or StatefulSet is not ready
      onUpdateFailure: recreate
  • Added an option to configure which system tables are used for metrics scraping. The defaults are the system.metrics and system.custom_metrics tables, but these can be changed with a regular expression if needed:
    tablesRegexp: "^(metrics|custom_metrics)$"
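
To illustrate what the default pattern above selects, here is a minimal sketch in Python. It is not the operator's code. It assumes the pattern is matched against the bare table name (without the system. prefix) and that the operator's regular expression engine treats this simple pattern the same way Python's re module does. The helper select_tables and all candidate table names other than metrics and custom_metrics are hypothetical.

```python
import re

# Default pattern quoted in the release notes.
TABLES_REGEXP = r"^(metrics|custom_metrics)$"


def select_tables(pattern: str, tables: list[str]) -> list[str]:
    """Return the table names that match the pattern (hypothetical helper)."""
    compiled = re.compile(pattern)
    return [name for name in tables if compiled.search(name)]


# Hypothetical candidates, given without the "system." prefix (an assumption).
candidates = ["metrics", "custom_metrics", "custom_metrics_old", "team_metrics"]

# The ^ and $ anchors force a whole-name match, so "custom_metrics_old"
# and "team_metrics" are rejected by the default pattern.
default_selection = select_tables(TABLES_REGEXP, candidates)

# Adding another alternative to the group widens the selection.
widened_selection = select_tables(
    r"^(metrics|custom_metrics|team_metrics)$", candidates
)

print(default_selection)
print(widened_selection)
```

Because the pattern is anchored at both ends, extra tables have to be added as explicit alternatives (or with a deliberate wildcard) rather than being picked up by accident through a shared prefix or suffix.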

Changed

  • The suspend flag now immediately aborts a running reconcile. Previously, it had no effect on a reconcile that was already in progress.
  • When the suspend flag is set, any reconcile attempt automatically sets the CHI/CHK status to aborted.
  • Add optional registry prefix for operator and metrics images in Helm chart by @lesandie in #1928
  • Improve ClickHouse Keeper Grafana Dashboard by @discostur in #1872
  • Add CRDHook annotations by @eyyu in #1914
  • Hotfix crdhook, add imagePullSecrets by @Slach in #1917
  • Fix installer to default template URL to OPERATOR_VERSION by @realyota in #1910
  • sort keys in Settings.Keys() method for consistent order (fix manifest reconcile issue) by @mastercactapus in #1900
  • Multiple documentation fixes

Fixed

  • Fixed Keeper rolling update logic. Closes #1796 #1915
  • Fixed a bug where a replica was not added to monitoring until it had caught up on replication lag
  • Fixed version parsing for FIPS compatible builds of ClickHouse. Closes #1850
  • Fixed stop and suspend attributes for CHK that were previously ignored
  • Fix distributed_ddl.replicas_path mismatch that could prevent sharing (Zoo)Keeper between multiple clusters by @Elmo33 in #1922
  • Fixed a bug where defaults.storageManagement.reclaimPolicy was not respected
  • Fixed slow initial connectivity to newly created pods caused by DNS search list exhaustion (ndots:5). The operator now adds a trailing dot to the FQDN and uses an increased connect timeout
  • Fixed a bug where reconcile settings specified at CHI level (e.g. spec.reconcile.statefulSet.recreate.onUpdateFailure) were not inherited by cluster-level reconcile configuration
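
The effect of the trailing dot in the DNS fix above can be sketched with a simplified model of how a stub resolver applies the search list and the ndots option. This is an illustration only, not the operator's code. The function candidate_queries, the host name, and the search list (a typical layout for a pod in the default namespace) are assumptions.

```python
def candidate_queries(name: str, search: list[str], ndots: int = 5) -> list[str]:
    """List the names a stub resolver would try, in order, for `name`.

    Simplified model of resolv.conf `search` / `ndots` handling.
    """
    if name.endswith("."):
        # Absolute name: the search list is skipped entirely.
        return [name]
    expanded = [f"{name}.{domain}." for domain in search]
    absolute = f"{name}."
    if name.count(".") >= ndots:
        # Enough dots: try the name as-is first, then the search list.
        return [absolute] + expanded
    # Too few dots: walk the search list first, then try the name as-is.
    return expanded + [absolute]


# Hypothetical pod FQDN and search list.
host = "chi-demo-main-0-0.default.svc.cluster.local"
search_list = ["default.svc.cluster.local", "svc.cluster.local", "cluster.local"]

without_dot = candidate_queries(host, search_list)
with_dot = candidate_queries(host + ".", search_list)

for query in without_dot:
    print("no trailing dot  :", query)
for query in with_dot:
    print("with trailing dot:", query)
```

In this model the host name has four dots, which is below ndots:5, so every search domain is tried before the name itself and the correct name is the last of four lookups. With the trailing dot only one lookup is made.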

Other

  • The Go stdlib has been upgraded to 1.25.6 to address CVEs
  • Operator has been certified for 25.8.16.10001 Altinity.Stable.

Full Changelog: release-0.25.6...release-0.26.0
