IMPORTANT: Due to the upstream ClickHouse regression ClickHouse/ClickHouse#89693, DDL queries may not work on newly created ClickHouse pods. It affects only Kubernetes deployments running some recent ClickHouse versions (25.8.10 and above). The workaround is to restart the ClickHouse pods. The problem is fixed by ClickHouse/ClickHouse#92339; see its backports for the different release branches. The fix is also backported to Altinity Stable 25.8.16.10001.
Closes #1883 and #1913
Added
- Added an option to abort reconcile if STS needs to be recreated. It can be configured in operator configuration or CHI.
  # Reconcile StatefulSet scenario
  reconcile:
    statefulSet:
      recreate:
        # What to do in case operator is in need to recreate StatefulSet?
        # Possible options:
        # 1. abort - abort the process, do nothing with the problematic StatefulSet, leave it as it is,
        #    do not try to fix or delete or update it, just abort reconcile cycle.
        #    Do not proceed to the next StatefulSet(s) and wait for an admin to assist.
        # 2. recreate - proceed and recreate StatefulSet.
        # Triggered when StatefulSet update fails or StatefulSet is not ready
        onUpdateFailure: recreate
- Added an option to configure system tables for metrics scraping. The default is the system.metrics and system.custom_metrics tables, but those can be changed with a regular expression if needed:
  tablesRegexp: "^(metrics|custom_metrics)$"
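To see what the default tablesRegexp value selects, here is a minimal Python sketch. It assumes the pattern is matched against the bare table name (without the system. prefix) and that the operator's regex engine treats this simple pattern the same way Python's re module does; neither is confirmed by these notes. The helper selected_tables and the sample table list are hypothetical.

```python
import re

# Default pattern quoted in the release notes.
TABLES_REGEXP = r"^(metrics|custom_metrics)$"


def selected_tables(table_names, pattern=TABLES_REGEXP):
    """Return the table names the pattern selects, preserving input order.

    Hypothetical helper: assumes matching is done on the bare table name.
    """
    compiled = re.compile(pattern)
    return [name for name in table_names if compiled.search(name)]


# Sample names for illustration only.
candidates = ["metrics", "custom_metrics", "asynchronous_metrics", "metrics_old", "events"]

print(selected_tables(candidates))
# Extending the alternation adds one more table to the selection.
print(selected_tables(candidates, r"^(metrics|custom_metrics|events)$"))
```

Because the pattern is anchored with ^ and $, names that merely contain "metrics" (such as asynchronous_metrics) are not selected; a table must be listed explicitly in the alternation.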
Changed
- The suspend flag now immediately aborts a running reconcile. Previously, it did not affect the one that was running
- When suspend flag is set, any reconcile attempt automatically sets CHI/CHK status to aborted.
- Add optional registry prefix for operator and metrics images in Helm chart by @lesandie in #1928
- Improve ClickHouse Keeper Grafana Dashboard by @discostur in #1872
- Add CRDHook annotations by @eyyu in #1914
- Hotfix crdhook, add imagePullSecrets by @Slach in #1917
- Fix installer to default template URL to OPERATOR_VERSION by @realyota in #1910
- Sort keys in Settings.Keys() method for consistent order (fix manifest reconcile issue) by @mastercactapus in #1900
- Multiple documentation fixes
Fixed
- Fixed Keeper rolling update logic. Closes #1796 #1915
- Fixed a bug where a replica was not added to monitoring until it had caught up on replication lag
- Fixed version parsing for FIPS compatible builds of ClickHouse. Closes #1850
- Fixed stop and suspend attributes for CHK that were previously ignored
- Fix distributed_ddl.replicas_path mismatch that could prevent sharing (Zoo)Keeper between multiple clusters by @Elmo33 in #1922
- Fixed a bug where defaults.storageManagement.reclaimPolicy was not respected
- Fixed slow initial connectivity to newly created pods caused by DNS search list exhaustion (ndots:5). Added trailing dot to FQDN and increased connect timeout
- Fixed a bug where reconcile settings specified at CHI level (e.g. spec.reconcile.statefulSet.recreate.onUpdateFailure) were not inherited by cluster-level reconcile configuration
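The DNS fix listed above (adding a trailing dot to the FQDN under ndots:5) is easier to follow with a small model of resolver behavior. The sketch below is not the operator's code; it only mimics the standard resolv.conf rules: a name ending in a dot is looked up as-is, a name with fewer dots than ndots is tried against every search domain first, and otherwise the name is tried as-is before the search list. The function lookup_order, the host name, and the search domains are hypothetical.

```python
def lookup_order(name, search_domains, ndots=5):
    """Return the sequence of absolute names a stub resolver would try.

    Simplified model of resolv.conf 'search' and 'ndots' handling.
    """
    if name.endswith("."):
        # A trailing dot marks the name as fully qualified: one lookup only.
        return [name]
    absolute = name + "."
    expanded = [f"{name}.{domain}." for domain in search_domains]
    if name.count(".") >= ndots:
        # Enough dots: try the name as-is first, then the search list.
        return [absolute] + expanded
    # Too few dots: walk the whole search list before trying the name as-is.
    return expanded + [absolute]


# Hypothetical pod host name and search list, for illustration only.
search = ["test.svc.cluster.local", "svc.cluster.local", "cluster.local"]
host = "chi-demo-cluster-0-0.test.svc.cluster.local"  # 4 dots, below ndots:5

for attempt in lookup_order(host, search):
    print("without trailing dot:", attempt)
for attempt in lookup_order(host + ".", search):
    print("with trailing dot:   ", attempt)
```

In this model the name without a trailing dot costs three search-list lookups before the real one, while the dotted form resolves with a single query.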
Other
- Go stdlib has been upgraded to 1.25.6 to address CVEs
- Operator has been certified for 25.8.16.10001 Altinity.Stable.
New Contributors
- @rajdudhare1 made their first contribution in #1890
- @siggy made their first contribution in #1906
- @eyyu made their first contribution in #1914
- @Elmo33 made their first contribution in #1922
- @mastercactapus made their first contribution in #1900
- @lesandie made their first contribution in #1928
Full Changelog: release-0.25.6...release-0.26.0