Release date: Nov 28, 2025
Enhancements:
-
Improved network failure detection for replica instances by setting the default
tcp_user_timeoutto 5 seconds. This change helps replicas detect and recover from silent network drops more quickly. Previously, replicas could wait up to 127 seconds before detecting such failures; with the new timeout, they reconnect to the primary within 5 seconds. To preserve the previous behavior, setSTANDBY_TCP_USER_TIMEOUTto0in the operator configuration. (#9317) -
Enhanced cluster restore to wait for all init containers to complete before starting the restore process. This ensures that backup tools running in init containers finish preparing the data before the restore begins. The implementation correctly handles Kubernetes init container sidecars by ignoring those with
RestartPolicy=Always. (#9026)
-
Added the
PGBOUNCER_IMAGE_NAMEoperator configuration parameter to allow overriding the default PgBouncer image. This is useful for air-gapped environments or when using internal registries. (#9232) -
cnpgplugin:- Added a
--timeoutflag to thekubectl cnpg statuscommand for configuring the timeout for filesystem operations such as calculating cluster size. The default remains 10 seconds but can be adjusted for large clusters where operations may take longer. (#9201)
- Added a
Fixes:
- Improved resilience of all probe types (liveness, readiness, and startup) to transient Kubernetes API server connectivity issues. Probes now use a caching mechanism that falls back to cached cluster definitions during brief network interruptions, preventing unnecessary pod restarts and probe failures. (#9148)
- Fixed the
CheckEmptyWalArchivesafeguard to run correctly when restoring from a volume snapshot using CNPG-I backup/WAL plugins (e.g.,plugin-barman-cloud). Previously, this check was skipped for plugin-based implementations. (#9306)
- Improved error reporting when ImageCatalog retrieval fails. The operator now emits a Warning event and logs errors for all failure types, not just
NotFounderrors, improving visibility into configuration issues. (#9266)
- Fixed TLS certificate verification issues when connecting to CNPG-I plugins by adding the
cnpg.io/pluginServerNameannotation. This allows customizing the DNS name used for certificate verification in environments where the plugin's certificate uses a different DNS name than the Service name. (#9222)
- Fixed an issue where the instance manager controller could fail to restart after an error, reporting a "controller already exists" message. The controller now uses
SkipNameValidationfor subsequent initialization attempts. Contributed by @mateusoliveira43. (#9123)