4.0.9
Release Date: April 16, 2026
Behavior Changes
- When VARBINARY columns appear inside nested types (ARRAY, MAP, or STRUCT), StarRocks now correctly encodes the values in binary format in MySQL result sets. Previously, raw bytes were emitted directly, which could break text-protocol parsing for null bytes or non-printable characters. This change may affect downstream clients or tools that process VARBINARY data inside nested types. #71346
- Routine Load jobs now automatically pause when a non-retryable error is encountered, such as a row causing the Primary Key size limit to be exceeded. Previously, the job would retry indefinitely because such errors were not recognized as non-retryable by the FE transaction status handler. #71161
SHOW CREATE TABLEandDESCstatements now display the Primary Key columns for Paimon external tables. #70535- Cloud-native tablet metadata fetch operations (such as
get_tablet_statsandget_tablet_metadatas) now use a dedicated thread pool instead of the sharedUPDATE_TABLET_META_INFOpool. This prevents metadata fetch contention from impacting repair and other tasks. The new thread pool size is configurable via a new BE parameter. #70492
Improvements
- Added session variables to control the encoding behavior of VARBINARY values in MySQL protocol responses, providing fine-grained control over binary result encoding in client connections. #71415
- Added a
snapshot_meta.jsonmarker file to cluster snapshots to support integrity validation before snapshot restoration. #71209 - Added warning logs for silently swallowed exceptions in
WarehouseManagerto improve observability of silent failures. #71215 - Added metrics for Iceberg metadata table queries to support performance monitoring and diagnosis. #70825
- The
regexp_replace()function now supports constant folding during FE query planning, reducing planning overhead for queries with constant string arguments. #70804 - Added categorized metrics for Iceberg time travel queries to improve monitoring and performance analysis. #70788
- Added log output when update compaction is suspended, improving visibility into compaction lifecycle. #70538
SHOW COLUMNSnow returns column comments for PostgreSQL external tables. #70520- Added support for dumping query execution plans when a query encounters an exception, improving diagnosability of runtime failures. #70387
- Tablet deletion during DDL operations is now batched, reducing write lock contention on tablet metadata. #70052
- Added a Force Drop recovery mechanism for synchronous materialized views that are stuck in an error state and cannot be dropped through normal means. #70029
Bug Fixes
The following issues have been fixed:
- An issue where the profile
START_TIMEandEND_TIMEwere not displayed in the session timezone. #71429 - A shared-object mutation bug in
PushDownAggregateRewriterwhen processing CASE-WHEN/IF expressions, which could cause incorrect query results. #71309 - A use-after-free bug in
ThreadPool::do_submittriggered when thread creation fails. #71276 - An issue where
information_schema.tablesdid not properly escape special characters in equality predicates, causing incorrect results. #71273 - An issue where the materialized view scheduler continued to run after the materialized view became inactive. #71265
- Fixed a task signature collision in
UpdateTabletSchemaTaskacross concurrent ALTER jobs that could cause schema update tasks to be skipped. #71242 - An issue where row count estimation produced NaN values for histograms that contained only MCV (Most Common Values) entries. #71241
- A missing dependency on the AWS S3 Transfer Manager in the AWS SDK integration. #71230
- An issue where
TaskManagerscheduler callbacks did not verify whether the current node is the leader, potentially causing duplicate task execution on follower nodes. #71156 - A thread-local context pollution issue where
ConnectContextinformation was not cleared after a leader-forwarded request completed. #71141 - An issue where the partition predicate was missing in short-circuit point lookups, causing incorrect query results. #71124
- A NullPointerException when analyzing generated columns during Stream Load or Broker Load if a column referenced by the generated column expression was absent from the load schema. #71116
- A use-after-free bug in the error handling path of parallel segment and rowset loading. #71083
- An issue where delvec orphan entries were left behind when a write operation preceded compaction in the same publish batch. #71049
- An issue where queries appeared in the
current_queriesresult via HTTP loopback when checking query progress internally. #71032 - CVE-2026-33870 and CVE-2026-33871. #71017
- A read lock leak in
SharedDataStorageVolumeMgr. #70987 - An issue where the input and result columns of the
locate()function shared the same NullColumn reference inside BinaryColumns, causing incorrect results. #70957 - An issue where safe tablet deletion checks were incorrectly applied during ALTER operations in share-nothing mode. #70934
- A race condition in
_all_global_rf_ready_or_timeoutthat could prevent global runtime filters from being applied correctly. #70920 - An int32 overflow in the
ACCUMULATEDmetric macro that caused metric values to silently overflow. #70889 - Incorrect aggregation results in dictionary-encoded merge GROUP BY queries. #70866
- CVE-2025-54920. #70862
- A potential data loss issue in aggregation spill caused by incorrect hash table state handling during
set_finishing. #70851 - An issue where the
content-lengthheader was not reset whenproxy_pass_request_bodyis disabled. #70821 - An issue where the spill directory for load operations was cleaned up in the object destructor rather than during
DeltaWriter::close(), potentially causing premature deletion of spill data. #70778 - An issue where
INSERT INTO ... BY NAMEfromFILES()did not correctly push down the schema for partial column sets. #70774 - An issue where connector scan nodes did not reset the scan range source on query retry, causing incorrect results upon retry. #70762
- A potential rowset metadata loss for Primary Key model tablets caused by a GC race during disk re-migration of the form A→B→A. #70727
- An issue where a query-scoped warehouse hint leaked the
ComputeResourceobject inConnectContext, potentially affecting subsequent queries on the same connection. #70706 - An issue where redundant conjuncts in
MySqlScanNodeandJDBCScanNodecaused BE errors related toVectorizedInPredicatetype mismatches. #70694 - A missing
libssl-devdependency in the Ubuntu runtime environment. #70688 - An issue where Iceberg manifest cache completeness was not validated on read, leading to incorrect scan results when the cache was partially populated. #70675
- A duplicate closure reference in
_tablet_multi_get_rpcthat could cause use-after-free. #70657 - Partial manifest cache writes in the Iceberg
ManifestReaderthat could result in incomplete cache entries and incorrect scan behavior. #70652 - A crash in
array_map()when processing arrays that contain null literal elements. #70629 - A stack overflow in the
to_base64()function when processing large inputs. #70623 - An issue where
INSERT INTO ... BY NAMEfromFILES()used positional column mapping instead of name-based mapping, causing data to be written to incorrect columns. #70622 - An issue where
NOT NULLconstraints were incorrectly pushed down into the schema inferred fromFILES(), causing load failures for nullable columns. #70621 - An issue where precise external materialized view refresh did not fall back correctly for Iceberg-like connectors. #70589
- A
num_short_key_columnsmismatch when constructing a partial tablet schema, which could cause data read errors. #70586 - A BE crash that occurred when the child iterator was exhausted in
MaskMergeIterator. #70539 - An issue where materialized view refresh jobs repeatedly refreshed partitions whose corresponding Iceberg snapshots had expired. #70523
- An issue where starlet configuration parameters could not be set. #70482
- An issue where the lock-free materialized view rewrite path incorrectly fell back to live metadata, causing inconsistent rewrite behavior. #70475
- An issue in
JoinHashTable::merge_htwhere dummy rows were not skipped for expression-based join key columns, causing incorrect join results. #70465 - An incorrect equality comparison in
InformationFunctionthat could produce wrong results in certain queries. #70464 - A column type mismatch in the
__iceberg_transform_bucketinternal function. #70443 - An issue where Iceberg materialized view refresh failed when Iceberg snapshot timestamps were non-monotonic. #70382
- An issue where user authentication credentials were exposed in audit logs and SQL redaction output. #70360
- A CN crash that occurred when scanning an empty tablet with physical split enabled. #70281
- An issue where the VARCHAR column length was not preserved after a redundant CAST was eliminated during query optimization. #70269
- An issue where brpc connection retry logic did not correctly handle a wrapped
NoSuchElementException, causing connection failures after the retry attempt. #70203 - An issue where null fractions for outer join columns were not preserved during statistics estimation, leading to suboptimal query plans. #70144
- A memory tracker leak in connector sink operations running on poller threads. #70121