4.48.0 (2026-05-01)
Features Added
- Added `additionalHeaders` support to allow setting additional headers (e.g., `x-ms-cosmos-workload-id`) that are sent with every request. - See PR 48128
- Added new `CosmosItemsDataSource.readManyByPartitionKeys` Spark function to execute bulk queries by a list of pk-values with better efficiency. Configure null handling via `spark.cosmos.read.readManyByPk.nullHandling` - default `Null` treats a null PK column as JSON null (`addNullValue`), `None` treats it as `PartitionKey.NONE` (`addNoneValue` / `NOT IS_DEFINED`). These route to different physical partitions - picking the wrong mode silently returns zero rows. See PR 48801
- Added Spark config `spark.cosmos.read.readManyByPk.maxConcurrentBatchPrefetch` (default `1`) to bound the per-task prefetch parallelism the SDK uses inside `readManyByPartitionKeys`. See PR 48801
- Added Spark config `spark.cosmos.read.readManyByPk.maxBatchSize` (default `100`) to set the max. number of partition keys used for a single batch. See PR 48930
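
A minimal sketch of how the three `readManyByPk` settings listed above might be assembled on the caller side. The config keys and defaults come from these notes. The helper names `read_many_by_pk_options` and `split_into_batches` are hypothetical, the strict check on the two null-handling values is an assumption (the notes do not say whether other spellings are accepted), and the batching function only illustrates what a batch-size limit means; it is not the SDK's implementation.

```python
from typing import Dict, List, Sequence

# Config keys and defaults as named in the release notes.
NULL_HANDLING_KEY = "spark.cosmos.read.readManyByPk.nullHandling"
PREFETCH_KEY = "spark.cosmos.read.readManyByPk.maxConcurrentBatchPrefetch"
BATCH_SIZE_KEY = "spark.cosmos.read.readManyByPk.maxBatchSize"

# Assumption: only these two spellings are treated as valid here.
VALID_NULL_HANDLING = ("Null", "None")


def read_many_by_pk_options(
    null_handling: str = "Null",
    max_concurrent_batch_prefetch: int = 1,
    max_batch_size: int = 100,
) -> Dict[str, str]:
    """Hypothetical helper: build the readManyByPk options as a string map.

    Failing fast on an unknown null-handling mode is deliberate: per the
    notes, the wrong mode does not raise an error, it returns zero rows.
    """
    if null_handling not in VALID_NULL_HANDLING:
        raise ValueError(
            f"nullHandling must be one of {VALID_NULL_HANDLING}, got {null_handling!r}"
        )
    if max_concurrent_batch_prefetch < 1:
        raise ValueError("maxConcurrentBatchPrefetch must be >= 1")
    if max_batch_size < 1:
        raise ValueError("maxBatchSize must be >= 1")
    return {
        NULL_HANDLING_KEY: null_handling,
        PREFETCH_KEY: str(max_concurrent_batch_prefetch),
        BATCH_SIZE_KEY: str(max_batch_size),
    }


def split_into_batches(partition_keys: Sequence, max_batch_size: int) -> List[list]:
    """Illustration only: chunk partition-key values so that no batch
    holds more than max_batch_size entries."""
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be >= 1")
    return [
        list(partition_keys[i : i + max_batch_size])
        for i in range(0, len(partition_keys), max_batch_size)
    ]


if __name__ == "__main__":
    # Containers whose documents omit the PK property would need "None".
    print(read_many_by_pk_options(null_handling="None", max_batch_size=50))
    print([len(b) for b in split_into_batches(list(range(250)), 100)])
```

The resulting map would be merged with the rest of the connector configuration (account endpoint, database, container). How it is handed to `readManyByPartitionKeys` is not shown in these notes; PR 48801 is the place to check.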
Other Changes
- Refactored to use shared `azure-cosmos-spark_4` base module for code common across Spark 4.x versions. - See PR 48861