github NVIDIA/cccl python-0.6.0
CCCL Python Libraries v0.6.0

These are the release notes for the cuda-cccl Python package version 0.6.0, dated April 9th, 2026. The previous release was v0.5.1.

cuda-cccl is in "experimental" status, meaning that its API and feature set can change quite rapidly.

Installation

Please refer to the install instructions in the project documentation.

API breaking changes

  • cuda.coop refactored to use maker factory functions (#7713)

Features

  • ShuffleIterator — New iterator type added to cuda.compute (#7721)
  • max_segment_size guarantee — Exposed in the public API (#8284)
  • LTO-IR support — Can now directly pass LTO-IR for custom operators (#7625)
  • Numba-optional install — Added a path to install cuda.compute without Numba as a dependency (#7633)
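These notes do not describe ShuffleIterator's constructor or how it works internally. As a conceptual illustration only, the pure-Python sketch below shows one common way to expose a shuffled view of the index range [0, n) lazily, without materializing a permutation array: a small Feistel network combined with cycle-walking. The names `shuffled_index` and `_round_fn`, the hash-based round function, and the `seed` and `rounds` parameters are all hypothetical and are not part of cuda.compute. The actual ShuffleIterator may use a different scheme.

```python
import hashlib


def _round_fn(value: int, seed: int, rnd: int, half_bits: int) -> int:
    """Hypothetical round function: hash (value, seed, round) down to half_bits bits."""
    data = f"{value}:{seed}:{rnd}".encode()
    digest = hashlib.blake2b(data, digest_size=8).digest()
    return int.from_bytes(digest, "little") & ((1 << half_bits) - 1)


def shuffled_index(i: int, n: int, seed: int, rounds: int = 4) -> int:
    """Map index i in [0, n) to a unique index in [0, n), deterministically per seed."""
    if not 0 <= i < n:
        raise IndexError(i)
    # Split the index into two halves of half_bits bits each; together they cover n.
    half_bits = max(1, ((n - 1).bit_length() + 1) // 2)
    mask = (1 << half_bits) - 1
    x = i
    while True:
        left, right = x >> half_bits, x & mask
        # Each Feistel round is invertible, so the whole network is a bijection
        # on [0, 2**(2 * half_bits)).
        for r in range(rounds):
            left, right = right, left ^ _round_fn(right, seed, r, half_bits)
        x = (left << half_bits) | right
        # Cycle-walking: re-apply the bijection until the result lands in [0, n).
        if x < n:
            return x


# Usage: a lazily shuffled view over a small list (assumed example data).
data = ["a", "b", "c", "d", "e", "f", "g"]
shuffled = [data[shuffled_index(i, len(data), seed=42)] for i in range(len(data))]
```

Because every output index is computed independently from its input index, this kind of mapping suits an iterator: any position can be dereferenced on demand, which is the property that makes shuffled iteration practical in parallel algorithms.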

Performance

  • Faster TransformIterator construction (#7660)

Bug Fixes

  • Fix faulty pointer arithmetic in CUB dispatch (#7940)
  • Fix merge sort returning negative temp storage bytes (#7916)
  • Fix histogram build object caching when using privatized smem strategy (#7657)
