github NVIDIA/cccl v3.0.3

15 hours ago

What's Changed

🔄 Other Changes

  • Backport #5442 to branch/3.0x by @shwina in #5469
  • Backport to 3.0: Fix grid dependency sync in cub::DeviceMergeSort (#5456) by @bernhardmgruber in #5461
  • Partial backport to 3.0: Fix SMEM alignment in DeviceTransform by @bernhardmgruber in #5463
  • [Version] Update branch/3.0.x to v3.0.3 by @github-actions[bot] in #5502
  • [Backport branch/3.0.x] NV_TARGET and cuda::ptx for CTK 13 by @fbusato in #5481
  • [BACKPORT 3.0]: Update PTX ISA version for CUDA 13 (#5676) by @miscco in #5700
  • Backport some MSVC test fixes to 3.0 by @miscco in #5819
  • [Backport 3.0]: Work around submdspan compiler issue on MSVC (#5885) by @miscco in #5903
  • Backport pin of llvmlite dependency to branch/3.0x by @shwina in #6000
  • [Backport branch/3.0.x] Ensure that we are actually calling the cuda APIs ... (#4570) by @davebayer in #6098
  • [Backport to 3.0] add a specialization of __make_tuple_types for complex<T> (#6102) by @davebayer in #6117
  • [Backport 3.0.x] Use proper qualification in allocate.h (#4796) by @wmaxey in #6126

Full Changelog: v3.0.2...v3.0.3

Don't miss a new cccl release

NewReleases is sending notifications on new releases.