What's Changed
🔄 Other Changes
- Backport #5442 to branch/3.0x by @shwina in #5469
- Backport to 3.0: Fix grid dependency sync in cub::DeviceMergeSort (#5456) by @bernhardmgruber in #5461
- Partial backport to 3.0: Fix SMEM alignment in DeviceTransform by @bernhardmgruber in #5463
- [Version] Update branch/3.0.x to v3.0.3 by @github-actions[bot] in #5502
- [Backport branch/3.0.x] NV_TARGET and cuda::ptx for CTK 13 by @fbusato in #5481
- [BACKPORT 3.0]: Update PTX ISA version for CUDA 13 (#5676) by @miscco in #5700
- Backport some MSVC test fixes to 3.0 by @miscco in #5819
- [Backport 3.0]: Work around
submdspan
compiler issue on MSVC (#5885) by @miscco in #5903 - Backport pin of llvmlite dependency to branch/3.0x by @shwina in #6000
- [Backport branch/3.0.x] Ensure that we are actually calling the cuda APIs ... (#4570) by @davebayer in #6098
- [Backport to 3.0] add a specialization of
__make_tuple_types
forcomplex<T>
(#6102) by @davebayer in #6117 - [Backport 3.0.x] Use proper qualification in allocate.h (#4796) by @wmaxey in #6126
Full Changelog: v3.0.2...v3.0.3