Details
CUDA: Replace init_offsets kernel with iterators in cub-based argsort (#18930)
- CUDA: Replace
init_offsetswith iterators in argsort
This is a QOL improvement, saving us the cost of materializing the
iterator
- Remove unnecessary include from top-k.cu
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: