Details
cuda : add RDNA4-specific MMVQ parameter table for bs=1 decode (#19478)
- mmvq: add RDNA3/RDNA4-specific parameter table (nwarps=8, rows=1)
- mmvq: add dedicated RDNA3 parameter table
- mmvq: exclude RDNA3.5 (gfx1150/1151) from RDNA3 table
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: