Details
opencl: add flattened q6_K mv (#19054)
-
opencl: flatten
q6_Kand addkernel_mul_mv_q6_K_f32_flat -
opencl: clean up
-
opencl: refactor q6_K mv - put loop body in
block_q_6_K_dot_y_flat -
opencl: tweak the workgroup size a bit
-
opencl: output 4 values per subgroup for
kernel_mul_mv_q6_K_f32_flat -
opencl: proper alignment for q6_K
-
opencl: boundary handling for flattened q6_K mv
-
opencl: rename q6_K mv kernel file
-
opencl: put flattened q6_K mv in its own file
-
opencl: use lower k in file name
-
opencl: use K in variable names
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: