Details
(Bugfix, ggml-cuda) Pool alloc count fix + small size computation type adjustment (#18559)
-
CUDA: Fixed obj byte size instead of obj count being passed to pool alloc (fattn-common, dst_tmp_meta)
-
CUDA: Explicitly casted some of the int alloc counts before multiplication in argsort
Co-authored-by: pl752 maximpl752@gmail.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: