github ggml-org/llama.cpp b9859

4 hours ago
Details

opencl: allow loading precompiled binary kernels from library (#23042)

  • opencl: allow loading binary kernel

  • opencl: add libdl.h

  • ggml-backend-dl is in ggml, which depends backend libs, thus
    ggml-opencl cannot depend on ggml-backend-dl

  • add libdl.h to break cyclic dep

  • opencl: allow loading bin kernel lib

  • opencl: load gemm_moe_mxfp4_f32_ns from kernel lib if available

  • opencl: load q8_0 gemm from kernel lib

  • opencl: load q4_0 moe gemm from kernel lib

  • opencl: load q4_1 moe gemm from kernel lib

  • opencl: load q4_k moe gemm from kernel lib

  • opencl: always declare get_adreno_bin_kernel_func_t

  • opencl: rephrase message

  • opencl: fix for rebase

  • opencl: update doc

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.