github ggml-org/llama.cpp b8589

one hour ago
opencl: add q4_K gemm and gemv kernels for Adreno (#20919)

  • opencl: add q4_K gemm and gemv kernels for Adreno

  • opencl: fix whitespace

  • opencl: add workarounds for compiler bugs on older devices

  • opencl: handle fp16 denorm on X Elite

  • opencl: fix kernel build error

  • opencl: fix whitespace

  • opencl: make q4_K cvt kernels signature consistent


Co-authored-by: Li He <lih@qti.qualcomm.com>
