ggml-org/llama.cpp b7673
on GitHub


6 months ago

Details

metal : add MoE kernel specialization for ne20=5 (#18667)

Add template specialization for kernel_mul_mm_id_map0 with ne20=5
to support models using 5 active experts (e.g., VAETKI).
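
To illustrate why a new value of ne20 needs its own specialization, here is a minimal CPU-side sketch in plain C++. It is not the Metal source: the function names (map_tokens_to_experts, dispatch_map), the data layout, and the list of other supported values are hypothetical. It also assumes, as the kernel name suggests, that the map step groups token slots by their selected expert. The point is only that when the number of active experts is a compile-time template parameter, each supported value must be instantiated explicitly, and a runtime value with no matching instantiation cannot be dispatched.

```cpp
#include <cassert>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical analogue of a kernel templated on the number of active
// experts per token (NE20). Not the actual llama.cpp Metal code.
template <int NE20>
std::vector<std::vector<int32_t>> map_tokens_to_experts(
        const std::vector<int32_t> & ids, // n_tokens * NE20 expert ids, row-major
        int n_experts) {
    std::vector<std::vector<int32_t>> per_expert(n_experts);
    const int n_tokens = static_cast<int>(ids.size()) / NE20;
    for (int t = 0; t < n_tokens; ++t) {
        // NE20 is a compile-time constant, so this loop has a fixed trip count.
        for (int k = 0; k < NE20; ++k) {
            const int32_t e = ids[t * NE20 + k];
            assert(e >= 0 && e < n_experts);
            per_expert[e].push_back(t * NE20 + k); // record the flat slot index
        }
    }
    return per_expert;
}

// Runtime dispatch: only the values listed here have a compiled variant.
// The cases other than 5 are illustrative, not the real supported set.
std::vector<std::vector<int32_t>> dispatch_map(
        int ne20, const std::vector<int32_t> & ids, int n_experts) {
    switch (ne20) {
        case 1: return map_tokens_to_experts<1>(ids, n_experts);
        case 2: return map_tokens_to_experts<2>(ids, n_experts);
        case 4: return map_tokens_to_experts<4>(ids, n_experts);
        case 5: return map_tokens_to_experts<5>(ids, n_experts); // newly added case
        default: throw std::invalid_argument("no specialization for this ne20");
    }
}
```

In this sketch, a model that routes each token to 5 experts would hit the default branch until the case for 5 is added, which mirrors the kind of gap this change closes.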

