github ggml-org/llama.cpp b7379

3 months ago

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
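One way to prepare a deployment script for this change is to accept both archive formats during the transition. The sketch below is only an illustration under stated assumptions: the helper name `extract_release` and any archive file names are hypothetical, and it assumes the archive has already been downloaded to local disk. It uses only the Python standard library.

```python
import tarfile
import zipfile
from pathlib import Path


def extract_release(archive: Path, dest: Path) -> None:
    """Extract a downloaded release archive, accepting .zip or .tar.gz.

    Hypothetical helper: the name and signature are illustrative only.
    """
    dest.mkdir(parents=True, exist_ok=True)
    name = archive.name.lower()
    if name.endswith(".zip"):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
    elif name.endswith((".tar.gz", ".tgz")):
        with tarfile.open(archive, "r:gz") as tf:
            # Use the safer "data" extraction filter where this Python
            # version provides it; otherwise fall back to the default.
            if hasattr(tarfile, "data_filter"):
                tf.extractall(dest, filter="data")
            else:
                tf.extractall(dest)
    else:
        raise ValueError(f"unsupported archive format: {archive.name}")
```

Dispatching on the file extension keeps an existing script working with today's .zip assets and with the .tar.gz assets once they appear, so the switch does not need to be timed to a specific release.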

vulkan: Allow non-pow2 n_experts in topk_moe (#17872)

Platforms: macOS/iOS, Linux, Windows, openEuler
