github ggml-org/llama.cpp b8070

one month ago

models : deduplicate delta-net graphs for Qwen family (#19597)

  • models : add llm_build_delta_net_base

  • cont : keep qwen35 and qwen35moe graphs intact

  • cont : add comments

macOS/iOS:

Linux:

Windows:

openEuler:
