github ggml-org/llama.cpp b9495

7 hours ago

qwen35: use post-norm hidden state for MTP (#24025)

  • qwen35: use post-norm hidden state for MTP

  • rename pre_norm to nextn

  • fix step35

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:
