github ggml-org/llama.cpp b8036

latest release: b8037
2 hours ago
Details

model: support GLM MoE DSA arch (NOTE: indexer is not yet supported) (#19460)

  • model: support GLM MoE DSA arch

  • working version

  • pyright

  • keep indexer tensors

  • add indexer gguf params

  • loaded now

  • Apply suggestions from code review

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • update

  • Update src/llama-model.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • minor fix and cleanup

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.