github ggml-org/llama.cpp b8164

latest releases: b8172, b8171, b8170...
9 hours ago
Details

llama: Add option to merge gate and exp weights (#19139)

  • llama: Add option to merge gate and exp weights

  • Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • update constants.py

  • add gate_up for the all MoE models

  • convert: simplify merge tensor condition

  • update constants.py

  • reduce number of models, add create_tensor_gate_up helper


Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.