github ggml-org/llama.cpp b7516

one month ago

model : fix div-by-zero for Nemotron V2 (#18309)

  • llama-model : fix Nemotron V2 crash by moving MoE parameters calculation

  • remove whitespace


Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

