github ggml-org/llama.cpp b8100

latest releases: b8106, b8105, b8104...
10 hours ago
Details

model : full modern bert support (#18330)

  • full modern bert support

  • added gelu op in rank pooling for modern bert

  • still working on stuff, added mean calculation before classifier head

  • Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • first layer is dense, as per modern bert research paper

  • Update src/llama-graph.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • fixed set input for mean pooling to check if pooling type is ranking since modern bert does mean & rank

  • Update src/llama-graph.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com


Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.