github ggml-org/llama.cpp b7605

latest releases: b7609, b7608, b7607...
16 hours ago
Details

model : add support for JinaBertModel with non-gated ffn (#18475)

  • WIP: Initial commit for fixing JinaBert original FF type support

  • convert: add jina-v2-de tokenizer variant for German_Semantic_V3

  • convert: fix token collision in BERT phantom vocab conversion

  • convert: add feed_forward_type metadata

  • model: add feed_forward_type metadata for jina-bert-v2

  • model: jina-bert-v2 support standard GELU FFN variant

  • model: remove ffn_type, detect FFN variant from tensor dimensions

  • Update src/llama-model.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • Update src/llama-model.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • Update src/models/bert.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • Update src/models/bert.cpp

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • revert collision fix to be handled in separate PR

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.