Details
model : add support for JinaBertModel with non-gated ffn (#18475)
-
WIP: Initial commit for fixing JinaBert original FF type support
-
convert: add jina-v2-de tokenizer variant for German_Semantic_V3
-
convert: fix token collision in BERT phantom vocab conversion
-
convert: add feed_forward_type metadata
-
model: add feed_forward_type metadata for jina-bert-v2
-
model: jina-bert-v2 support standard GELU FFN variant
-
model: remove ffn_type, detect FFN variant from tensor dimensions
-
Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
- Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
- Update src/models/bert.cpp
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
- Update src/models/bert.cpp
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
- revert collision fix to be handled in separate PR
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: