Details
model : full modern bert support (#18330)
-
full modern bert support
-
added gelu op in rank pooling for modern bert
-
still working on stuff, added mean calculation before classifier head
-
Update convert_hf_to_gguf.py
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
-
first layer is dense, as per modern bert research paper
-
Update src/llama-graph.cpp
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
-
fixed set input for mean pooling to check if pooling type is ranking since modern bert does mean & rank
-
Update src/llama-graph.cpp
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
- Update convert_hf_to_gguf.py
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: