Details
model: support GLM MoE DSA arch (NOTE: indexer is not yet supported) (#19460)
-
model: support GLM MoE DSA arch
-
working version
-
pyright
-
keep indexer tensors
-
add indexer gguf params
-
loaded now
-
Apply suggestions from code review
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
-
update
-
Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
- minor fix and cleanup
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: