[Model] Qwen3.5 dense and MoE support (no vision) (#19435)
- Unified delta net handling
- Remove old methods.
- Refactor and optimize
- Adapt autoregressive version from @ymcki
- Change to decay mask approach
- Fix bad permute
- Qwen 3.5 support
- Apply suggestions from code review
  Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
- Further fixes
- Use inheritance, remove unneeded conts
- Not like this!
- Remove ggml.h explicit import
- Remove transformers, fix the views
- ACTUALLY fix views, make super calls explicit in conversion.
- Fix conversion again
- Remove extra ggml.h imports
  Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: