model : support Step3.5-Flash (#19283)
- Support Step3.5-Flash
- fix: norm.weight + 1 (HF zero_centered=true)
- step35: simplify GGUF conversion + drop redundant rope KVs
- Address review feedback
- rename limits -> clamp
- Apply suggestions from code review
  Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
- Apply suggestion from @CISC
  Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
- rename swiglu limits -> swiglu clamp in LLM_KV
- avoid CI fail
- Apply suggestions from code review
- Apply suggestions from code review
- disabled KV shifting for LLM_ARCH_STEP35
- Apply suggestions from code review
- mistakenly removed cmath
- add model size && apply missed suggestion
- assert partial_rotary_factors
- fix CI errors:
- load freq_base_swa

Co-authored-by: lvyichen lvyichen@stepfun.com
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: