Warning
Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
Details
arch: refactor LLM_TENSOR_NAMES (#18051)
-
arch: refactor LLM_TENSOR_NAMES
-
update docs
-
typo
-
fix LLM_ARCH_NEMOTRON_H_MOE
-
show more meaningful error message on missing tensor
-
fix and tested LLM_ARCH_NEMOTRON_H_MOE
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12)
- Windows x64 (CUDA 13)
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: