llama: fix llama-model-saver (#20503)
- llama : add fd-based model loading via llama_model_load_from_fd
- llama : address review feedback for fd-based model loading
- llama : use FILE pointer instead of fd in public API
- llama : use FILE pointer consistently, address review feedback
- fixup
- fix tensor names
- fix llama-model-saver
- roundtrip tests
- fixup
- refactor tests
- fix prints
- fix model saving
- fix CI, disable Chameleon
- print seed
Co-authored-by: Siddhesh2377 <siddheshsonar2377@gmail.com>
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: