Details
server : support multi-modal context checkpoints (#19849)
-
Modify llama-memory-hybrid-iswa.cpp
-
Modify llama-memory-recurrent.cpp
-
Modify server-common.cpp
-
Modify server-common.h
-
Modify server-context.cpp
-
Modify server-task.h
-
Added comment to llama-memory-hybrid-iswa.cpp
-
Remove comment from server-context.cpp
-
Stylistic fix server-context.cpp
-
Fix an issue when seqrm isn't called in server-context.cpp
-
cont : alternative impl
-
cont : cleanup
-
cont : n_tokens -> int64_t
Co-authored-by: timkhronos timkhronos@gmail.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: