Details
context : reserve new scheduler when graph topology changes (#18547)
-
context : reserve new scheduler when graph topology changes
-
cont : fix
-
cont : fix reserve
-
cont : reserve only when changes occur + timing
-
context : add comments
-
llama : reserve on sampler changes
-
common : allow null common_sampler
-
server : task declares needs (embd, logits, sampling)
-
server : do not init sampler if not needed
-
llama : fix need_reserve when unsetting a sampler
-
server : consolidate slot reset/clear logic
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: