Warning
Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
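For deployment scripts that need to keep working across the transition, one option is to pick the extraction routine from the archive's file extension. The following is a minimal sketch, not part of the release tooling: the function name `extract_release` and the file names in the usage note are hypothetical, and it assumes the archive has already been downloaded from a trusted source.

```python
import tarfile
import zipfile
from pathlib import Path


def extract_release(archive: Path, dest: Path) -> None:
    """Extract a release archive, accepting either .zip or .tar.gz.

    `extract_release` is a hypothetical helper for illustration only.
    """
    name = archive.name.lower()
    is_tar = name.endswith((".tar.gz", ".tgz"))
    is_zip = name.endswith(".zip")
    if not (is_tar or is_zip):
        raise ValueError(f"unsupported archive format: {archive.name}")

    dest.mkdir(parents=True, exist_ok=True)
    if is_tar:
        with tarfile.open(archive, "r:gz") as tf:
            # Use the "data" extraction filter when this Python provides it;
            # it rejects absolute paths and links that point outside dest.
            if hasattr(tarfile, "data_filter"):
                tf.extractall(dest, filter="data")
            else:
                tf.extractall(dest)
    else:
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
```

A script would call it as `extract_release(Path("release.tar.gz"), Path("bin"))`, where both paths are placeholders. Dispatching on the extension keeps the same script usable before and after the format switch.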
server: fix duplicate HTTP headers in multiple models mode (#17698)
- llama-server: fix duplicate HTTP headers in multiple models mode (#17693)
- llama-server: address review feedback from ngxson
  - restrict scope of header after std::move
  - simplify header check (remove unordered_set)
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA)
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: