This post release contains two bug fix for memory leak and model accuracy
- Fix Memory Leak in
_cached_reqs_data
(#17567) - Fix sliding window attention in V1 giving incorrect results (#17574)
Full Changelog: v0.8.5...v0.8.5.post1
This post release contains two bug fix for memory leak and model accuracy
_cached_reqs_data
(#17567)
Full Changelog: v0.8.5...v0.8.5.post1
Don't miss a new vllm release
NewReleases is sending notifications on new releases.