github vllm-project/vllm v0.2.5

latest releases: v0.6.3.post1, v0.6.3, v0.6.2...
11 months ago

Major changes

  • Optimize Mixtral performance with expert parallelism (thanks to @Yard1)
  • [BugFix] Fix input positions for long context with sliding window

What's Changed

Full Changelog: v0.2.4...v0.2.5

Don't miss a new vllm release

NewReleases is sending notifications on new releases.