vllm v0.2.3 (PyPI)

11 months ago

Major changes

  • Refactor Worker, InputMetadata, and Attention
  • Fix TP support for AWQ models
  • Support Prometheus metrics
  • Fix Baichuan & Baichuan 2
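As a rough illustration of what consuming Prometheus metrics can look like, the sketch below parses a payload in the Prometheus text exposition format into a dictionary. It is a minimal sketch, not vLLM's own code: the function name parse_prometheus_text, the SAMPLE payload, and the demo_* metric names are all hypothetical, and the release notes do not say which metrics vLLM exports or where it serves them.

```python
from typing import Dict


def parse_prometheus_text(text: str) -> Dict[str, float]:
    """Parse Prometheus text exposition lines into {sample name (with labels): value}."""
    samples: Dict[str, float] = {}
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        # Assumption: no trailing timestamps, so the value is the last token.
        name, _, value = line.rpartition(" ")
        samples[name.strip()] = float(value)
    return samples


# Hypothetical payload; the metric names are illustrative, not vLLM's.
SAMPLE = """\
# HELP demo_prompt_tokens_per_second Illustrative gauge.
# TYPE demo_prompt_tokens_per_second gauge
demo_prompt_tokens_per_second{model="demo"} 123.5
demo_running_requests 4
"""

metrics = parse_prometheus_text(SAMPLE)
for name, value in metrics.items():
    print(f"{name} = {value}")
```

In practice a Prometheus server scrapes and parses this format itself; a hand-rolled parser like this is mainly useful for quick checks or tests.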

Full Changelog: v0.2.2...v0.2.3
