github vllm-project/vllm v0.1.5

latest releases: v0.6.3.post1, v0.6.3, v0.6.2...
14 months ago

Major Changes

  • Align beam search with hf_model.generate.
  • Stablelize AsyncLLMEngine with a background engine loop.
  • Add support for CodeLLaMA.
  • Add many model correctness tests.
  • Many other correctness fixes.

What's Changed

New Contributors

Full Changelog: v0.1.4...v0.1.5

Don't miss a new vllm release

NewReleases is sending notifications on new releases.