github InternLM/lmdeploy v0.6.2
LMDeploy Release v0.6.2

latest releases: v0.11.1, v0.11.0, v0.10.2...
14 months ago

Highlights

  • PyTorch engine supports graph mode on ascend platform, doubling the inference speed
  • Support llama3.2-vision models in PyTorch engine
  • Support Mixtral in TurboMind engine, achieving 20+ RPS using SharedGPT dataset with 2 A100-80G GPUs

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.6.1...v0.6.2

Don't miss a new lmdeploy release

NewReleases is sending notifications on new releases.