github xorbitsai/inference v0.14.2

latest releases: v0.16.3, v0.16.2, v0.16.1...
2 months ago

What's new in 0.14.2 (2024-08-16)

These are the changes in inference v0.14.2.

New features

  • FEAT: add gemma-2-it 2b & internlm2.5-chat 1.8b and 20b & update video and sglang docs by @qinxuye in #2080
  • FEAT: support FP8 for vllm & sglang engine by @qinxuye in #2069
  • Feat: Support internvl2 and internvl stream by @amumu96 in #2079

Enhancements

Bug fixes

Documentation

  • DOC: update readme & add tips for large image models by @qinxuye in #2056

New Contributors

Full Changelog: v0.14.1...v0.14.2

Don't miss a new inference release

NewReleases is sending notifications on new releases.