github xorbitsai/inference v1.3.0.post1

latest release: v1.3.0.post2
one day ago

What's new in 1.3.0.post1 (2025-02-21)

These are the changes in inference v1.3.0.post1.

New features

Enhancements

  • enh: add gpu utilization info by @amumu96 in #2852
  • ENH: Update Kokoro model by @codingl2k1 in #2843
  • ENH: cmdline supports --n-worker, add --model-path and make it compatible with --model_path by @qinxuye in #2890
  • BLD: update sglang to v0.4.2.post4 and vllm to v0.7.2 by @qinxuye in #2838
  • BLD: fix flashinfer installation in dockerfile by @qinxuye in #2844

Bug fixes

Tests

Documentation

Others

  • CHORE: Xavier now supports vLLM >= 0.7.0, drops support for older versions by @ChengjieLi28 in #2886

New Contributors

Full Changelog: v1.2.2...v1.3.0.post1

Don't miss a new inference release

NewReleases is sending notifications on new releases.