github xorbitsai/inference v2.3.0

6 hours ago

What's new in 2.3.0 (2026-03-13)

These are the changes in inference v2.3.0.

New features

Enhancements

Bug fixes

  • BUG: fix error WorkerWrapperBase.__init__() got multiple values for argument 'rpc_rank' by @llyycchhee in #4649
  • BUG: fix vLLM embedding check for qwen3-vl-embedding by @ace-xc in #4647
  • FIX: update the QR code URL by @yiboyasss in #4668
  • BUG: fix chat for multiple gpus by @llyycchhee in #4671
  • BUG: [UI] initialize formData with default values from modelFormConfig. by @yiboyasss in #4678
  • BUG: fix qwen 3.5 vllm since no generation_config.json exists by @llyycchhee in #4681

Documentation

New Contributors

Full Changelog: v2.2.0...v2.3.0

Don't miss a new inference release

NewReleases is sending notifications on new releases.