What's new in 2.3.0 (2026-03-13)
These are the changes in inference v2.3.0.
New features
- FEAT: support qwen-3.5 for vllm by @llyycchhee in #4656
- FEAT: add seed and repeptition_penalty parameter for precision test by @ZhikaiGuo960110 in #4684
- FEAT: [model] qwen2-audio removed by @ZhikaiGuo960110 in #4683
Enhancements
- ENH: update 2 models JSON ("qwen3.5", "glm-5") by @llyycchhee in #4655
- ENH: update model "MiniMax-M2.5" JSON by @llyycchhee in #4663
- ENH: update model "qwen3.5" JSON by @llyycchhee in #4661
- ENH: update model "qwen3.5" JSON by @Jun-Howie in #4672
- ENH: update 2 models JSON ("glm-5", "Kimi-K2.5") by @llyycchhee in #4662
- ENH: update models JSON [llm] by @XprobeBot in #4682
- ENH: support transformers for qwen 3.5 by @llyycchhee in #4685
- ENH: update models JSON [llm] by @XprobeBot in #4686
- BLD: [CI] fix windows runner SSL can't found by @llyycchhee in #4627
- REF: Implement REST API dependency injection and response handling by @amumu96 in #4620
- REF: extract require_model utility to reduce code duplication by @amumu96 in #4677
Bug fixes
- BUG: fix error
WorkerWrapperBase.__init__() got multiple values for argument 'rpc_rank'by @llyycchhee in #4649 - BUG: fix vLLM embedding check for qwen3-vl-embedding by @ace-xc in #4647
- FIX: update the QR code URL by @yiboyasss in #4668
- BUG: fix chat for multiple gpus by @llyycchhee in #4671
- BUG: [UI] initialize formData with default values from modelFormConfig. by @yiboyasss in #4678
- BUG: fix qwen 3.5 vllm since no generation_config.json exists by @llyycchhee in #4681
Documentation
- DOC: add v2.2.0 release notes by @qinxuye in #4643
- DOC: add missing periods in docstrings by @Jah-yee in #4669
New Contributors
Full Changelog: v2.2.0...v2.3.0