What's new in 1.3.0 (2025-02-21)
These are the changes in inference v1.3.0.
New features
- FEAT: Support qwen-2.5-instruct-1m by @Jun-Howie in #2841
- FEAT: support deepseek-v3 and deepseek-r1 by @qinxuye in #2864
- FEAT: [UI] additional parameter tip function. by @yiboyasss in #2876
- FEAT: [UI] add featured models filtering function. by @yiboyasss in #2871
- FEAT: [UI] support form parameters and command line conversion. by @yiboyasss in #2850
- FEAT: support distributed inference for sglang by @qinxuye in #2877
- FEAT: [UI] add n_worker parameter for model launch. by @yiboyasss in #2889
- FEAT: InternVL 2.5 by @Minamiyama in #2776
- FEAT: support vllm reasoning content by @amumu96 in #2905
Enhancements
- enh: add gpu utilization info by @amumu96 in #2852
- ENH: Update Kokoro model by @codingl2k1 in #2843
- ENH: cmdline supports --n-worker, add --model-path and make it compatible with --model_path by @qinxuye in #2890
- BLD: update sglang to v0.4.2.post4 and vllm to v0.7.2 by @qinxuye in #2838
- BLD: fix flashinfer installation in dockerfile by @qinxuye in #2844
Bug fixes
- BUG: Fix whisper CI by @codingl2k1 in #2822
- BUG: fix FLUX when a scheduler is specified which is incompatible. by @shuaiqidezhong in #2897
- BUG: [UI] fix the bug of missing hint during model running. by @yiboyasss in #2904
- BUG: Clear dependency by @codingl2k1 in #2910
Tests
- TST: Pin CI transformers<4.49 by @codingl2k1 in #2883
Documentation
Others
- CHORE: Xavier now supports
vLLM >= 0.7.0
, drops support for older versions by @ChengjieLi28 in #2886
New Contributors
- @shuaiqidezhong made their first contribution in #2897
Full Changelog: v1.2.2...v1.3.0