What's new in 1.7.0 (2025-06-13)
These are the changes in inference v1.7.0.
New features
- FEAT: support CogView4 image model by @qinxuye in #3557
- FEAT: [UI] support model_ability filter for image and video models. by @yiboyasss in #3563
- FEAT: [UI] auto-switch to active tab when Running Models page loads. by @yiboyasss in #3568
- FEAT: support first-last-frame to video by @qinxuye in #3555
- FEAT: [UI] add Japanese and Korean language support. by @yiboyasss in #3574
- FEAT: SeACoParaformer model by @leslie2046 in #3587
- FEAT: support verbose_json for funasr family audio2text models by @leslie2046 in #3591
- FEAT: support deepseek-r1-0528 Mixed quantization by @Jun-Howie in #3601
- FEAT: support engines for embedding models by @pengjunfeng11 in #2791
- FEAT: support MiniCPM4 Series by @Jun-Howie in #3609
- FEAT: [UI] add model_engine parameter to embedding model. by @yiboyasss in #3617
- FEAT: add kwargs for transcripts client API by @leslie2046 in #3622
- FEAT: support qwen3 embedding by @qinxuye in #3615
- FEAT: support qwen3-reranker by @qinxuye in #3627
Enhancements
- ENH: Support pcm response_format by @codingl2k1 in #3606
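
PCM audio is, in general, raw sample data with no container header, so a client receiving it has to know the sample rate, channel count, and sample width out of band. The following is a minimal sketch of wrapping such bytes in a WAV container with Python's standard library so that ordinary players can open them. `pcm_to_wav` is a hypothetical helper, and the 24 kHz, mono, 16-bit defaults are assumptions rather than values confirmed for this release; check the model's actual output format before relying on them.

```python
import io
import wave


def pcm_to_wav(
    pcm: bytes,
    sample_rate: int = 24000,  # assumed; not confirmed by the release notes
    channels: int = 1,         # assumed mono
    sample_width: int = 2,     # assumed 16-bit samples (2 bytes each)
) -> bytes:
    """Wrap headerless PCM samples in a WAV container and return the bytes."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()


if __name__ == "__main__":
    # 100 frames of silence stand in for bytes returned by a speech request.
    silence = b"\x00\x00" * 100
    with open("speech.wav", "wb") as f:
        f.write(pcm_to_wav(silence))
```

If the assumed parameters do not match the real stream, the audio will play at the wrong speed or as noise, which is usually the quickest sign that they need adjusting.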
Bug fixes
- BUG: Fix dependency by @codingl2k1 in #3566
- BUG: Fix cmdline by @codingl2k1 in #3589
- BUG: fix potential hang for sglang by @qinxuye in #3597
- BUG: [UI] fix the mobile language-switching bug. by @yiboyasss in #3608
- BUG: Fix the error when using Qwen function call with Spring AI. by @aniya105 in #3614
Documentation
- DOC: update links by @qinxuye in #3565
- DOC: Update CosyVoice doc by @codingl2k1 in #3605
- DOC: update models by @qinxuye in #3628
Others
- FIX: [UI] fix model_engine parameter bug. by @yiboyasss in #3620
New Contributors
Full Changelog: v1.6.1...v1.7.0