What's new in 1.3.1 (2025-03-09)
These are the changes in inference v1.3.1.
New features
- FEAT: Support qwen2.5-instruct-1m by @Jun-Howie in #2928
- FEAT: Support moonlight-16b-a3b by @Jun-Howie in #2963
- FEAT: create_embedding add field model_replica by @zhoudelong in #2779
- FEAT: [UI] add the reasoning_content parameter. by @yiboyasss in #2980
- FEAT: Support QwQ-32B by @cyhasuka in #3005
- FEAT: all engine support reasoning_content by @amumu96 in #3013
Enhancements
- ENH: InternVL2.5-MPO by @Minamiyama in #2913
- ENH: [UI] add copy button by @Minamiyama in #2920
- ENH: [UI] add model ability filtering feature to the audio model. by @yiboyasss in #2986
- ENH: Support xllamacpp by @codingl2k1 in #2997
- BLD: Install ffmpeg 6 for audio & video models by @phuchoang2603 in #2946
- BLD: fix ffprobe library not imported by @phuchoang2603 in #2971
- BLD: fix docker requirements for sglang by @qinxuye in #3015
- REF: [UI] move featureModels to data.js by @yiboyasss in #3008
Bug fixes
- BUG: fix qwen2.5-vl-7b cannot chat bug by @amumu96 in #2944
- BUG: Fix modelscope model id on Qwen2.5-VL Added support for AWQ quantization format in Qwen2.5-VL by @Jun-Howie in #2943
- BUG: fix Error while using Langchain-chatchat, because the parameter [max_tokens] passed is None by @William533036 in #2962
- BUG: using jina-clip-v2, no attribute error when only text of image pass in by @Minamiyama in #2974
- BUG: fix compatibility of mlx-lm v0.21.5 by @qinxuye in #2993
- BUG: Fix tokenizer error in create_embedding by @shuaiqidezhong in #2992
- BUG: wrong kwargs passing to encode method when using jina-clip-v2 by @Minamiyama in #2991
- BUG: [UI] fix the white screen bug. by @yiboyasss in #3014
New Contributors
- @phuchoang2603 made their first contribution in #2946
- @William533036 made their first contribution in #2962
- @zhoudelong made their first contribution in #2779
Full Changelog: v1.3.0.post2...v1.3.1