What's new in 0.15.1 (2024-09-14)
These are the changes in inference v0.15.1.
New features
- FEAT: Support qwen2-vl-instruct GPTQ format and AWQ format by @Jun-Howie in #2251
- FEAT: Support minicpm-4B by @Jun-Howie in #2263
- FEAT: support sdapi/txt2img by @qinxuye in #2248
- FEAT: [UI] Auto-fill chat_template parameter on registration page. by @yiboyasss in #2268
- FEAT: support sdapi/sd-models and sdapi/samplers by @qinxuye in #2288
- FEAT: support deepseek-v2 and 2.5 by @amumu96 in #2292
- FEAT: Update Qwen2-VL-Model to support flash_attention_2 implementation by @LaureatePoet in #2289
- FEAT: support sdapi/img2img by @qinxuye in #2293
- FEAT: support flux.1 image2image and inpainting by @qinxuye in #2296
- FEAT: Support yi-coder-chat by @Jun-Howie in #2302
- FEAT: qwen2 audio by @codingl2k1 in #2271
Enhancements
- ENH: Update CosyVoice Huggingface by @codingl2k1 in #2249
- ENH: Supports multi functions in tool call for qwen2 by @ChengjieLi28 in #2265
- ENH: add
print-error
option in benchmark by @Dawnfz-Lenfeng in #2283 - ENH: Support fish speech 1.4 by @codingl2k1 in #2295
Bug fixes
- BUG: tts stream mode not working by @leslie2046 in #2279
- BUG: fix issue with model launch failing when .safetensors file is missing (#2094) by @Charmnut in #2290
- BUG: fix sampler_name for img2img by @qinxuye in #2301
- BUG: modify vllm image version by @amumu96 in #2311
- Bug: modify vllm image version by @amumu96 in #2312
Documentation
New Contributors
- @Jun-Howie made their first contribution in #2251
- @leslie2046 made their first contribution in #2279
- @Charmnut made their first contribution in #2290
- @LaureatePoet made their first contribution in #2289
Full Changelog: v0.15.0...v0.15.1