What's new in 0.14.1 (2024-08-09)
These are the changes in inference v0.14.1.
New features
- FEAT: support SenseVoice audio-to-text model by @qinxuye in #2008
- FEAT: support flux.1-schnell & flux.1-dev by @qinxuye in #2007
- FEAT: support kolors image model by @qinxuye in #2028
- FEAT: Add support for llama-3.1-instruct 405B model by @frostyplanet in #2025
- FEAT: Support CogVideoX video model by @codingl2k1 in #2049
- FEAT: Support MiniCPM-v-2_6 by @Minamiyama in #2031
Enhancements
- ENH: Improve internal server error by @codingl2k1 in #2009
- ENH: Add
stream
option in Benchmark by @Dawnfz-Lenfeng in #2038 - ENH: optimize availability of vLLM by @qinxuye in #2046
- ENH: [worker] Allow init supervisor_ref lazy by @frostyplanet in #1958
- ENH: optimize performance of sglang by @qinxuye in #2050
- REF: Mark
Deprecate
forprompt
,system_prompt
andchat_history
parameters inchat
client interface by @ChengjieLi28 in #2043
Bug fixes
- BUG: fix flexible model register in worker by @frostyplanet in #2011
- BUG: [UI] Fix the 'model_path' bug. by @yiboyasss in #2015
- BUG: fix custom embedding launch error by @amumu96 in #2016
Tests
- TST: Fix some dependency version issues by @ChengjieLi28 in #2042
Documentation
- DOC: Directly launch custom model by
model_path
by @ChengjieLi28 in #2047 - DOC: fix typo in README by @ArtificialZeng in #2048
Others
- CHORE: Increased frequency of issue processing by @ChengjieLi28 in #2024
New Contributors
- @ArtificialZeng made their first contribution in #2048
- @Dawnfz-Lenfeng made their first contribution in #2038
Full Changelog: v0.14.0...v0.14.1