xorbitsai/inference v0.14.1
on GitHub

latest releases: v0.16.3, v0.16.2, v0.16.1...

3 months ago

What's new in 0.14.1 (2024-08-09)

These are the changes in inference v0.14.1.

New features

FEAT: support SenseVoice audio-to-text model by @qinxuye in #2008
FEAT: support flux.1-schnell & flux.1-dev by @qinxuye in #2007
FEAT: support kolors image model by @qinxuye in #2028
FEAT: Add support for llama-3.1-instruct 405B model by @frostyplanet in #2025
FEAT: Support CogVideoX video model by @codingl2k1 in #2049
FEAT: Support MiniCPM-v-2_6 by @Minamiyama in #2031

Enhancements

ENH: Improve internal server error by @codingl2k1 in #2009
ENH: Add stream option in Benchmark by @Dawnfz-Lenfeng in #2038
ENH: optimize availability of vLLM by @qinxuye in #2046
ENH: [worker] Allow init supervisor_ref lazy by @frostyplanet in #1958
ENH: optimize performance of sglang by @qinxuye in #2050
REF: Mark Deprecate for prompt, system_prompt and chat_history parameters in chat client interface by @ChengjieLi28 in #2043

Bug fixes

BUG: fix flexible model register in worker by @frostyplanet in #2011
BUG: [UI] Fix the 'model_path' bug. by @yiboyasss in #2015
BUG: fix custom embedding launch error by @amumu96 in #2016

Tests

TST: Fix some dependency version issues by @ChengjieLi28 in #2042

Documentation

DOC: Directly launch custom model by model_path by @ChengjieLi28 in #2047
DOC: fix typo in README by @ArtificialZeng in #2048

Others

CHORE: Increased frequency of issue processing by @ChengjieLi28 in #2024

New Contributors

@ArtificialZeng made their first contribution in #2048
@Dawnfz-Lenfeng made their first contribution in #2038

Full Changelog: v0.14.0...v0.14.1

Check out latest releases or
releases around xorbitsai/inference v0.14.1

Don't miss a new inference release

NewReleases is sending notifications on new releases.

Get notifications