xorbitsai/inference v0.15.4
on GitHub

latest releases: v0.16.2, v0.16.1, v0.16.0...

24 days ago

What's new in 0.15.4 (2024-10-12)

These are the changes in inference v0.15.4.

New features

FEAT: Llama 3.1 Instruct support tool call by @codingl2k1 in #2388
FEAT: qwen2.5 instruct tool call by @codingl2k1 in #2393
FEAT: add whisper-large-v3-turbo audio model by @hwzhuhao in #2409
FEAT: Add environment variable setting to increase the retry attempts after model download failures by @hwzhuhao in #2411
FEAT: support getting progress for image model by @qinxuye in #2395
FEAT: support qwenvl2 vllm engine by @amumu96 in #2428

Enhancements

ENH: Launch the ChatTTS model with kwargs by @codingl2k1 in #2425
REF: refactor controlnet for image model by @qinxuye in #2346

Bug fixes

BUG: Pin ChatTTS<0.2 by @codingl2k1 in #2419
BUG: tool call streaming output has duplicated list by @ChengjieLi28 in #2416

Full Changelog: v0.15.3...v0.15.4

Check out latest releases or
releases around xorbitsai/inference v0.15.4

Don't miss a new inference release

NewReleases is sending notifications on new releases.

Get notifications