github xorbitsai/inference v0.15.4

What's new in 0.15.4 (2024-10-12)

These are the changes in inference v0.15.4.

New features

  • FEAT: Llama 3.1 Instruct support tool call by @codingl2k1 in #2388
  • FEAT: qwen2.5 instruct tool call by @codingl2k1 in #2393
  • FEAT: add whisper-large-v3-turbo audio model by @hwzhuhao in #2409
  • FEAT: Add environment variable setting to increase the retry attempts after model download failures by @hwzhuhao in #2411
  • FEAT: support getting progress for image model by @qinxuye in #2395
  • FEAT: support qwenvl2 vllm engine by @amumu96 in #2428

Enhancements

Bug fixes

Full Changelog: v0.15.3...v0.15.4
