github xorbitsai/inference v2.5.0


What's new in 2.5.0 (2026-04-12)

These are the changes in inference v2.5.0.

New features

Enhancements

  • ENH: update model "DeepSeek-OCR" JSON by @amumu96 in #4751
  • ENH: update 2 models JSON ("Ernie4.5", "qwen3.5") by @XprobeBot in #4754
  • ENH: update model "DeepSeek-V3.2" JSON by @amumu96 in #4762
  • ENH: update 2 models JSON ("Qwen3-ASR-0.6B", "Qwen3-ASR-1.7B") by @qinxuye in #4765
  • ENH: auto-detect PyTorch CUDA version for virtual environment setup by @qinxuye in #4766
  • ENH: update model "jina-embeddings-v4" JSON by @qinxuye in #4775
  • ENH: Optimize worker details for deployment progress tooltip by @leslie2046 in #4746
  • ENH: update model "qwen3.5" JSON by @llyycchhee in #4782
  • ENH: update 2 models JSON ("Kokoro-82M-v1.1-zh", "Kokoro-82M") by @qinxuye in #4795
  • ENH: update model "gemma-3-it" JSON by @qinxuye in #4794
  • ENH: update models JSON [llm] by @XprobeBot in #4796
  • ENH: add lightweight heartbeat mechanism for worker liveness detection by @qinxuye in #4785
  • ENH: update model "ChatTTS" JSON by @qinxuye in #4793
  • BLD: Fix the front-end UI access issue for aarch64 image by @zwt-1234 in #4743
  • BLD: Fix the front-end UI access issue for aarch64 image by @zwt-1234 in #4749
  • BLD: Fix the front-end UI access issue by @zwt-1234 in #4758
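
For readers unfamiliar with the idea behind the worker heartbeat entry (#4785), here is a minimal sketch of heartbeat-based liveness detection in general. It is not the code from that PR: the class name `HeartbeatMonitor`, its methods, and the 30-second default timeout are all hypothetical and chosen only for illustration. A supervisor records the last time each worker reported in, and treats a worker as dead once no heartbeat has arrived within the timeout.

```python
import time
from typing import Callable, Dict, List


class HeartbeatMonitor:
    """Hypothetical helper: tracks the last heartbeat time per worker.

    This is an illustrative sketch, not the Xinference implementation.
    """

    def __init__(
        self,
        timeout: float = 30.0,  # assumed value, not taken from the release
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._timeout = timeout
        self._clock = clock  # injectable so the logic can be tested without sleeping
        self._last_seen: Dict[str, float] = {}

    def beat(self, worker_id: str) -> None:
        """Record that a heartbeat from worker_id has just arrived."""
        self._last_seen[worker_id] = self._clock()

    def is_alive(self, worker_id: str) -> bool:
        """A worker is alive if it is known and its last beat is within the timeout."""
        last = self._last_seen.get(worker_id)
        return last is not None and self._clock() - last <= self._timeout

    def dead_workers(self) -> List[str]:
        """Return the sorted ids of known workers whose heartbeat has expired."""
        now = self._clock()
        return sorted(
            worker_id
            for worker_id, last in self._last_seen.items()
            if now - last > self._timeout
        )
```

The sketch uses a monotonic clock rather than wall-clock time so that system clock adjustments cannot make a healthy worker appear expired. Each worker would call `beat` periodically, and the supervisor would poll `dead_workers` to decide which workers to evict.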

Bug fixes

Documentation

Others

New Contributors

Full Changelog: v2.4.0...v2.5.0
