What's new in 1.15.0 (2025-12-13)
These are the changes in inference v1.15.0.
New features
- FEAT: add more detailed instructions for engine unavailability by @OliverBryant in #4308
- FEAT: [model] Z-Image-Turbo support by @OliverBryant in #4333
- FEAT: [model] DeepSeek-V3.2 support by @Jun-Howie in #4344
- FEAT: [model] PaddleOCR-VL support by @leslie2046 in #4354
- FEAT: add llama_cpp json schema output by @OliverBryant in #4282
- FEAT: PaddleOCR-VL implementation by @leslie2046 in #4304
- FEAT: multi replicas on a single GPU && add launch strategy by @OliverBryant in #4358
Enhancements
- ENH: update models JSON [llm] by @XprobeBot in #4343
- ENH: update model "MiniMax-M2" JSON by @XprobeBot in #4342
- ENH: update models JSON [llm] by @XprobeBot in #4349
- ENH: support launching with --device cpu by @hubutui in #4352
- ENH: add glm-4.5 tool calls support && vllm StructuredOutputsParams support by @OliverBryant in #4357
Bug fixes
- BUG: fix manage cache models missing by @OliverBryant in #4329
- BUG: [llm, vllm]: support ignore eos by @ZhikaiGuo960110 in #4332
- BUG: Multimodal settings for video parameters are not taking effect. by @OliverBryant in #4338
- BUG: Soft links cannot be completely deleted by @OliverBryant in #4337
- BUG: Packages with identical names in virtual environments error by @OliverBryant in #4348
- BUG: Fix typo in xinference/deploy/docker/Dockerfile.cu128 by @hubutui in #4350
- BUG: custom embedding model registration fails by @OliverBryant in #4335
- BUG: [UI] fix the bug in the copy function. by @yiboyasss in #4355
- BUG: [UI] control Select dropdown width to prevent it from becoming too wide. by @yiboyasss in #4356
Documentation
Others
- Fixed workflow vulnerability by @barakharyati in #4328
- CHORE: add i18n for replica details by @leslie2046 in #4306
New Contributors
- @barakharyati made their first contribution in #4328
- @ZhikaiGuo960110 made their first contribution in #4332
- @hubutui made their first contribution in #4350
Full Changelog: v1.14.0...v1.15.0