github xorbitsai/inference v1.6.0

What's new in 1.6.0 (2025-05-16)

These are the changes in inference v1.6.0.

New features

Enhancements

Bug fixes

  • BUG: fix qwen3 235b spec by @qinxuye in #3375
  • BUG: fix incomplete parsing of reasoning content in reasoning_parser by @amumu96 in #3391
  • BUG: fix the processing logic for inference content parsing and tool calls by @amumu96 in #3394
  • BUG: fix stop word handling logic in vllm model generation configuration by @amumu96 in #3414
  • BUG: fix Model._get_full_prompt() takes 3 positional arguments but 4 were given by @qinxuye in #3417
  • BUG: fix potential stop hang by @qinxuye in #3434
  • BUG: [UI] Added cpu_offload parameter to video model and fixed bug in audio model's filtering function. by @yiboyasss in #3461

New Contributors

Full Changelog: v1.5.1...v1.6.0
