github xorbitsai/inference v1.3.1

one day ago

What's new in 1.3.1 (2025-03-09)

These are the changes in inference v1.3.1.

New features

Enhancements

Bug fixes

  • BUG: fix qwen2.5-vl-7b cannot chat bug by @amumu96 in #2944
  • BUG: Fix modelscope model id on Qwen2.5-VL Added support for AWQ quantization format in Qwen2.5-VL by @Jun-Howie in #2943
  • BUG: fix Error while using Langchain-chatchat, because the parameter [max_tokens] passed is None by @William533036 in #2962
  • BUG: using jina-clip-v2, no attribute error when only text of image pass in by @Minamiyama in #2974
  • BUG: fix compatibility of mlx-lm v0.21.5 by @qinxuye in #2993
  • BUG: Fix tokenizer error in create_embedding by @shuaiqidezhong in #2992
  • BUG: wrong kwargs passing to encode method when using jina-clip-v2 by @Minamiyama in #2991
  • BUG: [UI] fix the white screen bug. by @yiboyasss in #3014

New Contributors

Full Changelog: v1.3.0.post2...v1.3.1

Don't miss a new inference release

NewReleases is sending notifications on new releases.