What's new in 0.6.3 (2023-11-16)
These are the changes in inference v0.6.3.
New features
- FEAT: qwen-chat-14b by @UranusSeven in #494
- FEAT: Support GPTQ quantization by @codingl2k1 in #645
Bug fixes
- BUG: Fix slow RESTful API serialization by @codingl2k1 in #648
Tests
- TST: disable test_is_self_hosted by @UranusSeven in #641
Documentation
- DOC: About Logging in Xinference by @ChengjieLi28 in #631
- DOC: Init for Chinese doc by @ChengjieLi28 in #565
Full Changelog: v0.6.2...v0.6.3