What's new in 0.6.2 (2023-11-09)
These are the changes in inference v0.6.2.
New features
- FEAT: Support Yi Model by @ChengjieLi28 in #629
Enhancements
- ENH: cache status by @UranusSeven in #616
- ENH: Supports request limits for the model by @ChengjieLi28 in #596
- ENH: running model location & accelerators by @UranusSeven in #626
- ENH: Create completion restful api compatibility by @codingl2k1 in #622
Bug fixes
- BUG: Compatible with openai 1.1 by @codingl2k1 in #619
- BUG: fix spec decoding by @UranusSeven in #628
- BUG: "No slot available" error for embedding and LLM model on one card by @ChengjieLi28 in #611
- BUG: Rotating log does not create a new one when recreate the xinference cluster by @ChengjieLi28 in #618
Documentation
Full Changelog: v0.6.1...v0.6.2