xorbitsai/inference v0.6.2
on GitHub

What's new in 0.6.2 (2023-11-09)

These are the changes in inference v0.6.2.

New features

FEAT: Support Yi Model by @ChengjieLi28 in #629

Enhancements

ENH: cache status by @UranusSeven in #616
ENH: Support request limits for the model by @ChengjieLi28 in #596
ENH: running model location & accelerators by @UranusSeven in #626
ENH: Create completion RESTful API compatibility by @codingl2k1 in #622

Bug fixes

BUG: Compatible with openai 1.1 by @codingl2k1 in #619
BUG: fix spec decoding by @UranusSeven in #628
BUG: No slot available error for embedding and LLM model on one card by @ChengjieLi28 in #611
BUG: Rotating log does not create a new file when the xinference cluster is recreated by @ChengjieLi28 in #618

Documentation

DOC: Change links for some tutorials by @onesuper in #617

Full Changelog: v0.6.1...v0.6.2
