xorbitsai/inference v0.1.2
on GitHub

latest releases: v0.16.3, v0.16.2, v0.16.1...

15 months ago

What's new in 0.1.2 (2023-08-04)

These are the changes in inference v0.1.2.

New features

FEAT: custom model by @UranusSeven in #290

Enhancements

ENH: select q4_0 as default quantization method for ggmlv3 model in benchmark by @pangyoki in #293
ENH: disable gradio telemetry by @UranusSeven in #299

Bug fixes

BUG: llm_family.json encoding by @UranusSeven in #297
BUG: handle ChatGLM ggml specific case for RESTful API by @jiayini1119 in #309
BUG: handle Qwen update by @UranusSeven in #307

Others

DEMO: LangChain QA System with Xinference LLMs and Milvus Vector DB by @jiayini1119 in #304
Chore: update issue template by @UranusSeven in #300
Chore: remove codecov by @UranusSeven in #308

Full Changelog: v0.1.1...v0.1.2

Check out latest releases or
releases around xorbitsai/inference v0.1.2

Don't miss a new inference release

NewReleases is sending notifications on new releases.

Get notifications