What's new in 0.1.2 (2023-08-04)
These are the changes in inference v0.1.2.
New features
- FEAT: custom model by @UranusSeven in #290
Enhancements
- ENH: select q4_0 as default quantization method for ggmlv3 model in benchmark by @pangyoki in #293
- ENH: disable gradio telemetry by @UranusSeven in #299
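For readers who want the same effect in their own Gradio scripts, here is a minimal sketch. It assumes Gradio's documented GRADIO_ANALYTICS_ENABLED environment variable; it is not necessarily how #299 implements the change, and the helper name telemetry_disabled is hypothetical.

```python
import os

# Assumption: Gradio honors the GRADIO_ANALYTICS_ENABLED environment variable.
# Set it before importing gradio so the setting is in place when it is read.
os.environ["GRADIO_ANALYTICS_ENABLED"] = "False"


def telemetry_disabled() -> bool:
    """Hypothetical helper: report whether the opt-out variable is set."""
    return os.environ.get("GRADIO_ANALYTICS_ENABLED", "True").lower() == "false"
```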
Bug fixes
- BUG: llm_family.json encoding by @UranusSeven in #297
- BUG: handle ChatGLM ggml specific case for RESTful API by @jiayini1119 in #309
- BUG: handle Qwen update by @UranusSeven in #307
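Encoding bugs with JSON files often come from relying on the platform's default text encoding. The sketch below shows the general remedy of passing an explicit encoding when reading; it is an illustration under that assumption, not the actual patch in #297. The function name load_json_utf8 and the sample data are hypothetical.

```python
import json
import os
import tempfile


def load_json_utf8(path):
    """Read a JSON file as UTF-8 regardless of the platform default encoding."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


# Hypothetical sample data containing non-ASCII text.
sample = {"model_name": "示例模型"}

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "llm_family.json")
    # Write the file as UTF-8 so the round trip is well defined.
    with open(path, "w", encoding="utf-8") as f:
        json.dump(sample, f, ensure_ascii=False)
    loaded = load_json_utf8(path)
```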
Others
- DEMO: LangChain QA System with Xinference LLMs and Milvus Vector DB by @jiayini1119 in #304
- Chore: update issue template by @UranusSeven in #300
- Chore: remove codecov by @UranusSeven in #308
Full Changelog: v0.1.1...v0.1.2