What's new in 0.8.2 (2024-01-26)
These are the changes in inference v0.8.2.
New features
- FEAT: Support events by @codingl2k1 in #916
- FEAT: Support audio model by @codingl2k1 in #929
- FEAT: Support orion series models by @aresnow1 in #933
- Feat: Support Mixtral-8x7B-Instruct-v0.1-AWQ by @aresnow1 in #936
Enhancements
- ENH: Launch model by
version
by @ChengjieLi28 in #896 - ENH: Move multimodal to LLM by @codingl2k1 in #917
- ENH: InternLM2 chat template by @aresnow1 in #919
- ENH: Support
use_fp16
for rerank model by @aresnow1 in #927 - ENH: record instance count and version count when detailed listing model registrations by @ChengjieLi28 in #920
- BLD: Resolve conflicts during installation by @aresnow1 in #924
- REF: Move auth code to service for better scalability by @ChengjieLi28 in #925
Documentation
- DOC: Update readme by @aresnow1 in #914
- DOC: Display contributors in readme by @onesuper in #915
- DOC: Merge multimodal to LLM by @codingl2k1 in #923
- DOC: Model usage guide by @onesuper in #926
- DOC: Audio doc by @codingl2k1 in #937
Full Changelog: v0.8.1...v0.8.2