What's new in 0.10.3 (2024-04-24)
These are the changes in inference v0.10.3.
New features
- FEAT: support llama-3 family by @qinxuye in #1332
- FEAT: Add Belle-whisper-large-v3-zh by @codingl2k1 in #1351
Enhancements
- ENH: fix the max length of codeqwen-7B-chat by @mikeshi80 in #1354
- ENH: Clear cache for embedding and rerank by @codingl2k1 in #1360
Bug fixes
- BUG: Fix launching embedding or reranking models from the command line failing due to PEFT by @hainaweiben in #1343
- BUG: Fix extra parameters issue when auto-recovering models by @ChengjieLi28 in #1348
- BUG: Fix issue where old rerank models use flag rerank by @codingl2k1 in #1350
Documentation
- DOC: Add new models to README by @qinxuye in #1346
- DOC: Update README, add FastGPT to integrations by @yangchuansheng in #1355
New Contributors
- @yangchuansheng made their first contribution in #1355
Full Changelog: v0.10.2.post1...v0.10.3