xorbitsai/inference v0.10.3
on GitHub

latest releases: v1.0.0, v0.16.3, v0.16.2...

6 months ago

What's new in 0.10.3 (2024-04-24)

These are the changes in inference v0.10.3.

New features

FEAT: support llama-3 family by @qinxuye in #1332
FEAT: Add Belle-whisper-large-v3-zh by @codingl2k1 in #1351

Enhancements

ENH: fix the max length of codeqwen-7B-chat by @mikeshi80 in #1354
ENH: Clear cache for embedding and rerank by @codingl2k1 in #1360

Bug fixes

BUG: Fix Launching embedding or reranking models from commandline fails due to PEFT by @hainaweiben in #1343
BUG: Fix extra parameters issue when auto-recovering models by @ChengjieLi28 in #1348
BUG: Fix old rerank models use flag rerank issue by @codingl2k1 in #1350

Documentation

DOC: Add new models to README by @qinxuye in #1346
DOC: Update README, add FastGPT to integrations by @yangchuansheng in #1355

New Contributors

@yangchuansheng made their first contribution in #1355

Full Changelog: v0.10.2.post1...v0.10.3

Check out latest releases or
releases around xorbitsai/inference v0.10.3

Don't miss a new inference release

NewReleases is sending notifications on new releases.

Get notifications