What's new in 0.10.2 (2024-04-19)
These are the changes in inference v0.10.2.
New features
- FEAT: [UI] Add replica configuration when launching
embedding
andrerank
models by @yiboyasss in #1306 - FEAT: Lora multi support by @hainaweiben in #1273
- FEAT: Support SeaLLM-7B and c4ai-command-r-v01 by @mujin2 in #1310
- FEAT: Support BAAI/bge-reranker-v2-* rerank model by @codingl2k1 in #1305
- FEAT: UI supports multi lora by @yiboyasss in #1320
- FEAT: Add_cia4command_modelscope by @mujin2 in #1321
- FEAT: support m3e embedding models by @qinxuye in #1298
- FEAT: hotkey to active search by @Minamiyama in #1287
- FEAT: support codeqwen1.5-chat by @qinxuye in #1322
Enhancements
- ENH: Support custom audio model by @amumu96 in #1279
- ENH: support int and str compare for model size by @mikeshi80 in #1277
- BLD: Add
FlagEmbedding
in cpu docker by @ChengjieLi28 in #1318 - REF: support query for engine feature by @Ago327 in #1294
Others
Full Changelog: v0.10.1...v0.10.2