What's Changed
Bug fixes 🐛
- fix: make sure to turn down all processes on exit by @mudler in #6200
- fix(p2p): automatically install llama-cpp for p2p workers by @mudler in #6199
- Point to LocalAI-examples repo for llava by @mauromorales in #6241
- fix: runtime capability detection for backends by @sozercan in #6149
- fix(chat): use proper finish_reason for tool/function calling by @imkira in #6243
- fix(rocm): Rename tag suffix for hipblas whisper build to match backend config by @KingJ in #6247
- fix(llama-cpp): correctly calculate embeddings by @mudler in #6259
Exciting New Features 🎉
- feat(launcher): show welcome page by @mudler in #6234
- feat: support HF_ENDPOINT env for the HuggingFace endpoint by @qxo in #6220
🧠 Models
- chore(model gallery): add nousresearch_hermes-4-14b by @mudler in #6197
- chore(model gallery): add MiniCPM-V-4.5-8b-q4_K_M by @M0Rf30 in #6205
- chore(model-gallery): ⬆️ update checksum by @localai-bot in #6211
- feat(whisper): Add diarization (tinydiarize) by @richiejp in #6184
- chore(model gallery): add baidu_ernie-4.5-21b-a3b-thinking by @mudler in #6267
- chore(model gallery): add aquif-ai_aquif-3.5-8b-think by @mudler in #6269
- chore(model gallery): add qwen3-stargate-sg1-uncensored-abliterated-8b-i1 by @mudler in #6270
- chore(model gallery): add k2-think-i1 by @mudler in #6288
- chore(model gallery): add holo1.5-72b by @mudler in #6289
- chore(model gallery): add holo1.5-7b by @mudler in #6290
- chore(model gallery): add holo1.5-3b by @mudler in #6291
- chore(model gallery): add alibaba-nlp_tongyi-deepresearch-30b-a3b by @mudler in #6295
- chore(model gallery): add webwatcher-7b by @mudler in #6297
- chore(model gallery): add webwatcher-32b by @mudler in #6298
- chore(model gallery): add websailor-32b by @mudler in #6299
- chore(model gallery): add websailor-7b by @mudler in #6300
📖 Documentation and examples
👒 Dependencies
- chore(deps): bump github.com/opencontainers/image-spec from 1.1.0 to 1.1.1 by @dependabot[bot] in #6223
- chore(deps): bump actions/stale from 9.1.0 to 10.0.0 by @dependabot[bot] in #6227
- chore(deps): bump go.opentelemetry.io/otel/exporters/prometheus from 0.50.0 to 0.60.0 by @dependabot[bot] in #6226
- chore(deps): bump oras.land/oras-go/v2 from 2.5.0 to 2.6.0 by @dependabot[bot] in #6225
- chore(deps): bump github.com/swaggo/swag from 1.16.3 to 1.16.6 by @dependabot[bot] in #6222
- chore(deps): bump actions/labeler from 5 to 6 by @dependabot[bot] in #6229
- feat(nvidia-gpu): bump images to cuda 12.8 by @mudler in #6239
- feat(chatterbox): add MPS, and CPU, pin version by @mudler in #6242
Other Changes
- chore: ⬆️ Update ggml-org/llama.cpp to
0fce7a1248b74148c1eb0d368b7e18e8bcb96809
by @localai-bot in #6193 - chore: ⬆️ Update leejet/stable-diffusion.cpp to
2eb3845df5675a71565d5a9e13b7bad0881fafcd
by @localai-bot in #6192 - docs: ⬆️ update docs version mudler/LocalAI by @localai-bot in #6201
- chore: ⬆️ Update ggml-org/llama.cpp to
fb15d649ed14ab447eeab911e0c9d21e35fb243e
by @localai-bot in #6202 - Fix Typos in Docs by @alizfara112 in #6204
- chore: ⬆️ Update ggml-org/whisper.cpp to
bb0e1fc60f26a707cabf724edcf7cfcab2a269b6
by @localai-bot in #6203 - chore: ⬆️ Update ggml-org/llama.cpp to
408ff524b40baf4f51a81d42a9828200dd4fcb6b
by @localai-bot in #6207 - chore: ⬆️ Update ggml-org/llama.cpp to
c4df49a42d396bdf7344501813e7de53bc9e7bb3
by @localai-bot in #6209 - chore: ⬆️ Update leejet/stable-diffusion.cpp to d7f430cd693f2e12ecbaa0ce881746cf305c3b1f by @richiejp in #6213
- chore: ⬆️ Update leejet/stable-diffusion.cpp to
c648001030d4c2cc7c851fdaf509ee36d642dc99
by @localai-bot in #6215 - chore: ⬆️ Update ggml-org/llama.cpp to
3976dfbe00f02a62c0deca32c46138e4f0ca81d8
by @localai-bot in #6214 - chore: ⬆️ Update leejet/stable-diffusion.cpp to
abb115cd021fc2beed826604ed1a479b6a77671c
by @localai-bot in #6236 - chore: ⬆️ Update ggml-org/whisper.cpp to
edea8a9c3cf0eb7676dcdb604991eb2f95c3d984
by @localai-bot in #6237 - chore: ⬆️ Update leejet/stable-diffusion.cpp to
b0179181069254389ccad604e44f17a2c25b4094
by @localai-bot in #6246 - chore: ⬆️ Update ggml-org/llama.cpp to
0e6ff0046f4a2983b2c77950aa75960fe4b4f0e2
by @localai-bot in #6235 - chore: ⬆️ Update leejet/stable-diffusion.cpp to fce6afcc6a3250a8e17923608922d2a99b339b47 by @richiejp in #6256
- chore: ⬆️ Update ggml-org/llama.cpp to
40be51152d4dc2d47444a4ed378285139859895b
by @localai-bot in #6260 - chore: ⬆️ Update ggml-org/llama.cpp to
aa0c461efe3603639af1a1defed2438d9c16ca0f
by @localai-bot in #6261 - chore(aio): upgrade minicpm-v model to latest 4.5 by @M0Rf30 in #6262
- chore: ⬆️ Update ggml-org/llama.cpp to
0fa154e3502e940df914f03b41475a2b80b985b0
by @localai-bot in #6263 - chore: ⬆️ Update ggml-org/llama.cpp to
6c019cb04e86e2dacfe62ce7666c64e9717dde1f
by @localai-bot in #6265 - chore: ⬆️ Update leejet/stable-diffusion.cpp to 0ebe6fe118f125665939b27c89f34ed38716bff8 by @richiejp in #6271
- chore: ⬆️ Update ggml-org/llama.cpp to
b907255f4bd169b0dc7dca9553b4c54af5170865
by @localai-bot in #6287 - chore: ⬆️ Update ggml-org/llama.cpp to
8ff206097c2bf3ca1c7aa95f9d6db779fc7bdd68
by @localai-bot in #6292
New Contributors
- @alizfara112 made their first contribution in #6204
- @qxo made their first contribution in #6220
- @imkira made their first contribution in #6243
- @KingJ made their first contribution in #6247
Full Changelog: v3.5.0...v3.5.1