mudler/LocalAI v1.40.0 on GitHub

This release is a preparation before v2 - the efforts now will be to refactor, polish and add new backends. Follow up on: #1126

Hot topics

This release now brings the llama-cpp backend which is a c++ backend tied to llama.cpp. It follows more closely and tracks recent versions of llama.cpp. It is not feature compatible with the current llama backend but plans are to sunset the current llama backend in favor of this one. This one will be probably be the latest release containing the older llama backend written in go and c++. The major improvement with this change is that there are less layers that could be expose to potential bugs - and as well it ease out maintenance as well.

Support for ROCm/HIPBLAS

This release bring support for AMD thanks to @65a . See more details in #1100

More CLI commands

Thanks to @jespino now the local-ai binary has more subcommands allowing to manage the gallery or try out directly inferencing, check it out!

What's Changed

Bug fixes 🐛

fix(openai): Populate ID and Created fields in OpenAI compatible responses by @jespino in #1164
Fix backend/cpp/llama CMakeList.txt on OSX by @dave-gray101 in #1212

Exciting New Features 🎉

feat(llama.cpp): enable ROCm/HIPBLAS support by @65a in #1100
Cleaning up examples/ models and starter .env files by @jamesbraza in #1124
Update docker-compose.yaml by @scenaristeur in #1131
Feats: bruno example, gallery improvements for new scraper by @dave-gray101 in #1161
feat(cli): Adding models subcommand with list and install subcommands by @jespino in #1165
feat: add tts command line subcommand by @jespino in #1169
Adding transcript subcommand by @jespino in #1171
feat: llama.cpp gRPC C++ backend by @mudler in #1170
feat(metrics): Adding initial support for prometheus metrics by @jespino in #1176
ci: add GPU tests by @mudler in #1095

👒 Dependencies

fix(deps): update module github.com/onsi/gomega to v1.28.0 by @renovate in #1113
⬆️ Update go-skynet/go-llama.cpp by @localai-bot in #1106
fix(deps): update github.com/tmc/langchaingo digest to e16b777 by @renovate in #1101
fix(deps): update github.com/go-skynet/go-llama.cpp digest to 79f9587 by @renovate in #1085
fix(deps): update module github.com/shirou/gopsutil/v3 to v3.23.9 by @renovate in #1120
fix(deps): update module github.com/sashabaranov/go-openai to v1.15.4 by @renovate in #1122
fix(deps): update module github.com/rs/zerolog to v1.31.0 by @renovate in #1102
⬆️ Update go-skynet/go-llama.cpp by @localai-bot in #1130
fix(deps): update github.com/go-skynet/go-llama.cpp digest to 6018c9d by @renovate in #1129
⬆️ Update go-skynet/go-llama.cpp by @localai-bot in #1136
fix(deps): update github.com/go-skynet/go-llama.cpp digest to 1676dcd by @renovate in #1135
fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 56c0d28 by @renovate in #1140
fix(deps): update module github.com/onsi/ginkgo/v2 to v2.13.0 by @renovate in #1152
fix(deps): update module google.golang.org/grpc to v1.58.3 by @renovate in #1160
fix(deps): update github.com/go-skynet/go-llama.cpp digest to aeba71e by @renovate in #1155
fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 10f9b49 by @renovate in #1158
fix(deps): update module github.com/sashabaranov/go-openai to v1.16.0 by @renovate in #1159
fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 22de3c5 by @renovate in #1172
fix(deps): update github.com/tmc/langchaingo digest to a02d4fd by @renovate in #1175
fix(deps): update module github.com/gofiber/fiber/v2 to v2.50.0 by @renovate in #1177
fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 9a19c74 by @renovate in #1179
fix(deps): update github.com/tmc/langchaingo digest to c636b3d by @renovate in #1188
fix(deps): update module google.golang.org/grpc to v1.59.0 by @renovate in #1189
chore(deps): update actions/checkout action to v4 by @renovate in #1006
feat(llama.cpp): update by @mudler in #1200
⬆️ Update go-skynet/go-llama.cpp by @localai-bot in #1156
fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to c25dc51 by @renovate in #1191
⬆️ Update ggerganov/llama.cpp by @localai-bot in #1204
fix(deps): update module github.com/onsi/gomega to v1.28.1 by @renovate in #1205

Other Changes

fix(deps): update github.com/nomic-ai/gpt4all/gpt4all-bindings/golang digest to 6711bdd by @renovate in #1079
ci: cleanup worker by @mudler in #1166
docs(examples): Add mistral example by @mudler in #1214
feat(llama.cpp): Bump llama.cpp, adapt grpc server by @mudler in #1211
cleanup: drop bloomz and ggllm as now supported by llama.cpp by @mudler in #1217
ci: use self-hosted to build container images by @mudler in #1206
ci: run only cublas on selfhosted by @mudler in #1224
⬆️ Update ggerganov/llama.cpp by @localai-bot in #1207

New Contributors

@65a made their first contribution in #1100
@localai-bot made their first contribution in #1106
@scenaristeur made their first contribution in #1131
@jespino made their first contribution in #1164

Full Changelog: v1.30.0...v1.40.0