github mudler/LocalAI v4.5.6

5 hours ago

What's Changed

👒 Dependencies

Other Changes

  • docs: ⬆️ update docs version mudler/LocalAI by @localai-bot in #10560
  • fix(distributed): missing agent NATS permission by @ALameLlama in #10549
  • feat(distributed): SyncedMap component + migrate finetune/quant/agent-tasks to cross-replica state by @localai-bot in #10542
  • chore(fish-speech): drop the darwin/metal build target by @localai-bot in #10561
  • fix(config): fall back to DefaultContextSize for unparseable GGUFs; pin NVFP4 gallery context_size by @localai-bot in #10563
  • ci(vibevoice): skip the ASR transcription e2e on release tag builds by @localai-bot in #10567
  • fix(gallery): match mmproj/model quant as a whole token so F16 no longer selects BF16 (#10559) by @localai-bot in #10564
  • fix(distributed): return empty backend list for agent nodes instead of failing backend.list (#10545) by @localai-bot in #10565
  • feat(distributed): add LOCALAI_DISTRIBUTED_SHARED_MODELS to skip staging on shared volumes (#10556) by @localai-bot in #10566
  • chore: ⬆️ Update leejet/stable-diffusion.cpp to 9956436c925a367daeab097598b1ea1f32d3503f by @localai-bot in #10533
  • fix(openresponses): bound resume-stream buffer and enforce response ownership by @localai-bot in #10569
  • chore: ⬆️ Update ggml-org/whisper.cpp to 0ae02cdb2c7317b50991367c165736ce42ed96ac by @localai-bot in #10532
  • chore: ⬆️ Update CrispStrobe/CrispASR to 6514c9da00b03a2f0f1b49a43fae4f3a01a41844 by @localai-bot in #10535
  • chore: ⬆️ Update ggml-org/llama.cpp to 0ed235ea2c17a19fc8238668653946721ed136fd by @localai-bot in #10536
  • fix(ik-llama): port multimodal path to mtmd API and bump to f96eaddb (#10534) by @localai-bot in #10568
  • feat(backends): add voice-detect + face-detect ggml backends (replace Python insightface/speaker-recognition) by @localai-bot in #10441
  • fix(kokoro): add explicit click dep so spacy CLI works on intel build by @localai-bot in #10572
  • fix(launcher): robust binary download/upgrade (resume, rate-limit, UX) by @localai-bot in #10575
  • fix(distributed): missing agent NATS permissions by @ALameLlama in #10571
  • fix(fish-speech): allow invalid_reference_casting so tokenizers builds on darwin by @localai-bot in #10573
  • fix(oci): retry layer downloads on transient network errors by @localai-bot in #10579
  • chore(model-gallery): ⬆️ update checksum by @localai-bot in #10585
  • chore: ⬆️ Update leejet/stable-diffusion.cpp to c1790754d31bec0731ed5fddc9d5b9ff22ee19cd by @localai-bot in #10584
  • chore: ⬆️ Update CrispStrobe/CrispASR to 6b50f76e59700665358a1aabf5295597fa318e06 by @localai-bot in #10583
  • chore: ⬆️ Update ggml-org/llama.cpp to dbdaece23de9ac63f2e7ca9e6bfcdc4fc156a3fa by @localai-bot in #10582
  • chore: ⬆️ Update mudler/voice-detect.cpp to 3d510772357538c5182808ac7de2278b84824e24 by @localai-bot in #10581
  • chore: ⬆️ Update mudler/face-detect.cpp to 06914b077d52f90d5421299138e7be6bdd06b5e8 by @localai-bot in #10580
  • chore: ⬆️ Update vllm-metal (darwin) to v0.3.0.dev20260628073537 by @localai-bot in #10562
  • chore(recon): re-pin voice/face-detect to squashed release commits (+ graph-cache fix) by @localai-bot in #10591
  • fix(sglang): parse tool_call function arguments before applying the chat template by @pos-ei-don in #10558
  • feat(realtime): Semantic VAD EOU token by @richiejp in #10444
  • fix(openai): stop max_tokens streaming retry loop on reasoning models (#9716) by @Dennisadira in #10448
  • fix(import): derive model name from selected GGUF for repo-root URIs by @Dennisadira in #10589
  • fix(functions): avoid quadratic-time debug logging in CleanupLLMResult / ParseFunctionCall by @pos-ei-don in #10592
  • chore: ⬆️ Update leejet/stable-diffusion.cpp to 3b6c9ca97cfcda8e68e719e6670d06379fcbe943 by @localai-bot in #10594
  • chore: ⬆️ Update ggml-org/llama.cpp to 6f4f53f2b7da54fcdbbecaaa734337c337ad6176 by @localai-bot in #10595
  • chore: ⬆️ Update localai-org/privacy-filter.cpp to 595f59630c69d361b5196f2aba2c71c873d0c13c by @localai-bot in #10596
  • chore: ⬆️ Update CrispStrobe/CrispASR to 3b93758f9725d400eca82976f895e4cec3f31260 by @localai-bot in #10597
  • chore: ⬆️ Update ikawrakow/ik_llama.cpp to f74a6fb87b315b2c3154166e075360e15021a61d by @localai-bot in #10598
  • fix(import): strip file:// scheme from model path for local imports by @Dennisadira in #10599
  • fix(tests): align openresponses test model name with GGUF-derived naming (#10589) by @localai-bot in #10609
  • fix(macos): staple the notarization ticket to the .app, not just the dmg by @localai-bot in #10606
  • fix(watchdog): persist UI-saved Check Interval across restarts (#10601) by @localai-bot in #10605
  • feat(config): default swa_full:true for sliding-window-attention models by @localai-bot in #10611

New Contributors

Full Changelog: v4.5.5...v4.5.6

Don't miss a new LocalAI release

NewReleases is sending notifications on new releases.