npm openclaw 2026.4.5
openclaw 2026.4.5

14 hours ago

Breaking

  • Config: remove legacy public config aliases such as talk.voiceId / talk.apiKey, agents.*.sandbox.perSession, browser.ssrfPolicy.allowPrivateNetwork, hooks.internal.handlers, and channel/group/room allow toggles in favor of the canonical public paths and enabled, while keeping load-time compatibility and openclaw doctor --fix migration support for existing configs. (#60726) Thanks @vincentkoc.

Changes

  • Agents/video generation: add the built-in video_generate tool so agents can create videos through configured providers and return the generated media directly in the reply.
  • Agents/music generation: ignore unsupported optional hints such as durationSeconds with a warning instead of hard-failing requests on providers like Google Lyria.
  • Providers/ComfyUI: add a bundled comfy workflow media plugin for local ComfyUI and Comfy Cloud workflows, including shared image_generate, video_generate, and workflow-backed music_generate support, with prompt injection, optional reference-image upload, live tests, and output download.
  • Tools/music generation: add the built-in music_generate tool with bundled Google (Lyria) and MiniMax providers plus workflow-backed Comfy support, including async task tracking and follow-up delivery of finished audio.
  • Providers: add bundled Qwen, Fireworks AI, and StepFun providers, plus MiniMax TTS, Ollama Web Search, and MiniMax Search integrations for chat, speech, and search workflows. (#60032, #55921, #59318, #54648)
  • Providers/Amazon Bedrock: add bundled Mantle support plus inference-profile discovery and automatic request-region injection so Bedrock-hosted Claude, GPT-OSS, Qwen, Kimi, GLM, and similar routes work with less manual setup. (#61296, #61299) Thanks @wirjo.
  • Control UI/multilingual: add localized control UI support for Simplified Chinese, Traditional Chinese, Brazilian Portuguese, German, Spanish, Japanese, Korean, French, Turkish, Indonesian, Polish, and Ukrainian. Thanks @vincentkoc.
  • Plugins: add plugin-config TUI prompts to guided onboarding/setup flows, and add openclaw plugins install --force so existing plugin and hook-pack targets can be replaced without using the dangerous-code override flag. (#60590, #60544)
  • Control UI/skills: add ClawHub search, detail, and install flows directly in the Skills panel. (#60134) Thanks @samzong.
  • iOS/exec approvals: add generic APNs approval notifications that open an in-app exec approval modal, fetch command details only after authenticated operator reconnect, and clear stale notification state when the approval resolves. (#60239) Thanks @ngutman.
  • Matrix/exec approvals: add Matrix-native exec approval prompts with account-scoped approvers, channel-or-DM delivery, and room-thread aware resolution handling. (#58635) Thanks @gumadeiras.
  • Channels/context visibility: add configurable contextVisibility per channel (all, allowlist, allowlist_quote) so supplemental quote, thread, and fetched history context can be filtered by sender allowlists instead of always passing through as received.
  • Providers/request overrides: add shared model and media request transport overrides across OpenAI-, Anthropic-, Google-, and compatible provider paths, including headers, auth, proxy, and TLS controls. (#60200)
  • Providers/OpenAI: add forward-compat openai-codex/gpt-5.4-mini, an opt-in GPT personality, and provider-owned GPT-5 prompt contributions so Codex/GPT runs stay cache-stable and compatible with bundled catalog lag.
  • Agents/Claude CLI: expose OpenClaw tools to background Claude CLI runs through a loopback MCP bridge and switch bundled runs to stdin + stream-json partial-message streaming so prompts stop riding argv, long replies show live progress, and final session/usage metadata still land cleanly. (#35676) Thanks @mylukin.
  • ACPX/runtime: embed the ACP runtime directly in the bundled acpx plugin, remove the extra external ACP CLI hop, harden live ACP session binding and reuse, and add a generic reply_dispatch hook so bundled plugins like ACPX can own reply interception without hardcoded ACP paths in core auto-reply routing. (#61319)
  • Agents/progress: add experimental structured plan updates and structured execution item events so compatible UIs can show clearer step-by-step progress during long-running runs.
  • Providers/Anthropic: remove the Claude CLI backend and setup-token from new onboarding, keep existing configured legacy profiles runnable, and have openclaw doctor repair or remove stale anthropic:claude-cli state during migration.
  • Tools/video generation: add bundled xAI (grok-imagine-video), Alibaba Model Studio Wan, and Runway video providers, plus live-test/default model wiring for all three.
  • Memory/search: add Amazon Bedrock embeddings for Titan, Cohere, Nova, and TwelveLabs models, with AWS credential-chain auto-detection for provider: "auto" and provider-specific dimension controls. Thanks @wirjo.
  • Providers/Amazon Bedrock Mantle: generate bearer tokens from the AWS credential chain so Mantle auto-discovery can use IAM auth without manually exporting AWS_BEARER_TOKEN_BEDROCK. Thanks @wirjo.
  • Memory/dreaming (experimental): add weighted short-term recall promotion, a /dreaming command, Dreams UI, multilingual conceptual tagging, and doctor/status repair support, while refactoring dreaming from competing modes into three cooperative phases (light, deep, REM) with independent schedules and recovery behavior so durable memory promotion can run in the background with less manual setup. (#60569, #60697) Thanks @vignesh07.
  • Memory/dreaming: add configurable aging controls (recencyHalfLifeDays, maxAgeDays) plus optional verbose logging so operators can tune recall decay and inspect promotion decisions more easily.
  • Memory/dreaming: add REM preview tooling (openclaw memory rem-harness, promote-explain), surface possible lasting truths during REM staging, and make deep promotion replay-safe so reruns reconcile instead of duplicating MEMORY.md entries.
  • Memory/dreaming: write dreaming trail content to top-level dreams.md instead of daily memory notes, update /dreaming help text to point there, and keep dreams.md available for explicit reads without pulling it into default recall. Thanks @davemorin.
  • Memory/dreaming: add the Dream Diary surface in Dreams, simplify user-facing dreaming config to enabled plus optional frequency, treat phases as implementation detail in docs/UI, and keep the lobster animation visible above diary content. Thanks @vignesh07.
  • Prompt caching: keep prompt prefixes more reusable across transport fallback, deterministic MCP tool ordering, compaction, embedded image history, normalized system-prompt fingerprints, openclaw status --verbose cache diagnostics, and the removal of duplicate in-band tool inventories from agent system prompts so follow-up turns hit cache more reliably. (#58036, #58037, #58038, #59054, #60603, #60691) Thanks @bcherny and @vincentkoc.
  • Agents/cache: diagnostics: add prompt-cache break diagnostics, trace live cache scenarios through embedded runner paths, and show cache reuse explicitly in openclaw status --verbose. Thanks @vincentkoc.
  • Agents/cache: stabilize cache-relevant system prompt fingerprints by normalizing equivalent structured prompt whitespace, line endings, hook-added system context, and runtime capability ordering so semantically unchanged prompts reuse KV/cache more reliably. Thanks @vincentkoc.
  • Agents/tool prompts: remove the duplicate in-band tool inventory from agent system prompts so tool-calling models rely on the structured tool definitions as the single source of truth, improving prompt stability and reducing stale tool guidance.
  • Config/schema: enrich the exported openclaw config schema JSON Schema with field titles and descriptions so editors, agents, and other schema consumers receive the same config help metadata. (#60067) Thanks @solavrc.
  • Providers/CLI: remove bundled CLI text-provider backends and the agents.defaults.cliBackends surface, while keeping ACP harness sessions and Gemini media understanding on the native bundled providers.
  • Matrix/exec approvals: clarify unavailable-approval replies so Matrix no longer claims chat approvals are unsupported when native exec approvals are merely unconfigured. (#61424) Thanks @gumadeiras.
  • Docs/IRC: replace public IRC hostname examples with irc.example.com and recommend private servers for bot coordination while listing common public networks for intentional use.
  • Memory/dreaming: group nearby daily-note lines into short coherent chunks before staging them for dreaming, so one-off context from recent notes reaches REM/deep with better evidence and less line-level noise.
  • Memory/dreaming: drop generic date/day headings from daily-note chunk prefixes while keeping meaningful section labels, so staged snippets stay cleaner and more reusable. (#61597) Thanks @mbelinky.
  • Plugins/Lobster: run bundled Lobster workflows in process instead of spawning the external CLI, reducing transport overhead and unblocking native runtime integration. (#61523) Thanks @mbelinky.
  • Plugins/Lobster: harden managed resume validation so invalid TaskFlow resume calls fail earlier, and memoize embedded runtime loading per runner while keeping failed loads retryable. (#61566) Thanks @mbelinky.

Fixes

  • Security: preserve restrictive plugin-only tool allowlists, require owner access for /allowlist add and /allowlist remove, fail closed when before_tool_call hooks crash, block browser SSRF redirect bypasses earlier, and keep non-interactive auth-choice inference scoped to bundled and already-trusted plugins. (#58476, #59836, #59822, #58771, #59120) Thanks @eleqtrizit and @pgondhi987.
  • Providers/OpenAI: make GPT-5 and Codex runs act sooner with lower-verbosity defaults, visible progress during tool work, and a one-shot retry when a turn only narrates the plan instead of taking action.
  • Providers/OpenAI and reply delivery: preserve native reasoning.effort: "none" and strict schemas where supported, add GPT-5.4 assistant phase metadata across replay and the Gateway /v1/responses layer, and keep commentary buffered until final_answer so web chat, session previews, embedded replies, and Telegram partials stop leaking planning text. Fixes #59150, #59643, #61282.
  • Telegram: fix current-model checks in the model picker, HTML-format non-default /model confirmations, explicit topic replies, persisted reaction ownership across restarts, caption-media placeholder and file_id preservation on download failure, and upgraded-install inbound image reads. (#60384, #60042, #59634, #59207, #59948, #59971) Thanks @sfuminya, @GitZhangChi, @dashhuang, @samzong, @v1p0r, and @neeravmakwana.
  • Telegram: restore DM voice-note preflight transcription so direct-message audio stops arriving as raw <media:audio> placeholders. (#61008) Thanks @manueltarouca.
  • Telegram/reasoning: only create a Telegram reasoning preview lane when the session is explicitly reasoning:stream, so hidden <think> traces from streamed replies stop surfacing as chat previews on normal sessions. Thanks @vincentkoc.
  • Telegram/native command menu: trim long menu descriptions before dropping commands so sub-100 command sets can still fit Telegram's payload budget and keep more / entries visible. (#61129) Thanks @neeravmakwana.
  • Discord: keep REST, webhook, and monitor traffic on the configured proxy, preserve component-only media sends, honor @everyone and @here mention gates, keep ACK reactions on the active account, and split voice connect/playback timeouts so auto-join is more reliable. (#57465, #60361, #60345) Thanks @geekhuashan.
  • Discord/reply tags: strip leaked [[reply_to_current]] control tags from preview text and honor explicit reply-tag threading during final delivery, so Discord replies stay attached to the triggering message instead of printing reply metadata into chat.
  • Discord/replies: replace the unshipped replyToOnlyWhenBatched flag with replyToMode: "batched" so native reply references only attach on debounced multi-message turns while explicit reply tags still work.
  • Discord/image generation: include the real generated MEDIA: paths in tool output, avoid duplicate plain-output media requeueing, and persist volatile workspace-generated media into durable outbound media before final reply delivery so generated image replies stop pointing at missing local files.
  • Slack: route live DM replies back to the concrete inbound DM channel while keeping persisted routing metadata user-scoped, so normal assistant replies stop disappearing when pairing and system messages still arrive. (#59030) Thanks @afurm.
  • WhatsApp: restore channels.whatsapp.blockStreaming and reset watchdog timeouts after reconnect so quiet chats stop falling into reconnect loops. (#60007, #60069) Thanks @MonkeyLeeT and @mcaxtr.
  • Android/Talk Mode: cancel in-flight talk.speak playback when speech is explicitly stopped, and restore spoken replies on both node-scoped and gateway-backed sessions by keeping reply routing and embedded transport overrides aligned with the current playback path. (#60306, #61164, #61214)
  • Voice-call/OpenAI: pass full plugin config into realtime transcription provider resolution so streaming calls can discover the bundled OpenAI realtime transcription provider again. Fixes #60936. Thanks @sliekens and @vincentkoc.
  • Matrix/exec approvals: anchor seeded approval reactions to the primary Matrix prompt event, resolve them from event metadata instead of prompt text, and clean up chunked approval prompts correctly. (#60931) Thanks @gumadeiras.
  • Matrix: recover more reliably when secret storage or recovery keys are missing by recreating secret storage during repair and backup reset, hold crypto snapshot locks during persistence, and surface explicit too-large attachment markers. (#59846, #59851, #60599, #60289) Thanks @al3mart, @emonty, and @efe-arv.
  • Matrix/DM sessions: add channels.matrix.dm.sessionScope, shared-session collision notices, and aligned outbound session reuse so separate Matrix DM rooms can keep distinct context when configured. (#61373) Thanks @gumadeiras.
  • Matrix: move legacy top-level avatarUrl into the default account during multi-account promotion and keep env-backed account setup avatar config persisted. (#61437) Thanks @gumadeiras.
  • MS Teams: download inline DM images via Graph API and preserve channel reply threading in proactive fallback. (#52212, #55198) Thanks @Ted-developer and @hyojin.
  • MS Teams: replace the deprecated Teams SDK HttpPlugin stub with httpServerAdapter so recurring gateway deprecation warnings stop firing and the Express 5 compatibility workaround stays on the supported SDK path. (#60939) Thanks @coolramukaka-sys.
  • Control UI/chat: add a per-session thinking-level picker in the chat header and mobile chat settings, and keep the browser bundle on UI-local thinking/session-key helpers so Safari no longer crashes on Node-only imports before rendering chat controls.
  • Sandbox/SSH: reject hardlinked files during cross-device rename fallback so EXDEV file copies preserve the same pinned file-boundary checks as direct reads.
  • Control UI: keep Stop visible during tool-only execution, preserve pending-send busy state, and clear stale ClawHub search results as soon as the query changes. (#54528, #59800, #60267) Thanks @chziyue and @frankekn.
  • Control UI/avatar: honor ui.assistant.avatar when serving /avatar/:agentId so Appearance UI avatar paths stop falling back to initials placeholders. (#60778) Thanks @hannasdev.
  • Control UI/cron: highlight the Cron refresh button while refresh is in flight so the page's loading state stays visible even when prior data remains on screen. (#60394) Thanks @coder-zhuzm.
  • Control UI/Overview: prevent gateway access token/password visibility toggle buttons from overlapping their inputs at narrow widths. (#56924) Thanks @bbddbb1.
  • Auto-reply: unify reply lifecycle ownership across preflight compaction, session rotation, CLI-backed runs, and gateway restart handling so /stop and same-session overlap checks target the right active turn and restart-interrupted turns return the restart notice instead of being silently dropped. (#61267) Thanks @dutifulbob.
  • Reply delivery: prevent duplicate block replies on text_end channels so providers that emit explicit text-end boundaries no longer double-send the same final message. (#61530)
  • Gateway/startup: default gateway.mode to local when unset, detect PID recycling in gateway lock files on Windows and macOS, and show startup progress so healthy restarts stop getting blocked by stale locks. (#54801, #60085, #59843) Thanks @BradGroux and @TonyDerek-dot.
  • Gateway/macOS: let launchd KeepAlive own in-process gateway restarts again, adding a short supervised-exit delay so rapid restarts avoid launchd crash-loop unloads while openclaw gateway restart still reports real LaunchAgent errors synchronously.
  • Gateway/macOS: re-bootstrap the LaunchAgent if launchctl kickstart -k unloads it during restart so failed restarts do not leave the gateway unmanaged until manual repair.
  • Gateway/macOS: recover installed-but-unloaded LaunchAgents during openclaw gateway start and restart, while still preferring live unmanaged gateways during restart recovery. (#43766) Thanks @HenryC-3.
  • Gateway/Windows scheduled tasks: preserve Task Scheduler settings on reinstall, fail loudly when /Run does not start, and report fast failed restarts accurately instead of pretending they timed out after 60 seconds. (#59335) Thanks @tmimmanuel.
  • Windows/restart: fall back to the installed Startup-entry launcher when the scheduled task was never registered, so /restart can relaunch the gateway on Windows setups where schtasks install fell back during onboarding. (#58943) Thanks @imechZhangLY.
  • Windows/restart: clean up stale gateway listeners before Windows self-restart and treat listener and argv probe failures as inconclusive, so scheduled-task relaunch no longer falls into an EADDRINUSE retry loop. (#60480) Thanks @arifahmedjoy.
  • Update/npm: prefer the npm binary that owns the installed global OpenClaw prefix so mixed Homebrew-plus-nvm setups update the right install. (#60153) Thanks @jayeshp19.
  • Agents/music and video generation: add tools.media.asyncCompletion.directSend as an opt-in direct-delivery path for finished async media tasks, while keeping the legacy requester-session wake/model-delivery flow as the default.
  • CLI/skills JSON: route skills list --json, skills info --json, and skills check --json output to stdout instead of stderr so machine-readable consumers receive JSON on the expected stream again. (#60914; fixes #57599; landed from contributor PR #57611 by @Aftabbs) Thanks @Aftabbs.
  • CLI/Commander: preserve Commander-computed exit codes for argument and help-error paths, and cover the user-argv parse mode in the regression tests so invalid CLI invocations no longer report success when exits are intercepted. (#60923) Thanks @Linux2010.
  • Cron: replay interrupted recurring jobs on the first gateway restart instead of waiting for a second restart. (#60583) Thanks @joelnishanth.
  • Cron: send failure notifications through the job's primary delivery channel using the same session context as successful delivery when no explicit failureDestination is configured. (#60622) Thanks @artwalker.
  • Exec/remote skills: stop advertising exec host=node when the current exec policy cannot route to a node, and clarify blocked exec-host override errors with both the requested host and allowed config path.
  • Agents/Claude CLI/security: clear inherited Claude Code config-root and plugin-root env overrides like CLAUDE_CONFIG_DIR and CLAUDE_CODE_PLUGIN_*, so OpenClaw-launched Claude CLI runs cannot be silently pointed at an alternate Claude config/plugin tree with different hooks, plugins, or auth context. Thanks @vincentkoc.
  • Agents/Claude CLI/security: clear inherited Claude Code provider-routing and managed-auth env overrides, and mark OpenClaw-launched Claude CLI runs as host-managed, so Claude CLI backdoor sessions cannot be silently redirected to proxy, Bedrock, Vertex, Foundry, or parent-managed token contexts. Thanks @vincentkoc.
  • Agents/Claude CLI/security: force host-managed Claude CLI backdoor runs to --setting-sources user, even under custom backend arg overrides, so repo-local .claude project/local settings, hooks, and plugin discovery do not silently execute inside non-interactive OpenClaw sessions. Thanks @vincentkoc.
  • Agents/Claude CLI: treat malformed bare --permission-mode backend overrides as missing and fail safe back to bypassPermissions, so custom cliBackends.claude-cli.args security config cannot accidentally consume the next flag as a bogus permission mode. Thanks @vincentkoc.
  • Gateway/device pairing: require non-admin paired-device sessions to manage only their own device for token rotate/revoke and paired-device removal, blocking cross-device token theft inside pairing-scoped sessions. (#50627) Thanks @coygeek.
  • Gateway/plugin routes: keep gateway-auth plugin runtime routes on write-only fallback scopes unless a trusted-proxy caller explicitly declares narrower x-openclaw-scopes, so plugin HTTP handlers no longer mint admin-level runtime scopes on missing or untrusted HTTP scope headers. (#59815) Thanks @pgondhi987.
  • Build/types: fix the Node createRequire(...) helper typing so provider-runtime lazy loads compile cleanly again and pnpm build no longer fails in the Pi embedded provider error-pattern path.
  • Gateway/security: scope loopback browser-origin auth throttling by normalized origin so one localhost Control UI tab cannot lock out a different localhost browser origin after repeated auth failures.
  • Gateway/auth: serialize async shared-secret auth attempts per client so concurrent Tailscale-capable failures cannot overrun the intended auth rate-limit budget. Thanks @Telecaster2147.
  • Device pairing/security: keep non-operator device scope checks bound to the requested role prefix so bootstrap verification cannot redeem operator.* scopes through node auth. (#57258) Thanks @jlapenna.
  • Device pairing: reject rotating device tokens into roles that were never approved during pairing, and keep reconnect role checks bounded to the paired device's approved role set. (#60462) Thanks @eleqtrizit.
  • Gateway/device auth: reuse cached device-token scopes only for cached-token reconnects, while keeping explicit deviceToken scope requests and empty-cache fallbacks intact so reconnects preserve operator.read without breaking explicit auth flows. (#46032) Thanks @caicongyang.
  • Mobile pairing/security: fail closed for internal /pair setup-code issuance, cleanup, and approval paths when gateway pairing scopes are missing, and keep approval-time requested-scope enforcement on the internal command path. (#55996) Thanks @coygeek.
  • Mobile pairing/bootstrap: keep QR bootstrap handoff tokens bounded to the mobile-safe contract so node handoff stays unscoped and operator handoff drops mixed node.*, operator.admin, and operator.pairing scopes.
  • Mobile pairing/Android: tighten secure endpoint handling so Tailscale and public remote setup reject cleartext endpoints, private LAN pairing still works, merged-role approvals mint both node and operator device tokens, and bootstrap tokens survive node auto-pair until operator approval finishes. (#60128, #60208, #60221) Thanks @obviyus.
  • Android/canvas security: require exact normalized A2UI URL matches before forwarding canvas bridge actions, rejecting query mismatches and descendant paths while still allowing fragment-only A2UI navigation.
  • Synology Chat/security: default low-level HTTPS helper TLS verification to on so helper/API defaults match the shipped safe account default, and only explicit allowInsecureSsl: true opts out.
  • Synology Chat/security: route webhook token comparison through the shared constant-time secret helper for consistency with other bundled plugins.
  • Plugins/marketplace: block remote marketplace symlink escapes without breaking ordinary local marketplace install paths. (#60556) Thanks @eleqtrizit.
  • Telegram/local Bot API: honor channels.telegram.apiRoot for buffered media downloads, add channels.telegram.network.dangerouslyAllowPrivateNetwork for trusted fake-IP setups, and require channels.telegram.trustedLocalFileRoots before reading absolute Bot API file_path values. (#59544, #60705) Thanks @SARAMALI15792 and @obviyus.
  • Outbound/sanitizer: strip leaked <tool_call>, <function_calls>, and model special tokens from shared user-visible assistant text, including truncated tool-call streams, so internal scaffolding no longer bleeds into replies across surfaces. (#60619) Thanks @oliviareid-svg.
  • Agents/errors: surface an explicit disk-full message when local session or transcript writes fail with ENOSPC/disk full, so those runs stop degrading into opaque NO_REPLY-style failures. Thanks @vincentkoc.
  • Exec approvals: remove heuristic command-obfuscation gating from host exec so gateway and node runs rely on explicit policy, allowlist, and strict inline-eval rules only.
  • Agents/tool results: cap live tool-result persistence and overflow-recovery truncation at 40k characters so oversized tool output stays bounded without discarding recent context entirely.
  • Discord/video replies: split text-plus-video deliveries into a text reply followed by a media-only send, and let live provider auth checks honor manifest-declared API key env vars like MODELSTUDIO_API_KEY.
  • Config/All Settings: keep the raw config view intact when sensitive fields are blank instead of corrupting or dropping the rendered snapshot. (#28214) Thanks @solodmd.
  • Plugin SDK/facades: back-fill bundled plugin facade sentinels before plugin-id tracking re-enters config loading, so CLI/provider startup no longer crashes with shouldNormalizeGoogleProviderConfig is not a function or other empty-facade reads during bundled plugin re-entry. Thanks @adam91holt.
  • Plugins/facades: back-fill facade sentinels before tracked-plugin resolution re-enters config loading, so facade exports stay defined during circular provider normalization. (#61180) Thanks @adam91holt.
  • QA lab: restore typed mock OpenAI gateway config wiring so QA-lab config helpers compile cleanly again and pnpm check / pnpm build stay green.
  • Discord/image generation: include the real generated MEDIA: paths in tool output and avoid duplicate plain-output media requeueing so Discord image replies stop pointing at missing local files.
  • Slack: route live DM replies back to the concrete inbound DM channel while keeping persisted routing metadata user-scoped, so normal assistant replies stop disappearing when pairing and system messages still arrive. (#59030) Thanks @afurm.
  • Discord/reply tags: strip leaked [[reply_to_current]] control tags from preview text and honor explicit reply-tag threading during final delivery, so Discord replies stay attached to the triggering message instead of printing reply metadata into chat.
  • Telegram: fix current-model checks in the model picker, HTML-format non-default /model confirmations, explicit topic replies, persisted reaction ownership across restarts, caption-media placeholder and file_id preservation on download failure, and upgraded-install inbound image reads. (#60384, #60042, #59634, #59207, #59948, #59971) Thanks @sfuminya, @GitZhangChi, @dashhuang, @samzong, @v1p0r, and @neeravmakwana.
  • Telegram: restore DM voice-note preflight transcription so direct-message audio stops arriving as raw <media:audio> placeholders. (#61008) Thanks @manueltarouca.
  • Telegram/reasoning: only create a Telegram reasoning preview lane when the session is explicitly reasoning:stream, so hidden <think> traces from streamed replies stop surfacing as chat previews on normal sessions. Thanks @vincentkoc.
  • Telegram/native command menu: trim long menu descriptions before dropping commands so sub-100 command sets can still fit Telegram's payload budget and keep more / entries visible. (#61129) Thanks @neeravmakwana.
  • Feishu/reasoning: only expose streamed reasoning previews when the session is explicitly reasoning:stream, so hidden reasoning traces do not surface on normal streaming sessions. Thanks @vincentkoc.
  • Discord: keep REST, webhook, and monitor traffic on the configured proxy, preserve component-only media sends, honor @everyone and @here mention gates, keep ACK reactions on the active account, and split voice connect/playback timeouts so auto-join is more reliable. (#57465, #60361, #60345) Thanks @geekhuashan.
  • WhatsApp: restore channels.whatsapp.blockStreaming and reset watchdog timeouts after reconnect so quiet chats stop falling into reconnect loops. (#60007, #60069) Thanks @MonkeyLeeT and @mcaxtr.
  • Memory: keep memory-core builtin embedding registration on the already-registered path so selecting memory-core no longer recurses through plugin discovery and crashes during startup. (#61402) Thanks @ngutman.
  • Agents/tool results: keep large read outputs visible longer, preserve the latest read output when older tool output can absorb the overflow budget, and fall back to Pi's normal overflow compaction/retry path before replacing a fresh read with a compacted stub. Thanks @vincentkoc.
  • Memory/QMD: prefer modern qmd collection add --glob, accept newer single-line JSON hit metadata while keeping legacy line fields, refresh QMD docs/doctor install guidance and model-override guidance, and keep older QMD releases working. Thanks @vincentkoc.
  • MS Teams: download inline DM images via Graph API and preserve channel reply threading in proactive fallback. (#52212, #55198) Thanks @Ted-developer and @hyojin.
  • MS Teams: replace the deprecated Teams SDK HttpPlugin stub with httpServerAdapter so recurring gateway deprecation warnings stop firing and the Express 5 compatibility workaround stays on the supported SDK path. (#60939) Thanks @coolramukaka-sys.
  • Matrix/exec approvals: anchor seeded approval reactions to the primary Matrix prompt event, resolve them from event metadata instead of prompt text, and clean up chunked approval prompts correctly. (#60931) Thanks @gumadeiras.
  • Matrix: recover more reliably when secret storage or recovery keys are missing by recreating secret storage during repair and backup reset, hold crypto snapshot locks during persistence, and surface explicit too-large attachment markers. (#59846, #59851, #60599, #60289) Thanks @al3mart, @emonty, and @efe-arv.
  • Android/Talk Mode: cancel in-flight talk.speak playback when speech is explicitly stopped, so stale replies stop starting after barge-in or manual stop. (#61164) Thanks @obviyus.
  • Android/Talk Mode: restore spoken assistant replies on node-scoped sessions by keeping reply routing synced to the resolved node session key and pausing mic capture during reply playback. (#60306) Thanks @MKV21.
  • Android/Talk Mode: restore voice replies on gateway-backed talk mode sessions by updating embedded runner transport overrides to the current agent transport API. (#61214) Thanks @obviyus.
  • Voice-call/OpenAI: pass full plugin config into realtime transcription provider resolution so streaming calls can discover the bundled OpenAI realtime transcription provider again. Fixes #60936. Thanks @sliekens and @vincentkoc.
  • Control UI/chat: add a per-session thinking-level picker in the chat header and mobile chat settings, and keep the browser bundle on UI-local thinking/session-key helpers so Safari no longer crashes on Node-only imports before rendering chat controls.
  • Control UI: keep Stop visible during tool-only execution, preserve pending-send busy state, and clear stale ClawHub search results as soon as the query changes. (#54528, #59800, #60267) Thanks @chziyue and @frankekn.
  • Control UI/avatar: honor ui.assistant.avatar when serving /avatar/:agentId so Appearance UI avatar paths stop falling back to initials placeholders. (#60778) Thanks @hannasdev.
  • Control UI/cron: highlight the Cron refresh button while refresh is in flight so the page's loading state stays visible even when prior data remains on screen. (#60394) Thanks @coder-zhuzm.
  • Control UI/Overview: prevent gateway access token/password visibility toggle buttons from overlapping their inputs at narrow widths. (#56924) Thanks @bbddbb1.
  • CLI/skills JSON: route skills list --json, skills info --json, and skills check --json output to stdout instead of stderr so machine-readable consumers receive JSON on the expected stream again. (#60914; fixes #57599; landed from contributor PR #57611 by @Aftabbs) Thanks @Aftabbs.
  • CLI/Commander: preserve Commander-computed exit codes for argument and help-error paths, and cover the user-argv parse mode in the regression tests so invalid CLI invocations no longer report success when exits are intercepted. (#60923) Thanks @Linux2010.
  • Cron: replay interrupted recurring jobs on the first gateway restart instead of waiting for a second restart. (#60583) Thanks @joelnishanth.
  • Cron: send failure notifications through the job's primary delivery channel using the same session context as successful delivery when no explicit failureDestination is configured. (#60622) Thanks @artwalker.
  • Live model switching: only treat explicit user-driven model changes as pending live switches, so fallback rotation, heartbeat overrides, and compaction no longer trip LiveSessionModelSwitchError before making an API call. (#60266) Thanks @kiranvk-2011.
  • Exec approvals: reuse durable exact-command allow-always approvals in allowlist mode so identical reruns stop prompting, and tighten Windows interpreter/path approval handling so wrapper and malformed-path cases fail closed more consistently. (#59880, #59780, #58040, #59182) Thanks @luoyanglang, @SnowSky1, and @pgondhi987.
  • Node exec approvals: keep node-host system.run approvals bound to the prepared execution plan across async forwarding, so mutable script operands still get approval-time binding and drift revalidation instead of dropping back to unbound execution.
  • Agents/exec approvals: let exec-approvals.json agent security override stricter gateway tool defaults so approved subagents can use security: “full” without falling back to allowlist enforcement again. (#60310) Thanks @lml2468.
  • Agents/exec: restore host=node routing for node-pinned and host=auto sessions, while still blocking sandboxed auto sessions from jumping to gateway. (#60788) Thanks @openperf.
  • Exec/heartbeat: use the canonical exec-event wake reason for notifyOnExit so background exec completions still trigger follow-up turns when HEARTBEAT.md is empty or comments-only. (#41479) Thanks @rstar327.
  • Heartbeat: skip wake delivery when the target session lane is already busy so the pending event is retried instead of getting drained too early. (#40526) Thanks @lucky7323.
  • Group chats/agent prompts: tell models to minimize empty lines and use normal chat-style spacing so group replies avoid document-style blank-line formatting.
  • Providers/OpenAI GPT: treat short approval turns like ok do it and go ahead as immediate action turns, and trim overly memo-like GPT-5 chat confirmations so OpenAI replies stay shorter and more conversational by default.
  • Providers/OpenAI Codex: split native contextWindow from runtime contextTokens, keep the default effective cap at 272000, and expose a per-model contextTokens override on models.providers.*.models[].
  • Providers/OpenAI-compatible WS: compute fallback token totals from normalized usage when providers omit or zero total_tokens, so DashScope-compatible sessions stop storing zero totals after alias normalization. (#54940) Thanks @lyfuci.
  • Agents/OpenAI: mark Claude-compatible file tool schemas as additionalProperties: false so direct OpenAI GPT-5 routes stop rejecting the read tool with invalid strict-schema errors.
  • Agents/OpenAI: fall back to strict: false for native OpenAI tool calls when a tool schema is not strict-compatible, and normalize empty-object tool schemas to include required: [], so direct GPT-5 routes stop failing with invalid strict-schema errors like missing path in required.
  • Agents/GPT: add explicit work-item lifecycle events for embedded runs, use them to surface real progress more reliably, and stop counting tool-started turns as planning-only retries.
  • Plugins/OpenAI: enable gpt-image-1 reference-image edits through /images/edits multipart uploads, and stop inferring unsupported resolution overrides when no explicit size or resolution is provided.
  • Agents/replay: remove the malformed assistant-content canonicalization repair from replay history sanitization instead of extending that legacy repair path into replay validation.
  • Plugins/OpenAI: tune the OpenAI prompt overlay for live-chat cadence so GPT replies stay shorter, more human, and less wall-of-text by default.
  • Providers/compat: stop forcing OpenAI-only defaults on proxy and custom OpenAI-compatible routes, preserve native vendor-specific reasoning/tool/streaming behavior across Anthropic-compatible, Moonshot, Mistral, ModelStudio, OpenRouter, xAI, and Z.ai endpoints, and route GitHub Copilot Claude models through Anthropic Messages instead of OpenAI Responses.
  • Providers/GitHub Copilot: send IDE identity headers on runtime model requests and GitHub token exchange so IDE-authenticated Copilot runs stop failing with missing Editor-Version. (#60641) Thanks @VACInc and @vincentkoc.
  • Providers/OpenRouter failover: classify 403 “Key limit exceeded” spending-limit responses as billing so model fallback continues instead of stopping on generic auth. (#59892) Thanks @rockcent.
  • Providers/Anthropic: keep claude-cli/* auth on live Claude CLI credentials at runtime, avoid persisting stale bearer-token profiles, and suppress macOS Keychain prompts during non-interactive Claude CLI setup. (#61234) Thanks @darkamenosa.
  • Providers/Anthropic: when Claude CLI auth becomes the default, write a real claude-cli auth profile so local and gateway agent runs can use Claude CLI immediately without missing-API-key failures. Thanks @vincentkoc.
  • Providers/Anthropic Vertex: honor cacheRetention: “long” with the real 1-hour prompt-cache TTL on Vertex AI endpoints, and default anthropic-vertex cache retention like direct Anthropic. (#60888) Thanks @affsantos.
  • Agents/Anthropic: preserve native toolu_* replay ids on direct Anthropic and Anthropic Vertex paths so cache-sensitive history stops rewriting known-valid Anthropic tool-use ids. (#52612)
  • Providers/Google: add model-level cacheRetention support for direct Gemini system prompts by creating, reusing, and refreshing cachedContents automatically on Google AI Studio runs. (#51372) Thanks @rafaelmariano-glitch.
  • Google Gemini CLI auth: detect bundled npm installs by scanning packaged bundle files for the Gemini OAuth client config, so npm install -g @google/gemini-cli layouts work again. (#60486) Thanks @wzfmini01.
  • Google Gemini CLI auth: detect personal OAuth mode from local Gemini settings and skip Code Assist project discovery for those logins, so personal Google accounts stop failing with loadCodeAssist 400 Bad Request. (#49226) Thanks @bobworrall.
  • Google Gemini CLI auth: improve OAuth credential discovery across Windows nvm and Homebrew libexec installs, and align Code Assist metadata so Gemini login stops failing on packaged CLI layouts. (#40729) Thanks @hughcube.
  • Google Gemini CLI models: add forward-compat support for stable gemini-2.5-* model ids by letting the bundled CLI provider clone them from Google templates, so gemini-2.5-flash-lite and related configured models stop showing up as missing. (#35274) Thanks @mySebbe.
  • Google image generation: disable pinned DNS for Gemini image requests and honor explicit pinDns overrides in shared provider HTTP helpers so proxy-backed image generation works again. (#59873) Thanks @luoyanglang.
  • Providers/Microsoft Foundry: preserve explicit image capability on normalized Foundry deployments, repair stale GPT/o-series text-only model metadata across gateway and runtime paths, and keep unknown fallback models from borrowing unrelated image support.
  • Providers/Model Studio: preserve native streaming usage reporting for DashScope-compatible endpoints even when they are configured under a generic provider key, so streamed token totals stop sticking at zero. (#52395) Thanks @IVY-AI-gif.
  • Providers/Z.AI: preserve explicitly registered glm-5-* variants like glm-5-turbo instead of intercepting them with the generic GLM-5 forward-compat shim. (#48185) Thanks @haoyu-haoyu.
  • Amazon Bedrock/aws-sdk auth: stop injecting the fake AWS_PROFILE apiKey marker when no AWS auth env vars exist, so instance-role and other default-chain setups keep working without poisoning provider config. (#61194) Thanks @wirjo.
  • Agents/Kimi tool-call repair: preserve tool arguments that were already present on streamed tool calls when later malformed deltas fail reevaluation, while still dropping stale repair-only state before toolcall_end.
  • Plugins/Kimi Coding: parse tagged tool calls and keep Anthropic-native tool payloads so Kimi coding endpoints execute tools instead of echoing raw markup. (#60051, #60391) Thanks @obviyus and @Eric-Guo.
  • Media understanding: auto-register image-capable config providers for vision routing, so custom GLM-style provider ids with image models stop failing with “no media-understanding provider registered”. (#51418) Thanks @xydt-610.
  • Plugins/media understanding: enable bundled Groq and Deepgram providers by default so configured transcription models work without extra plugin activation config. (#59982) Thanks @yxjsxy.
  • MiniMax/pricing: keep bundled MiniMax highspeed pricing distinct in provider catalogs and preserve the lower M2.5 cache-read pricing when onboarding older MiniMax models. (#54214) Thanks @octo-patch.
  • MiniMax: advertise image input on bundled MiniMax-M2.7 and MiniMax-M2.7-highspeed model definitions so image-capable flows can route through the M2.7 family correctly. (#54843) Thanks @MerlinMiao88888888.
  • Models/MiniMax: honor MINIMAX_API_HOST for implicit bundled MiniMax provider catalogs so China-hosted API-key setups pick api.minimaxi.com/anthropic without manual provider config. (#34524) Thanks @caiqinghua.
  • Usage/MiniMax: invert remaining-style usage_percent fields when MiniMax reports only remaining percentage data, so usage bars stop showing nearly-full remaining quota as nearly-exhausted usage. (#60254) Thanks @jwchmodx.
  • Usage/MiniMax: let usage snapshots treat minimax-portal and MiniMax CN aliases as the same MiniMax quota surface, and prefer stored MiniMax OAuth before falling back to Coding Plan keys.
  • Usage/MiniMax: prefer the chat-model model_remains entry and derive Coding Plan window labels from MiniMax interval timestamps so MiniMax usage snapshots stop picking zero-budget media rows and misreporting 4h windows as 5h. (#52349) Thanks @IVY-AI-gif.
  • Model picker/providers: treat bundled BytePlus and Volcengine plan aliases as their native providers during setup, and expose their bundled standard/coding catalogs before auth so setup can suggest the right models. (#58819) Thanks @Luckymingxuan.
  • Tools/web_search (Kimi): when tools.web.search.kimi.baseUrl is unset, inherit native Moonshot chat baseUrl (.ai / .cn) so China console keys authenticate on the same host as chat. Fixes #44851. (#56769) Thanks @tonga54.
  • Agents/Claude CLI: keep non-interactive --permission-mode bypassPermissions when custom cliBackends.claude-cli.args override defaults, including fallback resolution before the runtime plugin registry is active, so cron and heartbeat Claude CLI runs do not regress to interactive approval mode. (#61114) Thanks @cathrynlavery and @thewilloftheshadow.
  • Agents/Claude CLI: persist explicit openclaw agent --session-id runs under a stable session key so follow-ups can reuse the stored CLI binding and resume the same underlying Claude session.
  • Agents/Claude CLI: persist routed Claude session bindings, rotate them on /new and /reset, and keep live Claude CLI model switches moving across the configured Claude family so resumed sessions follow the real active thread and model. Thanks @vincentkoc.
  • Agents/CLI backends: invalidate stored CLI session reuse when local CLI login state or the selected auth profile credential changes, so relogin and token rotation stop resuming stale sessions.
  • Agents/Claude CLI/images: reuse stable hydrated image file paths and preserve shared media extensions like HEIC when passing image refs to local CLI runs, so Claude CLI image prompts stop thrashing KV cache prefixes and oddball image formats do not fall back to .bin. Thanks @vincentkoc.
  • Agents/compaction: keep assistant tool calls and displaced tool results in the same compaction chunk so strict summarization providers stop rejecting orphaned tool pairs. (#58849) Thanks @openperf.
  • Agents/failover: scope Anthropic An unknown error occurred failover matching by provider so generic internal unknown-error text no longer triggers retryable timeout fallback. (#59325) Thanks @aaron-he-zhu.
  • Agents/subagents: honor allowlist validation, auth-profile handoff, and session override state when a subagent retries after LiveSessionModelSwitchError. (#58178) Thanks @openperf.
  • Agents/runtime: make default subagent allowlists, inherited skills/workspaces, and duplicate session-id resolution behave more predictably, and include value-shape hints in missing-parameter tool errors. (#59944, #59992, #59858, #55317) Thanks @hclsys, @gumadeiras, @joelnishanth, and @priyansh19.
  • Agents/pairing: merge completion announce delivery context with the requester session fallback so missing to still reaches the original channel, and include operator.talk.secrets in CLI default operator scopes for node-role device pairing approvals. (#56481) Thanks @maxpetrusenko.
  • Agents/scheduling: steer background-now work toward automatic completion wake and treat process polling as on-demand inspection or intervention instead of default completion handling. (#60877) Thanks @vincentkoc.
  • Agents/skills: skip .git and node_modules when mirroring skills into sandbox workspaces so read-only sandboxes do not copy repo history or dependency trees. (#61090) Thanks @joelnishanth.
  • ACP/agents: inherit the target agent workspace for cross-agent ACP spawns and fall back safely when the inherited workspace no longer exists. (#58438) Thanks @zssggle-rgb.
  • ACPX/Windows: preserve backslashes and absolute .exe paths in Claude CLI parsing, and fail fast on wrapper-script targets with guidance to use cmd.exe /c, powershell.exe -File, or node <script>. (#60689) Thanks @steipete.
  • Auth/failover: persist selected fallback overrides before retrying, shorten auth_permanent lockouts, and refresh websocket/shared-auth sessions only when real auth changes occur so retries and secret rotations behave predictably. (#60404, #60323, #60387) Thanks @extrasmall0 and @mappel-nv.
  • Gateway/channels: pin the initial startup channel registry before later plugin-registry churn so configured channels stay visible and channels.status stops falling back to empty channelOrder / channels payloads after runtime plugin loads.
  • Prompt caching: order stable workspace project-context files before HEARTBEAT.md and keep HEARTBEAT.md below the system-prompt cache boundary so heartbeat churn does not invalidate the stable project-context prefix. (#58979) Thanks @yozu and @vincentkoc.
  • Prompt caching: route Codex Responses and Anthropic Vertex through boundary-aware cache shaping, and report the actual outbound system prompt in cache traces so cache reuse and misses line up with what providers really receive. Thanks @vincentkoc.
  • Agents/cache: preserve the full 3-turn prompt-cache image window across tool loops, keep colliding bundled MCP tool definitions deterministic, and reapply Anthropic Vertex cache shaping after payload hook replacements so KV/cache reuse stays stable. Thanks @vincentkoc.
  • Status/cache: restore cacheRead and cacheWrite in transcript fallback so /status keeps showing cache hit percentages when session logs are the only complete usage source. (#59247) Thanks @stuartsy.
  • Status/usage: let /status and session_status fall back to transcript token totals when the session meta store stayed at zero, so LM Studio, Ollama, DashScope, and similar OpenAI-compatible providers stop showing Context: 0/.... (#55041) Thanks @jjjojoj.
  • Mattermost/config schema: accept groups.*.requireMention again so existing Mattermost configs no longer fail strict validation after upgrade. (#58271) Thanks @MoerAI.
  • Doctor/config: compare normalized talk configs by deep structural equality instead of key-order-sensitive serialization so openclaw doctor --fix stops repeatedly reporting/applying no-op talk.provider/providers normalization. (#59911) Thanks @ejames-dev.
  • Anthropic CLI onboarding: rewrite migrated fallback model refs during non-interactive Claude CLI setup too, so onboarding and scripted setup no longer keep stale anthropic/* fallbacks after switching the primary model to claude-cli/*. Thanks @vincentkoc.
  • Models/Anthropic CLI auth: replace migrated agents.defaults.models allowlists when openclaw models auth login --provider anthropic --method cli --set-default switches to claude-cli/*, so stale anthropic/* entries do not linger beside the migrated Claude CLI defaults. Thanks @vincentkoc.
  • Doctor/Claude CLI: add dedicated Claude CLI health checks so openclaw doctor can spot missing local installs or broken auth before agent runs fail. Thanks @vincentkoc.
  • Plugins/auth-choice: apply provider-owned auth config patches without recursively preserving replaced default-model maps, so Anthropic Claude CLI and similar migrations can intentionally swap model allowlists during onboarding and setup instead of accumulating stale entries. Thanks @vincentkoc.
  • Plugins/onboarding: write dotted plugin uiHint paths like Brave webSearch.mode as nested plugin config so llm-context setup stops failing validation. (#61159) Thanks @obviyus.
  • Plugins/install: preserve unsafe override flags across linked plugin and hook-pack probes so local --link installs honor the documented override behavior. (#60624) Thanks @JerrettDavis.
  • Plugins/cache: inherit the active gateway workspace for provider, web-search, and web-fetch snapshot loads when callers omit workspaceDir, so compatible plugin registries and snapshot caches stop missing on gateway-owned runtime paths. (#61138) Thanks @jzakirov.
  • Plugin SDK/context engines: export the missing context-engine result and subagent lifecycle types from openclaw/plugin-sdk so context engine plugins can type ContextEngine implementations without local workarounds. (#61251) Thanks @DaevMithran.
  • Tasks/maintenance: reconcile stale cron and chat-backed CLI task rows against live cron-job and agent-run ownership instead of treating any persisted session key as proof that the task is still running. (#60310) Thanks @lml2468.
  • Plugins: suppress trust-warning noise during non-activating snapshot and CLI metadata loads. (#61427) Thanks @gumadeiras.
  • Agents/video generation: accept agents.defaults.videoGenerationModel in strict config validation and openclaw config set/get, so gateways using video_generate no longer fail to boot after enabling a video model.
  • Matrix/streaming: add a quiet preview mode for streamed Matrix replies, keep legacy partial preview-first behavior, and finalize quiet media captions correctly so previews stop notifying early without dropping final text semantics. (#61450) Thanks @gumadeiras.
  • Gateway/shutdown: bound websocket-server shutdown even when no tracked clients remain, so gateway restarts stop hanging until the watchdog kills the process. (#61565) Thanks @mbelinky.
  • Control UI/multilingual: localize the remaining shared channel, instances, nodes, and gateway-confirmation strings so the dashboard stops mixing translated UI with hardcoded English labels. Thanks @vincentkoc.
  • Discord/media: raise the default inbound and outbound media cap to 100MB so Discord matches Telegram more closely and larger attachments stop failing on the old low default.
  • Matrix: keep direct transport requests on the pinned dispatcher by routing them through undici runtime fetch, so Matrix clients resume syncing on newer runtimes without dropping the validated address binding. (#61595) Thanks @gumadeiras.
  • Plugins/facades: resolve globally installed bundled-plugin runtime facades from registry roots so bundled channels like LINE still boot when the winning plugin install lives under the global extensions directory with an encoded scoped folder name. (#61297) Thanks @openperf.

Don't miss a new openclaw release

NewReleases is sending notifications on new releases.