Agent Zero Release Notes
Office & Document Handling
- ODF-first document defaults — ODT, ODS, and ODP are now the primary formats for Writer, Spreadsheet, and Presentation artifacts. OOXML (DOCX, XLSX, PPTX) remains available as an explicit compatibility option. Includes full ODF package generation, validation, and read/edit support.
- Unified Office canvas controls — Active-file header shown in both canvas and modal views with a shared "+ New" menu, inline Save, and Rename/Close File from a document dropdown.
- Live document reload after edits — LibreOffice-backed documents now close and reopen the visible window after artifact edits instead of relying on a canvas repaint. Cold-start sessions from tool results are also supported.
- Reduced automatic document triggering — Meta-discussions about generated files no longer create artifacts; explicit file and canvas requests still work as expected.
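As a rough illustration of what ODF package generation and validation involve at the container level, here is a minimal sketch using only the Python standard library. It relies on the OpenDocument packaging rule that a package is a zip archive whose first entry is an uncompressed file named mimetype, alongside a META-INF/manifest.xml. The function names build_minimal_package and check_package are hypothetical and are not Agent Zero's actual implementation. The skeleton it builds is not a complete, openable document.

```python
import io
import zipfile

# Media types defined by the OpenDocument specification for the three formats above.
ODF_MIMETYPES = {
    "odt": "application/vnd.oasis.opendocument.text",
    "ods": "application/vnd.oasis.opendocument.spreadsheet",
    "odp": "application/vnd.oasis.opendocument.presentation",
}

MANIFEST_TEMPLATE = (
    '<?xml version="1.0" encoding="UTF-8"?>\n'
    '<manifest:manifest '
    'xmlns:manifest="urn:oasis:names:tc:opendocument:xmlns:manifest:1.0" '
    'manifest:version="1.2">\n'
    ' <manifest:file-entry manifest:full-path="/" manifest:media-type="{media_type}"/>\n'
    '</manifest:manifest>\n'
)


def build_minimal_package(kind: str) -> bytes:
    """Hypothetical helper: build a skeletal ODF-style zip container."""
    media_type = ODF_MIMETYPES[kind]
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        # The mimetype entry is written first and stored without compression.
        zf.writestr("mimetype", media_type, compress_type=zipfile.ZIP_STORED)
        zf.writestr("META-INF/manifest.xml",
                    MANIFEST_TEMPLATE.format(media_type=media_type))
    return buf.getvalue()


def check_package(data: bytes) -> list:
    """Hypothetical helper: return a list of container-level problems (empty if none)."""
    problems = []
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        entries = zf.infolist()
        if not entries or entries[0].filename != "mimetype":
            problems.append("mimetype must be the first entry")
            return problems
        if entries[0].compress_type != zipfile.ZIP_STORED:
            problems.append("mimetype must be stored uncompressed")
        media_type = zf.read("mimetype").decode("ascii", errors="replace")
        if media_type not in ODF_MIMETYPES.values():
            problems.append("unexpected media type: " + media_type)
        if "META-INF/manifest.xml" not in zf.namelist():
            problems.append("missing META-INF/manifest.xml")
    return problems
```

A check like this only covers the zip container. Validating content.xml, styles, and the rest of the package against the ODF schema is a separate and much larger step.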
Linux Desktop
- Persistent Desktop lifecycle — A single Xpra Desktop iframe stays alive across canvas, modal, and keepalive hosts. Explicit shutdown is distinguished from crashes via a new shutdown/restart state and an XFCE panel "Shutdown Desktop" launcher (requires confirmation click). Unsafe logout, lock, and switch-user affordances are hidden.
- Desktop state controls — New desktop_state helper, expanded desktopctl observe-act-verify commands, and Xpra bridge diagnostics. Agents now prefer structured/app-native/keyboard workflows and treat coordinate clicks as a last resort.
- Generalized CLI agent guidance — The Desktop skill now distinguishes shell prompts from target CLI prompts and uses a generic nested CLI-agent launch pattern (TARGET_CLI/FALLBACK_CMD) instead of tool-specific instructions.
Browser
- Stable modal switching — Switching from Desktop to Browser reuses existing sessions instead of opening a blank tab. A new Focus mode control in the Browser modal header matches Desktop's fullscreen/restore behavior.
- Explicit screenshot and form actions — New browser:screenshot action writes JPEG/PNG files for vision_load. Extended agent-callable input actions and a browser-forms on-demand skill for label-wrapped form controls.
UI & UX
- Unlimited canvas sizing — The fixed right-canvas width cap is removed; the panel can now shrink to zero or grow across the full workspace.
- Surface-switch buttons in modals — Browser/Desktop switcher buttons appear in modal headers using the same registered surface metadata as canvas controls.
- Time Travel modal alignment — Time Travel now uses the standard centered modal shell and shared backdrop, matching Settings behavior.
- Bash-style chat input history — Up/Down arrow navigation through previous chat inputs (based on a community contribution). The Down arrow only triggers history navigation when the caret is at end-of-text, preserving normal multiline cursor movement.
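The history behavior can be modeled as a small state machine. The sketch below is illustrative only: the class name InputHistory and its methods are hypothetical, the real feature lives in the web UI rather than in Python, and the release note specifies only the Down-arrow caret rule. Treating Up as unconditional and restoring the unsent draft when stepping past the newest entry are assumptions.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class InputHistory:
    """Hypothetical model of Bash-style chat input history."""
    entries: List[str] = field(default_factory=list)
    index: Optional[int] = None  # None means the user is editing a fresh draft
    draft: str = ""              # unsent text saved when navigation starts

    def submit(self, text: str) -> None:
        # Record a sent message and reset navigation state.
        if text.strip():
            self.entries.append(text)
        self.index = None
        self.draft = ""

    def up(self, current_text: str) -> str:
        # Assumption: Up always moves toward older entries.
        if not self.entries:
            return current_text
        if self.index is None:
            self.draft = current_text
            self.index = len(self.entries) - 1
        elif self.index > 0:
            self.index -= 1
        return self.entries[self.index]

    def down(self, current_text: str, caret: int) -> str:
        # Down navigates history only when the caret sits at end-of-text,
        # so moving between lines of a multiline input is left alone.
        if self.index is None or caret != len(current_text):
            return current_text
        if self.index < len(self.entries) - 1:
            self.index += 1
            return self.entries[self.index]
        # Past the newest entry: return to the saved draft (assumption).
        self.index = None
        return self.draft
```

For example, after submitting "first" and "second", pressing Up twice yields "second" and then "first". Pressing Down with the caret in the middle of "first" leaves the text unchanged, while pressing it with the caret at the end moves forward to "second".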
Infrastructure
- Pinned pyreqwest-impersonate at 0.5.3 — Avoids Docker build failures caused by the source-only 0.5.5 release requiring cmake to compile boring-sys-imp.