v0.6.0 — Built-in Browser, Document Tools & Multi-Panel
Integrated Browser
Craft Agent now has a full built-in browser — no extensions, no external tools. The agent can browse the web, fill forms, extract data, and take screenshots, all from within the app.
- In-app browser panes — Browser windows open as panels alongside your conversation, so you can watch the agent navigate in real time
- Privileged execution approval — Sensitive browser actions require explicit approval, keeping you in control
- Smart screenshots — JPEG by default for faster, smaller captures with proper DPI scaling
- Full CLI command set — Navigate, click, fill, scroll, screenshot, evaluate JS, manage downloads, and more
Why it's cool
This unlocks true end-to-end automation. Connect any data source and let the agent act on it — fill out forms, submit reports, navigate dashboards, and complete multi-step workflows entirely hands-free. Combine browser actions with your connected sources: "pull this week's metrics from Amplitude, fill out the weekly report form in our internal tool, and post the summary to Slack." The browser closes the gap between data and action.
Document Tools
Built-in tools for working with PDF, Word, Excel, PowerPoint, images, and more — no Python packages or external dependencies needed.
- markitdown — Universal document-to-markdown converter (docx, xlsx, pptx, pdf, html, ipynb)
- pdf-tool — Extract text, merge, split, and inspect PDF files
- xlsx-tool — Read, write, and export Excel spreadsheets
- docx-tool — Create and edit Word documents
- pptx-tool — Inspect and work with PowerPoint files
- img-tool — Resize, convert, and extract metadata from images
- doc-diff — Compare two documents side by side
- ical-tool — Read and parse calendar files
Why it's cool
No more "install pandas" or "pip install python-docx" — document manipulation just works. Ask the agent to "merge these 5 PDFs into one", "convert this Word doc to markdown", "resize all images in this folder to 800px wide", or "compare these two contracts and highlight differences." Combine with sources for workflows like "download the latest report from Google Drive, extract the data table, and create a Linear issue for each action item."
Branching & Multi-Panel Layout
The workspace now supports branching conversations and viewing multiple panels simultaneously.
- Session branching — Fork a conversation at any point to explore different approaches without losing your original thread
- Multi-panel view — Open browser, documents, and conversations side by side
- New session list design — Cleaner, more organized session management
Why it's cool
Explore two different implementation strategies in parallel, keep a browser panel open while coding, or compare outputs side by side. Branching means you never have to worry about "going down the wrong path" — just branch and try both.
Model Updates
- Sonnet 4.6 — Migrated all Sonnet 4.5 connections to Sonnet 4.6 automatically
- Copilot model matching — Fixed model ID resolution for GitHub Copilot connections
- Connection compatibility — Fixed
anthropic_compatconnections rejecting bare model names
Improvements
- Explore mode safety baseline — Default allowlist now includes common typecheck commands (
bun run typecheck,tsc --noEmit, etc.) alongside existing read-only search commands - Session tool reliability — Aligned tool listing and execution across all backends (Claude, Pi, session-mcp-server) to prevent drift
- Permission mode handling — Improved session-state tracking for permission mode transitions
- Workspace scoping — Source and skill broadcasts are now scoped to the active workspace
- Automations discoverability — Added "Learn More" and "Add Automation" to empty automations state, plus Help menu link
- Feature flag robustness — Runtime-safe evaluation prevents import-order issues in debug sessions
- Typecheck coverage —
typecheck:allnow includessession-tools-core
Bug Fixes
- Oversized images — Prevented oversized images from permanently breaking sessions
- Image resize — Hardened image resize with better error scoping and metadata stripping
- Tool input history — Stripped
_displayName/_intentmetadata from session history display - Window drag region — Fixed TopBar drag area
- Noisy logging — Cleaned up debug logging; downgraded
readFileENOENT to debug level - Duplicate models — Fixed duplicate model entries during connection migration
- Git Bash (Windows) — Fixed onboarding check that was skipped since v0.5.0