github lukilabs/craft-agents-oss v0.6.0

7 hours ago

v0.6.0 — Built-in Browser, Document Tools & Multi-Panel


Integrated Browser

Craft Agent now has a full built-in browser — no extensions, no external tools. The agent can browse the web, fill forms, extract data, and take screenshots, all from within the app.

  • In-app browser panes — Browser windows open as panels alongside your conversation, so you can watch the agent navigate in real time
  • Privileged execution approval — Sensitive browser actions require explicit approval, keeping you in control
  • Smart screenshots — JPEG by default for faster, smaller captures with proper DPI scaling
  • Full CLI command set — Navigate, click, fill, scroll, screenshot, evaluate JS, manage downloads, and more

Why it's cool

This unlocks true end-to-end automation. Connect any data source and let the agent act on it — fill out forms, submit reports, navigate dashboards, and complete multi-step workflows entirely hands-free. Combine browser actions with your connected sources: "pull this week's metrics from Amplitude, fill out the weekly report form in our internal tool, and post the summary to Slack." The browser closes the gap between data and action.

Document Tools

Built-in tools for working with PDF, Word, Excel, PowerPoint, images, and more — no Python packages or external dependencies needed.

  • markitdown — Universal document-to-markdown converter (docx, xlsx, pptx, pdf, html, ipynb)
  • pdf-tool — Extract text, merge, split, and inspect PDF files
  • xlsx-tool — Read, write, and export Excel spreadsheets
  • docx-tool — Create and edit Word documents
  • pptx-tool — Inspect and work with PowerPoint files
  • img-tool — Resize, convert, and extract metadata from images
  • doc-diff — Compare two documents side by side
  • ical-tool — Read and parse calendar files

Why it's cool

No more "install pandas" or "pip install python-docx" — document manipulation just works. Ask the agent to "merge these 5 PDFs into one", "convert this Word doc to markdown", "resize all images in this folder to 800px wide", or "compare these two contracts and highlight differences." Combine with sources for workflows like "download the latest report from Google Drive, extract the data table, and create a Linear issue for each action item."

Branching & Multi-Panel Layout

The workspace now supports branching conversations and viewing multiple panels simultaneously.

  • Session branching — Fork a conversation at any point to explore different approaches without losing your original thread
  • Multi-panel view — Open browser, documents, and conversations side by side
  • New session list design — Cleaner, more organized session management

Why it's cool

Explore two different implementation strategies in parallel, keep a browser panel open while coding, or compare outputs side by side. Branching means you never have to worry about "going down the wrong path" — just branch and try both.

Model Updates

  • Sonnet 4.6 — Migrated all Sonnet 4.5 connections to Sonnet 4.6 automatically
  • Copilot model matching — Fixed model ID resolution for GitHub Copilot connections
  • Connection compatibility — Fixed anthropic_compat connections rejecting bare model names

Improvements

  • Explore mode safety baseline — Default allowlist now includes common typecheck commands (bun run typecheck, tsc --noEmit, etc.) alongside existing read-only search commands
  • Session tool reliability — Aligned tool listing and execution across all backends (Claude, Pi, session-mcp-server) to prevent drift
  • Permission mode handling — Improved session-state tracking for permission mode transitions
  • Workspace scoping — Source and skill broadcasts are now scoped to the active workspace
  • Automations discoverability — Added "Learn More" and "Add Automation" to empty automations state, plus Help menu link
  • Feature flag robustness — Runtime-safe evaluation prevents import-order issues in debug sessions
  • Typecheck coverage — typecheck:all now includes session-tools-core
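
As a rough illustration of the Explore mode allowlist idea above, the sketch below shows one way a read-only command check could work. It is not taken from the Craft Agents codebase: the names EXPLORE_ALLOWLIST and isAllowedInExploreMode, the exact patterns, and the set of search commands are all assumptions made for the example. Only bun run typecheck and tsc --noEmit come from the release notes.

```typescript
// Hypothetical sketch: these names and patterns are illustrative assumptions,
// not the project's actual implementation.
const EXPLORE_ALLOWLIST: readonly RegExp[] = [
  /^bun run typecheck(:[\w-]+)?$/, // "bun run typecheck", "bun run typecheck:all"
  /^tsc --noEmit$/,                // typecheck without emitting files
  /^(rg|grep|ls|cat)(\s|$)/,       // assumed set of read-only search commands
];

// Characters that could chain, substitute, or redirect into another command.
const SHELL_METACHARACTERS = /[;&|<>`$\n\r]/;

function isAllowedInExploreMode(command: string): boolean {
  const trimmed = command.trim();
  // Reject empty input and anything that could smuggle in a second command.
  if (trimmed === "" || SHELL_METACHARACTERS.test(trimmed)) {
    return false;
  }
  return EXPLORE_ALLOWLIST.some((pattern) => pattern.test(trimmed));
}
```

With this sketch, "tsc --noEmit" passes while "tsc --noEmit && rm -rf ." is rejected, because the metacharacter check runs before any allowlist pattern is consulted.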

Bug Fixes

  • Oversized images — Prevented oversized images from permanently breaking sessions
  • Image resize — Hardened image resize with better error scoping and metadata stripping
  • Tool input history — Stripped _displayName/_intent metadata from session history display
  • Window drag region — Fixed TopBar drag area
  • Noisy logging — Cleaned up debug logging; downgraded readFile ENOENT to debug level
  • Duplicate models — Fixed duplicate model entries during connection migration
  • Git Bash (Windows) — Fixed onboarding check that was skipped since v0.5.0
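
The tool input history fix above can be pictured as a small filtering step applied before tool inputs are rendered. The sketch below is a guess at the general shape, not the project's code: stripInternalMetadata and INTERNAL_KEYS are hypothetical names, and only the _displayName and _intent key names come from the release notes.

```typescript
// Hypothetical helper: only the two key names are taken from the release notes.
const INTERNAL_KEYS: readonly string[] = ["_displayName", "_intent"];

function stripInternalMetadata(
  input: Record<string, unknown>,
): Record<string, unknown> {
  const visible: Record<string, unknown> = {};
  for (const key of Object.keys(input)) {
    // Copy every key except the internal metadata fields.
    if (INTERNAL_KEYS.indexOf(key) === -1) {
      visible[key] = input[key];
    }
  }
  return visible; // a new object; the original input is left untouched
}
```

Returning a copy rather than deleting keys in place keeps the stored tool input intact, so only the displayed history changes.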
