This release improves evaluation output with structured JSON results and fixes several Windows compatibility issues.
What's New
- Adds URL click detection for terminals with mouse tracking support
- Includes structured results, run configuration, and summary in evaluation JSON output
- Includes judge reasons for passed relevance criteria in evaluation results
Bug Fixes
- Fixes Windows OS detection typo in session environment (corrects "window" to "windows")
- Replaces removed claude-3-7-sonnet-latest alias with explicit model ID in examples
- Uses platform-aware shell detection for Windows compatibility in skill expansion, script_shell, post-edit hooks, and bang commands
Technical Changes
- Pre-populates criterion names in CheckRelevance results
- Fixes lint issues including gci formatting and testifylint float comparisons
What's Changed
- docs: update CHANGELOG.md for v1.41.0 by @docker-read-write[bot] in #2307
- eval: include structured results, run config, and summary in JSON output by @hamza-jeddad in #2309
- fix: correct Windows OS detection typo in session environment by @trungutt in #2312
- fix: replace removed claude-3-7-sonnet-latest alias in examples by @trungutt in #2313
- fix: use platform-aware shell for skill expansion, script_shell, post-edit hooks, and bang command by @trungutt in #2314
- tui/messages: Add URL click detection for terminals with mouse tracking by @vvoland in #2308
Full Changelog: v1.41.0...v1.42.0