Context-savings reporting + FinOps accuracy
This release aligns context-savings reporting between the local ctx stats view and the Insight platform, and corrects per-turn cost attribution.
Improvements
- The local "Where you are now" savings bar now reflects the whole live conversation window — the main session plus the sub-agent and sandbox sessions it spawns — rather than a single session id.
- The kept-out ratio is reported to one decimal, so a genuine 99.9% window is no longer rounded up to a misleading 100%.
Fixes
- Retrieval access cost (
bytes_retrieved) forctx_search/ctx_fetch_and_indexis now bridged from the MCP server to the forward pipe, so the platform can derive the with/without ratio. Previously the retrieval signal never reached the dashboard. - "With context-mode" counts retrieval-tool returns only, not sandbox compute output, so the ratio is honest.
- A Task sub-agent's cumulative usage is no longer priced as a single turn, preventing inflated per-event cost from poisoning FinOps aggregates.
All changes are additive and backward compatible.