Changelog
The overview page has been updated. It now surfaces prompt caching stats and a live token throughput chart.
Load balancing (round-robin and cost-based) has also been added to the virtual models feature.
More details:
Features
- f291083 feat(dashboard): add token cache meter and overview refinements (#428)
- 34f13d6 feat(dashboard): live token throughput chart, prompt cache gauge, and overview refinements (#434)
- 9502487 feat(virtualmodels): load-balanced virtual models (round-robin + cost) with IaC config (#433)
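
For readers unfamiliar with the two balancing strategies named in #433, here is a minimal sketch of how round-robin and cost-based selection generally work. It is not the project's implementation or its IaC config format, which this changelog does not describe. The `Backend`, `RoundRobin`, and `LowestCost` names and the `cost_per_1k` field are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from itertools import cycle


@dataclass(frozen=True)
class Backend:
    """Hypothetical backend model behind a virtual model."""
    name: str            # illustrative identifier, not a real config key
    cost_per_1k: float   # illustrative price per 1,000 tokens


class RoundRobin:
    """Hands out backends in a fixed rotation, one per request."""

    def __init__(self, backends):
        if not backends:
            raise ValueError("at least one backend is required")
        self._rotation = cycle(backends)

    def pick(self):
        return next(self._rotation)


class LowestCost:
    """Always picks the cheapest backend; ties go to the first one listed."""

    def __init__(self, backends):
        if not backends:
            raise ValueError("at least one backend is required")
        self._backends = list(backends)

    def pick(self):
        return min(self._backends, key=lambda b: b.cost_per_1k)


if __name__ == "__main__":
    pool = [Backend("a", 0.5), Backend("b", 0.2), Backend("c", 0.2)]
    rr = RoundRobin(pool)
    print([rr.pick().name for _ in range(4)])
    print(LowestCost(pool).pick().name)
```

Round-robin spreads requests evenly regardless of price, while the cost strategy concentrates traffic on the cheapest backend. A real deployment would likely also account for availability and rate limits, which this sketch ignores.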