What's Changed
- No screenshot on the initial blank page by @mertunsall in #2154
- Fix dollar symbol escaping in model recommendations for Mintlify compatibility by @Spinny03 in #2153
- Add support for more models by @gregpr07 in #2143
- Structured output optimizations by @gregpr07 in #2157
- Implement structured output for judge system by @MagMueller in #2160
- Fix schema definition by @MagMueller in #2161
- Fix structured output schema conversion and improve error handling by @MagMueller in #2163
- eval-judge-improvements by @MagMueller in #2168
- judge-new-output-format by @MagMueller in #2169
- fix-judge-format by @MagMueller in #2170
- Evaluate login tasks in eval by the presence of a login cookie instead of webjudge by @Alezander9 in #2012
- Fix judge system for structured output compatibility by @MagMueller in #2172
- Nicer structured output by @gregpr07 in #2173
- fix-screenshot-changes-screen-size by @MagMueller in #2174
- Added usage data to agent history and evals by @gregpr07 in #2171
- log-token-usage by @MagMueller in #2176
- log-token-usage-specific by @MagMueller in #2177
- feature/thinking-parameter by @MagMueller in #2178
- fix-usage-count-to-json by @MagMueller in #2183
- Fix eval by @mertunsall in #2184
- Add upload file action by @mertunsall in #2185
- Remove save_pdf action by @mertunsall in #2186
- Clean Controller by @mertunsall in #2187
- eval-runners-use-blacksmith by @MagMueller in #2190
- eval-remove-env-from-runner by @MagMueller in #2191
- eval-runners-logging by @MagMueller in #2192
- eval-runners-disable-cache by @MagMueller in #2193
- eval-runners-cache-enable by @MagMueller in #2194
New Contributors
Full Changelog: 0.4.1...0.4.2