v0.2.7
boost
- support per-request parameterization
- small improvements to llm and chat APIs
- a series of experimental custom modules
- small improvements to bench short judge prompt type
- fix tasks report generation to correctly display used prompt
- allow specifying judge max_tokens via config and CLI
Full Changelog: v0.2.6...v0.2.7