What's Changed
- Fix small typos by @anakin87 in #356
- remove constraint on python version by @samsja in #368
- Fix typo in README.md: 'with along' → 'along with' by @CodeSinghh in #378
- Fix: reasoning-gym : match load_environment args to init() args by @code-juicer in #377
- fix/update links by @anakin87 in #372
- Fix **kwargs in
load_environmentbreaking by @mikasenghaas in #385 - Add average_reward column to make_dataset by @faresobeid in #365
- Add repeatable --header support to vf-eval for sending additional headers to OpenAI client by @AmeenP in #386
- Updates for ToolEnv + StatefulToolEnv for sandboxes by @willccbb in #384
- Truncate prompt mask of overly long prompts + completions by @nreHieW in #382
- Deserialize function tool call argument before applying chat template by @mikasenghaas in #376
- finish_reason=length if env caused truncation by @cat-state in #360
- quick fix by @willccbb in #390
- Fix string match condition by @mikasenghaas in #395
- Make
max_turnsan env arg inmath-pythonby @mikasenghaas in #394 - Set log level package-wide in eval script by @mikasenghaas in #403
- update toolenv to support openai param type by @code-juicer in #407
- Add install script by @mikasenghaas in #412
- Do not log error on overlong multi-turn prompt by @mikasenghaas in #402
- fix typos and mentions to
devgroup by @anakin87 in #396 - fix reasoning gym enviroment by @anakin87 in #401
- QoL improvements for
SandboxEnvby @mikasenghaas in #405 - Fix comment with wrong (old) function name by @AndrewCEmil in #429
- fix(tui): use sampling_args by @ob1-s in #415
New Contributors
- @anakin87 made their first contribution in #356
- @CodeSinghh made their first contribution in #378
- @code-juicer made their first contribution in #377
- @faresobeid made their first contribution in #365
- @AmeenP made their first contribution in #386
- @nreHieW made their first contribution in #382
- @AndrewCEmil made their first contribution in #429
- @ob1-s made their first contribution in #415
Full Changelog: v0.1.4...v0.1.5