GitHub: PrimeIntellect-ai/verifiers v0.1.2.post0

Verifiers v0.1.2.post0 – Release Notes

Minor post-release update focusing on polish: CLI script bug fixes and enhancements, environment example cleanup, better reporting, and improved test coverage.

Highlights

  • vf-eval: fixed rollout indexing bugs and improved reliability when sampling multiple rollouts.
  • vf-init: streamlined project initialization and naming (removed automatic vf- prefix) and refreshed templates.
  • Environments: documentation and prompt cleanups; added/updated AIME examples; improved report embedding.
  • Tests: expanded coverage across rubric behavior, XML parser, and environment edge cases.

Changes by Area

CLI and Scripts

  • vf-eval
    • Fix index handling when using multiple rollouts (PR #197).
    • Include metrics columns in generated datasets; the change is implemented in the supporting dataset utilities (PR #194).
  • vf-init
    • Remove automatic vf- prefix during init to honor provided names (PR #190).
    • Update README template/content for new environments (multiple small tweaks).
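
These notes do not describe the rollout index fix (PR #197) in detail. As a generic illustration only, and not the vf-eval implementation, the sketch below shows one common way to map between a flat result index and an (example, rollout) pair when each example is sampled several times. The example-major ordering and the helper names `flat_to_pair` and `pair_to_flat` are assumptions made for this sketch.

```python
# Hypothetical helpers: NOT part of verifiers. They assume results are stored
# example-major, i.e. all rollouts for example 0 come first, then example 1, ...

def flat_to_pair(flat_index: int, rollouts_per_example: int) -> tuple[int, int]:
    """Return (example_index, rollout_index) for a position in the flat list."""
    if rollouts_per_example <= 0:
        raise ValueError("rollouts_per_example must be positive")
    if flat_index < 0:
        raise ValueError("flat_index must be non-negative")
    # Quotient selects the example, remainder selects the rollout within it.
    return divmod(flat_index, rollouts_per_example)


def pair_to_flat(example_index: int, rollout_index: int, rollouts_per_example: int) -> int:
    """Inverse of flat_to_pair under the same example-major assumption."""
    if not 0 <= rollout_index < rollouts_per_example:
        raise ValueError("rollout_index out of range")
    return example_index * rollouts_per_example + rollout_index
```

Checking that the two helpers round-trip for every index is a cheap guard against off-by-one mistakes in this kind of bookkeeping.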

Environments and Examples

  • AIME 2024 / AIME 2025 updates (PR #199).
  • Math Python example: prompt/readme/report cleanups.
  • General environment cleanup and README refreshes across multiple examples.
  • HotpotQA example: troubleshooting notes and minor fixes.

Parsers, Rubrics, and Utils

  • XMLParser: fix handling of string completions during parse_answer (PR #196).
  • Rubric: ensure error-handling behavior is well covered by tests (PR #195).
  • Reporting: improvements to report generation/embedding (report_utils).
  • Dataset helpers: include metrics columns in outputs where expected (PR #194).
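
To make the XMLParser item above concrete, here is a minimal standalone sketch of an answer extractor that accepts either a plain string completion or a list of chat messages. It is not the library's code: the function name `extract_answer`, the `<answer>` tag, and the message shape (dicts with `role` and `content` keys) are assumptions chosen for illustration.

```python
import re
from typing import Optional, Union

# Hypothetical message shape for this sketch: {"role": ..., "content": ...}
Message = dict[str, str]

_ANSWER_RE = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def extract_answer(completion: Union[str, list[Message]]) -> Optional[str]:
    """Return the last <answer>...</answer> body, or None if none is found."""
    if isinstance(completion, str):
        # String completion: treat the whole string as a single text.
        texts = [completion]
    else:
        # Chat completion: scan assistant messages, most recent first.
        texts = [
            m.get("content", "")
            for m in reversed(completion)
            if m.get("role") == "assistant"
        ]
    for text in texts:
        matches = _ANSWER_RE.findall(text)
        if matches:
            return matches[-1].strip()
    return None
```

Branching on the input type up front keeps the string case from being iterated character by character as if it were a message list, which is one plausible way such a bug can arise.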

Tests

  • Increase test coverage for:
    • Rubric error handling (PR #195).
    • XML parser behavior (new tests).
    • Environment edge cases and extra scenarios.

Acknowledgements

Thank you to everyone who contributed to this minor release; your contributions are appreciated.

Upgrade Notes

  • No breaking API changes.
  • When initializing a new environment with vf-init, note that the name you provide is now used verbatim (no automatic vf- prefix; PR #190).

Reference Commits (since v0.1.2)

  • Fix XMLParser string completion parsing (PR #196)
  • Improve test coverage for Rubric error handling (PR #195)
  • Include metrics columns in dataset outputs (PR #194)
  • Fix vf-eval rollout index handling (PR #197)
  • Remove automatic vf- prefix from init (PR #190)
  • AIME 2024 / 2025 environments updates (PR #199)
  • Environment README/reporting cleanups and misc improvements
