github JuliusBrussee/caveman v1.1.0
v1.1.0 — Real Benchmarks

one month ago

What new

  • Reproducible benchmark system: benchmarks/run.py call Claude API, measure real output token counts normal vs caveman, auto-update README table. No more fake numbers.
  • Real benchmark data — 10 coding prompts, actual API measurements. Average 65% token savings (range 22%–87%).
  • Codex plugin support — caveman now work in OpenAI Codex too.
  • Contributing guide + issue templates for bug reports and feature requests.
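
For readers who want to see how a savings figure like the one above can be derived, here is a minimal sketch. It is not the code in benchmarks/run.py. The function names and the token counts are made up for illustration, and it assumes the average is the mean of per-prompt savings, which the release notes do not confirm.

```python
from statistics import mean


def savings_pct(normal_tokens: int, caveman_tokens: int) -> float:
    """Percent of output tokens saved by caveman mode for one prompt."""
    if normal_tokens <= 0:
        raise ValueError("normal_tokens must be positive")
    return 100.0 * (normal_tokens - caveman_tokens) / normal_tokens


def summarize(pairs):
    """Mean, min and max savings over (normal, caveman) token-count pairs.

    Assumption: the average is taken over per-prompt percentages,
    not over the total token counts.
    """
    per_prompt = [savings_pct(normal, caveman) for normal, caveman in pairs]
    return {
        "mean": mean(per_prompt),
        "min": min(per_prompt),
        "max": max(per_prompt),
    }


# Hypothetical token counts, NOT the measurements from this release.
sample = [(200, 100), (400, 100), (100, 80)]
summary = summarize(sample)
print(f"average {summary['mean']:.0f}% "
      f"(range {summary['min']:.0f}%-{summary['max']:.0f}%)")
```

Averaging per-prompt percentages gives every prompt equal weight. Dividing total saved tokens by total normal tokens would weight long answers more heavily and can give a different number.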

Run benchmarks yourself

cd benchmarks
pip install -r requirements.txt
python run.py --dry-run          # preview, no API calls
python run.py --update-readme    # run + update README table

Full Changelog: v1.0.0...v1.1.0