1.15.3 (2026-02-25)
Bugs Fixed
- Fixed Foundry red team double-evaluation that caused ~3x scan latency by removing redundant evaluation_processor.evaluate() call.
- Fixed lost red team attack details where baseline overwrites wiped evaluation results.
- Fixed incorrect per_testing_criteria that showed attack strategies alongside risk categories.
- Fixed metadata leak of internal scorer fields (attack_success, attack_strategy, score) into results.json.
- Improved error logging with run_id, display_name, and full stack traces for red team scan failures.