1.13.1 (2025-11-05)
Features Added
- Improved RedTeam coverage across risk sub-categories to ensure comprehensive security testing
- Made RedTeam's
AttackStrategy.Tenseseed prompts dynamic to allow use of this strategy with additional risk categories - Refactors error handling and result semantics in the RedTeam evaluation system to improve clarity and align with Attack Success Rate (ASR) conventions (passed=False means attack success)
Bugs Fixed
- Fixed RedTeam evaluation error related to context handling for context-dependent risk categories
- Fixed RedTeam prompt application for model targets during Indirect Jailbreak XPIA (Cross-Platform Indirect Attack)