12.31.0 (2026-01-21)
Features
- add cursor rule for creating new built-in metrics (llm classification evaluators) (#10987) (d329bea)
- add FaithfulnessEvaluator and deprecate HallucinationEvaluator (#10962) (fc8b1b5)
- add span_id_key to link dataset examples to traces (#10942) (01eb1fb)
- dataset and experiment cli commands (#10997) (32343ff)
- phoenix cli (#10944) (d69eb80)
Bug Fixes
- cost: update built-in model token prices (#11001) (f6ec754)
- normalize tool return content before rendering (#10941) (3ce4ca8)