confident-ai/deepeval v3.2.6 on GitHub

⚙️ New Features

DeepEval's 3.2.6 release focuses on single-vs multi-turn use cases in datasets!

🧩 Support for Single-Turn and Multi-Turn Datasets

Single-turn datasets: Simple input → output pairs for one-off prompt testing.
Multi-turn datasets: Full conversation flows with alternating user/assistant turns. Perfect for simulating real chat interactions.

DeepEval now automatically detects whether a dataset is single-turn or multi-turn based on structure and routes to the appropriate evaluation logic.

🧪 Conversational Goldens

Introduced a new concept: conversational goldens, which contains scenario, (and optionally expected_outcome) but not things like input and expected output as with single-turn use cases..

✅ Improvements

Smarter dataset evaluation routing: Whether single-turn or multi-turn, DeepEval figures it out and builds test cases accordingly.
Improved multi-turn context preservation: Each conversational turn is maintained during evaluation, giving more accurate multi-turn metrics.

This release is setting the stage for future multi-turn use cases.

Docs here: https://deepeval.com/docs/evaluation-datasets

confident-ai/deepeval v3.2.6 🎉 Renewed datasets, single vs multi-turn on GitHub

⚙️ New Features

🧩 Support for Single-Turn and Multi-Turn Datasets

🧪 Conversational Goldens

✅ Improvements

confident-ai/deepeval v3.2.6
🎉 Renewed datasets, single vs multi-turn

on GitHub