github confident-ai/deepeval v3.2.6
๐ŸŽ‰ Renewed datasets, single vs multi-turn

latest release: v3.3.5
one month ago

โš™๏ธ New Features

DeepEval's 3.2.6 release focuses on single-vs multi-turn use cases in datasets!

๐Ÿงฉ Support for Single-Turn and Multi-Turn Datasets

  • Single-turn datasets: Simple input โ†’ output pairs for one-off prompt testing.
  • Multi-turn datasets: Full conversation flows with alternating user/assistant turns. Perfect for simulating real chat interactions.

DeepEval now automatically detects whether a dataset is single-turn or multi-turn based on structure and routes to the appropriate evaluation logic.

๐Ÿงช Conversational Goldens

Introduced a new concept: conversational goldens, which contains scenario, (and optionally expected_outcome) but not things like input and expected output as with single-turn use cases..

โœ… Improvements

  • Smarter dataset evaluation routing: Whether single-turn or multi-turn, DeepEval figures it out and builds test cases accordingly.
  • Improved multi-turn context preservation: Each conversational turn is maintained during evaluation, giving more accurate multi-turn metrics.

This release is setting the stage for future multi-turn use cases.

Docs here: https://deepeval.com/docs/evaluation-datasets

Don't miss a new deepeval release

NewReleases is sending notifications on new releases.