Lots of new features this release:
JudgementalGPT
now allows for different languages - useful for our APAC and European friendsRAGAS
metrics now supports all OpenAI models - useful for those running into context length issuesLLMEvalMetric
now returns a reasoning for its scoredeepeval test run
now has hooks that call on test run completionevaluate
now displaysretrieval_context
for RAG evaluationRAGAS
metric now displays metric breakdown for all its distinct metrics