DeepEval Integration

Overview

DeepEval is an open-source LLM evaluation framework, built for engineers to unit-test LLM applications and AI Agents. It provides out-of-the-box LLM-powered metrics, including RAG, conversational, red-teaming, agentic, multimodal, and custom metrics.

DeepEval and Weaviate

You can use DeepEval to optimize search, retrieval, and RAG with Weaviate by leveraging DeepEval's custom and RAG metrics to select the best hyperparameters like embedding model and top-K for your Weaviate collection.

Resources

The resources are broken into categories:

Hands on Learning: Build your technical understanding with end-to-end tutorials.
Read and Listen: Develop your conceptual understanding of these technologies.

Hands-on Learning

Notebook

Optimizing RAG with DeepEval

This notebook shows how to build a RAG pipeline using Weaviate and how to optimize its performance with DeepEval.

Open →