
DeepEval
Overview
DeepEval is an open-source LLM evaluation framework built for engineers to unit-test LLM applications and AI agents. It provides out-of-the-box LLM-powered metrics covering RAG, conversational, red-teaming, agentic, and multimodal use cases, as well as custom metrics.
DeepEval and Weaviate
You can use DeepEval to optimize search, retrieval, and RAG with Weaviate. Its custom and RAG metrics let you compare configurations and select the best hyperparameters, such as the embedding model and top-K, for your Weaviate collection.
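The selection loop described above can be sketched as a small grid search. This is a minimal illustration under stated assumptions, not DeepEval's or Weaviate's API: `sweep`, `toy_retrieve`, `toy_relevancy`, the model names, and the corpus are all hypothetical stand-ins. In a real setup, the `retrieve` callable would presumably wrap a query against a Weaviate collection built with the given embedding model, and the `score` callable would wrap a DeepEval RAG or custom metric.

```python
from itertools import product
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical signatures: retrieve(query, embedding_model, top_k) -> chunks,
# score(query, chunks) -> float in [0, 1]. Neither is a real library call.
Retriever = Callable[[str, str, int], List[str]]
Scorer = Callable[[str, List[str]], float]
Config = Tuple[str, int]


def sweep(
    queries: Sequence[str],
    embedding_models: Sequence[str],
    top_ks: Sequence[int],
    retrieve: Retriever,
    score: Scorer,
) -> Tuple[Config, Dict[Config, float]]:
    """Average the metric over all queries for each (model, top_k) pair."""
    results: Dict[Config, float] = {}
    for model, k in product(embedding_models, top_ks):
        scores = [score(q, retrieve(q, model, k)) for q in queries]
        results[(model, k)] = sum(scores) / len(scores)
    # Pick the configuration with the highest mean score.
    best = max(results, key=lambda cfg: results[cfg])
    return best, results


# --- Toy stand-ins so the sketch runs without any external service ---
CORPUS = [
    "weaviate stores vectors",
    "deepeval scores rag output",
    "bananas are yellow",
    "cats sleep a lot",
]


def toy_retrieve(query: str, model: str, k: int) -> List[str]:
    """Stand-in for a vector search; 'toy-overlap' ranks by shared words."""
    words = set(query.split())
    if model == "toy-overlap":
        ranked = sorted(
            CORPUS, key=lambda d: len(words & set(d.split())), reverse=True
        )
    else:  # "toy-unranked": return the corpus in its stored order
        ranked = list(CORPUS)
    return ranked[:k]


def toy_relevancy(query: str, chunks: List[str]) -> float:
    """Stand-in for a relevancy metric: share of chunks sharing a query word."""
    words = set(query.split())
    hits = sum(1 for c in chunks if words & set(c.split()))
    return hits / len(chunks)


best, results = sweep(
    queries=["deepeval rag", "weaviate vectors"],
    embedding_models=["toy-overlap", "toy-unranked"],
    top_ks=[1, 2],
    retrieve=toy_retrieve,
    score=toy_relevancy,
)
print(best, results[best])
```

Keeping retrieval and scoring behind plain callables means the same loop can be reused when the toy functions are swapped for real Weaviate queries and DeepEval metrics, and it keeps the comparison across configurations on an identical query set.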
Resources
The resources are grouped into two categories:
- Hands-on Learning: Build your technical understanding with end-to-end tutorials.
- Read and Listen: Develop your conceptual understanding of these technologies.