際際滷

際際滷Share a Scribd company logo
LLM based Evaluator for RAG
Mayank Bhaskar
Dave Campbell
Sujit Pal
question context
answer
ground truth
Retriever
LLM
question context
answer
ground truth
Retriever
LLM
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
JSON
Snapshot
LLM
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
LCEL
JSON
Snapshot
LLM
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
LCEL
JSON
Snapshot
LLM
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
DSPy
predicted scores
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
LCEL
JSON
Snapshot
LLM
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
DSPy
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
LCEL
JSON
Snapshot
LLM
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
DSPy
Manual
Evaluation Tool
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
LCEL
JSON
Snapshot
LLM
Faithfulness
Answer
Relevance
Context
Precision
Context
Utilization
Context
Relevance
Answer
Correctness
Answer
Similarity
Context
Recall
DSPy
Manual
Evaluation Tool
Synthetic
Data
Predictive
Models
https://github.com/sujitpal/llm-rag-eval

More Related Content

Google AI Hackathon: LLM based Evaluator for RAG