
Evaluation

ragway includes built-in evaluation modules to benchmark pipeline quality.

Metrics

  • Faithfulness
  • Answer accuracy
  • Context recall
  • Context precision
  • Hallucination score
  • Latency
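
To make two of the metrics above concrete, here is a minimal sketch of context precision and context recall computed over chunk IDs. It is a generic set-based formulation, not ragway's implementation; the function names and the idea of identifying chunks by string ID are assumptions made for illustration.

```python
from typing import Iterable


def context_precision(retrieved: Iterable[str], relevant: Iterable[str]) -> float:
    """Fraction of the distinct retrieved chunks that are relevant (hypothetical helper)."""
    retrieved_set = set(retrieved)
    if not retrieved_set:
        return 0.0  # nothing retrieved, so no retrieved chunk can be relevant
    return len(retrieved_set & set(relevant)) / len(retrieved_set)


def context_recall(retrieved: Iterable[str], relevant: Iterable[str]) -> float:
    """Fraction of the distinct relevant chunks that were retrieved (hypothetical helper)."""
    relevant_set = set(relevant)
    if not relevant_set:
        return 0.0  # no ground-truth chunks to recall
    return len(relevant_set & set(retrieved)) / len(relevant_set)


# Illustrative chunk IDs, not real data.
retrieved_ids = ["c1", "c2", "c3", "c4"]
relevant_ids = ["c2", "c4", "c7"]

precision = context_precision(retrieved_ids, relevant_ids)  # 2 of 4 retrieved are relevant
recall = context_recall(retrieved_ids, relevant_ids)        # 2 of 3 relevant were retrieved
```

High precision with low recall suggests the retriever is too selective; the reverse suggests it returns too much noise.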

CLI evaluation

rag evaluate --dataset eval.json --config rag.yaml

Programmatic evaluation

from ragway.evaluation.faithfulness import FaithfulnessEval
 
# Evaluate generated answer and retrieved context against a question.

Use the same dataset across pipeline variants, so that differences in scores reflect the pipeline rather than the data, and compare quality and cost trade-offs.
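
One way such a comparison might look once per-example results are collected is sketched below. The record fields (`faithfulness`, `latency_s`), the variant names, and the numbers are placeholders chosen for illustration; they are not ragway output or a ragway data format.

```python
from statistics import mean
from typing import Dict, List

Record = Dict[str, float]


def summarize(rows: List[Record]) -> Record:
    """Average each field over all examples for one pipeline variant."""
    return {
        "faithfulness": mean(r["faithfulness"] for r in rows),
        "latency_s": mean(r["latency_s"] for r in rows),
    }


def compare(variants: Dict[str, List[Record]]) -> Dict[str, Record]:
    """Summarize every variant; all variants are assumed to share one dataset."""
    return {name: summarize(rows) for name, rows in variants.items()}


# Placeholder per-example results for two hypothetical variants.
results = {
    "baseline": [
        {"faithfulness": 0.5, "latency_s": 1.0},
        {"faithfulness": 1.0, "latency_s": 2.0},
    ],
    "reranked": [
        {"faithfulness": 1.0, "latency_s": 2.0},
        {"faithfulness": 1.0, "latency_s": 3.0},
    ],
}

summary = compare(results)
for name, stats in summary.items():
    print(f"{name}: faithfulness={stats['faithfulness']:.2f}, latency={stats['latency_s']:.2f}s")
```

Reporting quality and latency side by side makes the trade-off visible: a variant that scores higher but responds more slowly may or may not be worth the extra cost.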