EP is an open specification, Python SDK, and pytest wrapper that provides a standardized way to write evaluations for large language model (LLM) applications. Start with simple single-turn evals for ...
Modular Design: Easily plug in your own retriever, generator, and LLM client functions.
Core RAG Metrics: Calculates standard metrics out of the box: Crucially, you ...
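
To make the "plug in your own functions" idea concrete, here is a minimal sketch of a RAG evaluation harness built from plain callables. It does not use the API of the library described above, which the snippet does not show. Every name in it (`RagPipeline`, `keyword_retriever`, `echo_llm`, `make_generator`, `evaluate`) is hypothetical. The two scores it computes are simple illustrative stand-ins, not the library's metrics, since the snippet's metric list is cut off.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical callable shapes for the three pluggable pieces.
Retriever = Callable[[str], List[str]]
LLMClient = Callable[[str], str]
Generator = Callable[[str, Sequence[str]], str]

# Toy corpus used only for this illustration.
DOCS = [
    "Paris is the capital of France.",
    "Berlin is the capital of Germany.",
    "The Nile is a river in Africa.",
]


def keyword_retriever(question: str, k: int = 2) -> List[str]:
    """Toy retriever: rank documents by word overlap with the question."""
    q_words = set(question.lower().replace("?", "").split())
    ranked = sorted(
        DOCS,
        key=lambda d: len(q_words & set(d.lower().rstrip(".").split())),
        reverse=True,
    )
    return ranked[:k]


def echo_llm(prompt: str) -> str:
    """Stand-in LLM client: returns the first context line of the prompt."""
    return prompt.splitlines()[1]


def make_generator(llm: LLMClient) -> Generator:
    """Build a generator that formats a prompt and delegates to the LLM client."""
    def generate(question: str, contexts: Sequence[str]) -> str:
        prompt = "Context:\n" + "\n".join(contexts) + "\nQuestion: " + question
        return llm(prompt)
    return generate


@dataclass
class RagPipeline:
    retriever: Retriever
    generator: Generator

    def run(self, question: str) -> Tuple[List[str], str]:
        contexts = self.retriever(question)
        return contexts, self.generator(question, contexts)


def evaluate(pipeline: RagPipeline, dataset: Sequence[Tuple[str, str]]) -> Dict[str, float]:
    """Illustrative scores: fraction of examples whose expected string
    appears in the retrieved contexts, and in the generated answer."""
    context_hits = 0
    answer_hits = 0
    for question, expected in dataset:
        contexts, answer = pipeline.run(question)
        context_hits += any(expected in c for c in contexts)
        answer_hits += expected in answer
    n = len(dataset)
    return {"context_hit_rate": context_hits / n, "answer_match_rate": answer_hits / n}


DATASET = [
    ("What is the capital of France?", "Paris"),
    ("What is the capital of Germany?", "Berlin"),
]

pipeline = RagPipeline(retriever=keyword_retriever, generator=make_generator(echo_llm))
scores = evaluate(pipeline, DATASET)
```

Swapping in a real retriever or LLM client only requires passing a different function with the same shape; the evaluation loop itself does not change.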