Step 1: Add Test Questions
Enter questions to test your chatbot
Step 2: Select Evaluation Rubrics
Choose how to evaluate responses
π Core Quality
π― Quality & Style
π Safety & Compliance
π¬ Conversation Flow
Above rubrics not sufficient? Define your own criteria below.
π‘ Tip: You can combine predefined rubrics with custom
ones
No custom rubrics yet.
Step 3: Review & Customize Rubrics
Fine-tune your evaluation criteria and select grading strictness
Step 4: Chatbot URL
Enter your chatbot's URL
Step 5: Configure Tools & Settings
Select enabled tools and preferences
Step 6: Review & Run Evaluation
Review your configuration before running
Questions
Rubrics
Bot URL
Tools
π Preview Evaluation Prompt
See exactly what criteria and examples will be sent to the AI judge
π No Results Yet
Run an evaluation to see results here
π¬ Hallucination Inspector
Evaluate your chatbot for hallucinations by comparing answers against expected truth
Balanced: Minor = -10pts, Major = -25pts, Critical = -50pts
Lenient: Minor = -5pts, Major = -15pts, Critical = -30pts
| Question | Expected Answer | |
|---|---|---|
| No Q/A pairs yet. Upload CSV or add manually. | ||
β‘ GEPA Optimizer
Optimize prompts for smaller models using GEPA
β‘ Transcript Inspector
Compare your chatbot responses against transcript answers
π No Results Yet
Run an evaluation to see results here