What You’ll Build
A complete evaluation loop for a customer support bot, from creating scenarios to extracting actionable insights.Prerequisites
Sign up for Sageloop
Create your free account to get started
Step 1: Create a New Project
- Click “New Project” on your dashboard
- Enter name: “Support Bot - First Try”
- Paste system prompt:
- Click “Create Project”
Step 2: Add 15 Test Scenarios
Click “Bulk Add Scenarios” and paste these:Step 3: Generate Outputs
- Click “Generate Outputs”
- Wait 30-60 seconds
- Review the table of responses
Step 4: Rate the Outputs
Using the 5-star scale, rate each output:- 5 stars: Perfect, ready for production
- 4 stars: Good, minor issues
- 3 stars: Okay, needs improvement
- 2 stars: Problem, major issues
- 1 star: Unacceptable
- “Too vague”
- “Missing key information”
- “Wrong tone”
- “Doesn’t match policy”
Typical pattern you’ll see:
- Some outputs say “soon” for refund timeline
- Some don’t apologize
- Some are too formal
- Some are perfect
Step 5: Extract Patterns
- Go to “Insights” tab
- Click “Run Pattern Extraction”
- Wait 5-10 seconds for results
Failure Analysis
Groups of low-rated outputs with their root causes:Cluster 1: Vague Timelines (3 outputs)Issues: Says “soon” instead of specific timeframeFix: Add “specific refund timeline (5-7 days)” to prompt
Cluster 2: Missing Apology (2 outputs)Issues: Doesn’t acknowledge customer concernFix: “Always start by apologizing”
Quality Patterns
What 5-star outputs have in common:5-Star Pattern:
- Starts with apology
- Specific information (not vague)
- Clear next steps
- Professional but warm tone
Step 6: Apply a Fix & Retest
- Click “Apply Fix & Retest” on Cluster 1 (Vague Timelines)
- Review the suggested prompt update
- Click “Update & Retest”
- Only the 3 failed scenarios regenerate
- You get new outputs to rate
- Check if they’re better
Step 7: Check Your Progress
Review your Success Rate:- Started: ~65% (9/15 passing)
- After first fix: ~85% (13/15 passing)
What You’ve Learned
1
Created a project
2
Added realistic scenarios
3
Generated AI outputs
4
Rated based on intuition
5
Discovered patterns from your ratings
6
Applied concrete improvements
7
Validated improvements
Next Steps
Now that you understand the workflow:Rating Guide
Master rating technique
Pattern Extraction
Advanced insights
Use Cases
See other examples
Creating Scenarios
Build better test suites
Tips for Success
Start simple
Start simple
Don’t aim for perfection in first iteration
Use real data
Use real data
Real scenarios > made-up ones
Iterate
Iterate
2-4 iterations is normal
Share with team
Share with team
Export golden examples
Export golden examples
Use for CI/CD
Congratulations! You’ve experienced the core Sageloop workflow. Now explore the guides for deeper learning.