Reliability Summary First
Start atDashboard -> First-Run Reliability Summary (/dashboard/first-run).
This view brings reliability outcomes to the top:
- Deadlock/loop signals
- Context-loss signals
- Tool-call failure signals
- Unnecessary tool-call feedback patterns
Tasks
Use task pages to inspect full execution cycles in one place. Key views:- Timeline: ordered
trace,tool_call, andloopevents - Tool Calls: status, errors, latency, arguments/results
- Loops: repeated/circular/stuck patterns with severity
- Detection summary: tool matching, failures, context
Conversations
Conversation views aggregate reliability across multi-turn user sessions. Key views:/context-failures: typed context regressions/context-health: overall score + timeline + severity distribution/trace-tree: hierarchical trace/tool graph for root-cause analysis
Feedback and Unnecessary Tool Patterns
From task detail pages, submit feedback on tool calls as:necessaryunnecessaryunclear
/api/v1/feedback/patterns/api/v1/feedback/stats
Alerts
Default alerting is reliability-first and centers on:execution_loop_detectedtool_call_failurescontext_loss_detectedunnecessary_tool_pattern

