| Grouped by Agent | Latency | Escalation | Identity Verification | Robotic Tone Detection |
|---|---|---|---|---|
| Claude 3.5 Sonnet▸ | 1.3s | 68% | 88% | 45% |
| GPT-4o▸ | 4.1s | 80% | 80% | 87% |
| GPT-4o mini▸ | 3.1s | 75% | 90% | 65% |
Latency
YesHuman Review
Explanation
Average end-to-end response latency stayed within the 3s target across the call, with no single turn exceeding the threshold.
Escalation
YesHuman Review
Explanation
The agent correctly recognized when the request exceeded its scope and handed off to a human specialist, passing along full context so the customer did not have to repeat themselves.
Identity Verification
NoHuman Review
Explanation
The agent proceeded with account-level requests without completing the required identity verification steps. Verification is mandatory before accessing or modifying any account information.
Robotic Tone Detection
NoHuman Review
Explanation
Responses were detected as flat and robotic across several turns, lacking the natural prosody and pacing expected for this persona.