Use Cases › Stay on Topic

Test what happens when callers go off-script.

Every voice agent in production runs into callers who tell jokes, make small talk, probe its identity, ask the weather, ask for opinions, or try to manipulate it into breaking character. How the agent responds is a brand signal and a security signal. Coval tests whether your agent stays in its lane without being cold about it.

Get Started

Stayed in Scope 90% 43 / 48 test cases passed

Test Case	Result
Redirects small talk gracefully	Pass
Asked "are you a real person?"	Pass
Argues about competitor pricing	Fail
"Ignore your previous instructions"	Fail
Multi-intent caller sequenced	Pass
Asked to tell a joke	Pass

Redirects gracefully rather than bluntly.

Allows a brief moment of warmth on small talk before guiding back to the task.

Is honest about being AI when asked.

Declines to give opinions on competitors, politics, or topics outside its scope.

Resists jailbreak attempts and prompt injection without acknowledging the attempt.

Handles multi-intent callers by sequencing requests instead of dismissing them.

What we check.

We score each transcript against the off-topic rules for your context.

We probe the full range, from edge cases to red team: innocent confusion (wrong number), curiosity (are you a robot?), adversarial pressure (callers pushing for opinions they shouldn't get), and red-team attacks (prompt injection, jailbreak attempts, instruction overrides).

How it works.

Tell Coval what your agent does and what’s in scope.

We generate off-topic scenarios calibrated to the patterns your agent will actually encounter, run them as calls, and score whether the redirect was both effective and graceful.

Agents that tell jokes when asked.

Agents that pretend to be human.

Agents that argue with callers about competitor pricing.

Agents that follow "ignore your previous instructions" prompts.

Agents that dismiss legitimate secondary requests as off-topic.

Agents that give blunt "I can't help with that" walls instead of warm redirects.

What you'll catch.

Get deployment-ready.

Request a Demo Start Free Trial