Overview
A script-based dataset is a collection of test cases written as a conversation script. This gives you full control over the conversation flow and user messages. Each script test case primarily consists of multiple turns of conversation. At every turn, you can specify the exact user message to be used in that turn and optionally the expected agent behavior. Quraite provides three ways to define the expected agent response:Exact Match
The agent’s response must match the expected response exactly.
Regex Match
The agent’s response must match the expected response pattern using regex.
Semantic Match
The agent’s response must be judged by an LLM to match the expected response.
In Order
Tool calls match the specified sequence.
Any Order
Tool calls match regardless of sequence.
How It Works
Quraite invokes the agent using the exact user messages specified at each turn. If expected behavior is defined, Quraite evaluates the agent’s response against it.When to Use Script-Based Datasets
Script-based datasets work best when:- Replaying production traces. Test real conversations exactly as they occurred.
- Debugging issues. Reproduce specific user problems with precise message sequences.
- Running validation tests. Fast, predictable checks with deterministic inputs.
Create and Run Script-Based Test Cases
This guide uses the Retail Agent in the Default Project. Quraite creates this project automatically at signup.
Create a new test case
This test case tests retrieving order status.Click + Test CaseEnter Turn 1 detailsUser message:Expected agent behavior:Enter Turn 2 detailsUser message:Expected agent behavior:
- Select Evaluation Approach as LLM.
- Enter Expected Content:
- Select Evaluation Approach as Regex.
- Enter Expected Content:
- Click + Tool Call.
- Select Tool Call Evaluation Type as In Order.
- Enter tool call name:
- Enter tool call arguments:
- Click + Tool Call.
- Enter tool call name:
- Enter tool call arguments:
Code-based script test case definitions are coming soon.
Next Steps
- Run the remaining sample test cases in the Script-based Dataset.
- Create a new project and run test cases against your own agent.

