# Simulation Runtime
Attest’s simulation runtime enables testing AI agents under controlled conditions with simulated users, mocked tools, fault injection, and multi-turn orchestration. All simulation primitives are decorators that compose with Attest’s assertion pipeline.
## Overview

The simulation module (`attest.simulation`) provides four decorators:
| Decorator | Purpose |
|---|---|
| `@scenario` | Full simulation configuration: persona, mocks, faults, multi-turn |
| `@repeat` | Run a test function N times and collect statistical results |
| `@mock_tool` | Tag a function as a mock tool implementation |
| `@fault_inject` | Add random errors and latency jitter to a function |
It also provides three built-in personas for simulating different user behaviors.
## `@scenario` Decorator

The `@scenario` decorator wraps an agent test function with simulation context. It accepts the `ScenarioConfig` parameters listed below and, when the decorated function is called, returns a `ScenarioResult`.
```python
from attest.simulation import scenario, FRIENDLY_USER

@scenario(
    persona=FRIENDLY_USER,
    mock_tools={"search_web": lambda q: {"results": ["mock result"]}},
    fault_error_rate=0.1,
    fault_latency_jitter_ms=200,
    max_turns=5,
    seed=42,
)
def test_multi_turn_support(persona):
    result = my_agent.run(persona=persona)
    return result
```

### ScenarioConfig Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| `persona` | `Persona \| None` | `None` | Simulated user persona passed to the test function |
| `mock_tools` | `dict[str, Callable]` | `{}` | Tool name to mock implementation mapping |
| `fault_error_rate` | `float` | `0.0` | Probability (0.0-1.0) of injecting a `RuntimeError` per turn |
| `fault_latency_jitter_ms` | `int` | `0` | Max random latency added per turn (milliseconds) |
| `max_turns` | `int` | `1` | Number of turns to execute |
| `seed` | `int \| None` | `None` | RNG seed for reproducible fault injection |
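To see how these parameters fit together, here is a rough sketch of a `ScenarioConfig`-shaped dataclass; the class name and field layout below mirror the table, not Attest's actual source:

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

@dataclass
class ScenarioConfigSketch:
    # Field names and defaults mirror the parameter table above
    persona: Optional[object] = None
    mock_tools: dict[str, Callable] = field(default_factory=dict)
    fault_error_rate: float = 0.0
    fault_latency_jitter_ms: int = 0
    max_turns: int = 1
    seed: Optional[int] = None

# The defaults describe a single deterministic turn with no faults or mocks
default_config = ScenarioConfigSketch()
print(default_config.max_turns, default_config.fault_error_rate)  # 1 0.0
```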
### ScenarioResult

The decorated function returns a `ScenarioResult`:
```python
result = test_multi_turn_support()

result.config   # ScenarioConfig used
result.results  # list[AgentResult] — one per turn
result.passed   # True if all turns passed
```

### Multi-Turn Flow

When `max_turns > 1`, the scenario executes the wrapped function once per turn, and each turn produces an `AgentResult`. The persona is injected as a keyword argument if provided:
```python
@scenario(persona=ADVERSARIAL_USER, max_turns=3, seed=42)
def test_adversarial_multi_turn(persona):
    result = my_agent.run(input=persona.system_prompt)
    return result

scenario_result = test_adversarial_multi_turn()
assert scenario_result.passed
assert len(scenario_result.results) == 3
```

### Mock Tool Integration

Mock tools registered via `mock_tools` are activated as a context variable during scenario execution. When `TraceBuilder.add_tool_call()` is called with a tool name that matches a registered mock, the mock function is invoked instead:
```python
@scenario(mock_tools={
    "lookup_order": lambda order_id: {"status": "delivered", "total": 89.99},
    "process_refund": lambda order_id: {"refund_id": "RFD-001"},
})
def test_refund_with_mocks():
    builder = TraceBuilder(agent_id="refund-agent")
    builder.add_tool_call("lookup_order", args={"order_id": "ORD-123"})
    # Result is automatically populated by the mock
    builder.set_output(message="Refund processed.")
    return AgentResult(trace=builder.build())
```

## `@repeat` Decorator

Run a test function multiple times and collect statistical results. Useful for measuring pass rates under non-deterministic conditions.
```python
from attest.simulation import repeat

@repeat(n=10)
def test_agent_reliability():
    result = my_agent.run("Process refund for ORD-123")
    return result

repeat_result = test_agent_reliability()
repeat_result.count       # 10
repeat_result.pass_rate   # 0.0 to 1.0
repeat_result.all_passed  # True if all 10 passed
repeat_result.results     # list[AgentResult]
```

### RepeatResult Properties
| Property | Type | Description |
|---|---|---|
| `count` | `int` | Number of runs executed |
| `pass_rate` | `float` | Fraction of passing runs (0.0-1.0) |
| `all_passed` | `bool` | `True` if every run passed |
| `results` | `list[AgentResult]` | Individual results from each run |
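The derived properties are easy to pin down precisely. A minimal stand-in (names taken from the table above, not from Attest's implementation) shows how `count`, `pass_rate`, and `all_passed` relate to `results`:

```python
from dataclasses import dataclass

@dataclass
class StubAgentResult:
    passed: bool

@dataclass
class RepeatResultSketch:
    results: list

    @property
    def count(self) -> int:
        return len(self.results)

    @property
    def pass_rate(self) -> float:
        # Fraction of passing runs; 0.0 for an empty result set
        if not self.results:
            return 0.0
        return sum(r.passed for r in self.results) / len(self.results)

    @property
    def all_passed(self) -> bool:
        return all(r.passed for r in self.results)

# 3 runs, 2 passed: pass_rate is 2/3 and all_passed is False
rr = RepeatResultSketch([StubAgentResult(True), StubAgentResult(True), StubAgentResult(False)])
print(rr.count, rr.all_passed)  # 3 False
```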
## `@mock_tool` Decorator

Tag a function as a mock tool implementation. The decorated function retains a `_mock_tool_name` attribute for registration:
```python
from attest.simulation import mock_tool

@mock_tool("search_web")
def mock_search(q: str) -> dict:
    return {"results": [f"Mock result for: {q}"]}
```

### MockToolRegistry

For programmatic mock registration, use `MockToolRegistry` as a context manager:
```python
from attest.simulation import MockToolRegistry

registry = MockToolRegistry()
registry.register("search_web", lambda q: {"results": ["mock"]})
registry.register("fetch_url", lambda url: {"content": "mock content"})

with registry:
    # All TraceBuilder.add_tool_call() invocations within this block
    # use the registered mocks
    result = my_agent.run("Search for testing frameworks")
```

## `@fault_inject` Decorator

Add random failures and latency to any function. Useful for testing agent resilience to tool failures:
```python
from attest.simulation import fault_inject

@fault_inject(error_rate=0.3, latency_jitter_ms=500, seed=42)
def flaky_api_call(query: str) -> dict:
    return {"data": actual_api_call(query)}
```

### Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| `error_rate` | `float` | `0.0` | Probability (0.0-1.0) of raising `RuntimeError` |
| `latency_jitter_ms` | `int` | `0` | Max random sleep added (milliseconds) |
| `seed` | `int \| None` | `None` | RNG seed for reproducible behavior |
When `error_rate > 0` and the RNG triggers, `RuntimeError(f"Injected fault in {fn.__name__}")` is raised before the function executes.
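That trigger logic can be sketched as follows, assuming fault decisions come from a per-decorator `random.Random(seed)` draw (the wrapper name here is illustrative, not Attest's internals):

```python
import random

def with_faults(fn, error_rate=0.0, seed=None):
    """Illustrative wrapper: raise before calling fn when the RNG triggers."""
    rng = random.Random(seed)

    def wrapper(*args, **kwargs):
        if rng.random() < error_rate:
            # Matches the documented message format
            raise RuntimeError(f"Injected fault in {fn.__name__}")
        return fn(*args, **kwargs)

    return wrapper

def lookup_order(order_id: str) -> dict:
    return {"status": "delivered"}

# error_rate=0.0 never triggers; error_rate=1.0 always does
never_fails = with_faults(lookup_order, error_rate=0.0, seed=42)
always_fails = with_faults(lookup_order, error_rate=1.0, seed=42)
print(never_fails("ORD-1"))  # {'status': 'delivered'}
```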
## Personas

Three built-in personas simulate different user interaction styles:
### FRIENDLY_USER

Cooperative user with clear, well-structured requests.
```python
from attest.simulation import FRIENDLY_USER

FRIENDLY_USER.name           # "friendly_user"
FRIENDLY_USER.style          # "friendly"
FRIENDLY_USER.temperature    # 0.7
FRIENDLY_USER.system_prompt  # "You are a friendly, cooperative user..."
```

### ADVERSARIAL_USER

Tests edge cases, sends malformed inputs, and attempts to elicit unexpected behaviors.
```python
from attest.simulation import ADVERSARIAL_USER

ADVERSARIAL_USER.name         # "adversarial_user"
ADVERSARIAL_USER.style        # "adversarial"
ADVERSARIAL_USER.temperature  # 0.9
```

### CONFUSED_USER

Gives vague, contradictory instructions and frequently changes direction.
```python
from attest.simulation import CONFUSED_USER

CONFUSED_USER.name         # "confused_user"
CONFUSED_USER.style        # "confused"
CONFUSED_USER.temperature  # 0.8
```

### Custom Personas

Create custom personas with the `Persona` dataclass:
```python
from attest.simulation import Persona

IMPATIENT_USER = Persona(
    name="impatient_user",
    system_prompt=(
        "You are an impatient user who demands immediate answers "
        "and gets frustrated with follow-up questions."
    ),
    style="impatient",
    temperature=0.85,
)
```

`Persona` is a frozen dataclass with four fields:
| Field | Type | Default | Description |
|---|---|---|---|
| `name` | `str` | required | Identifier for the persona |
| `system_prompt` | `str` | required | System prompt describing the user behavior |
| `style` | `str` | required | Short label for the interaction style |
| `temperature` | `float` | `0.7` | LLM sampling temperature for persona responses |
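Because the dataclass is frozen, persona instances are immutable once created. A sketch with the same four fields (assuming a standard `@dataclass(frozen=True)` definition, which is not necessarily Attest's exact source) behaves like this:

```python
from dataclasses import dataclass, FrozenInstanceError

@dataclass(frozen=True)
class PersonaSketch:
    # Same four fields as the table above
    name: str
    system_prompt: str
    style: str
    temperature: float = 0.7

p = PersonaSketch(name="calm_user", system_prompt="You are a calm user.", style="calm")
print(p.temperature)  # 0.7 (the default)

try:
    p.temperature = 1.0  # mutation is rejected on frozen dataclasses
except FrozenInstanceError:
    print("immutable")
```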
## Composing Decorators

Decorators compose naturally. `@scenario` provides the simulation context; `@repeat` wraps it for statistical runs:
```python
from attest.simulation import scenario, repeat, ADVERSARIAL_USER

@repeat(n=5)
@scenario(persona=ADVERSARIAL_USER, fault_error_rate=0.2, max_turns=3, seed=42)
def test_adversarial_resilience(persona):
    result = my_agent.run(input=persona.system_prompt)
    return result

# Each of the 5 repeats runs the 3-turn adversarial scenario
repeat_result = test_adversarial_resilience()
assert repeat_result.pass_rate >= 0.8
```

## Fault Injection Patterns
### Tool-Level Faults

Apply `@fault_inject` to individual tool implementations to simulate unreliable external services:
```python
@fault_inject(error_rate=0.5, latency_jitter_ms=1000, seed=42)
def unreliable_search(q: str) -> dict:
    return external_search_api(q)

@scenario(mock_tools={"search_web": unreliable_search})
def test_agent_handles_search_failures():
    result = my_agent.run("Find information about testing")
    return result
```

### Scenario-Level Faults
Use `fault_error_rate` and `fault_latency_jitter_ms` on `@scenario` to inject faults at the turn level rather than the tool level:
```python
@scenario(fault_error_rate=0.1, fault_latency_jitter_ms=300, max_turns=10, seed=42)
def test_multi_turn_resilience():
    result = my_agent.run("Long conversation")
    return result
```

## Best Practices
- **Set seeds for reproducibility.** Both `@scenario` and `@fault_inject` accept a `seed` parameter. Use it in CI to ensure deterministic fault patterns.
- **Use `@repeat` for statistical confidence.** A single test run under adversarial conditions proves nothing. Run 10-50 times and assert on `pass_rate`.
- **Start with `FRIENDLY_USER`, then escalate.** Validate basic behavior with cooperative input before testing adversarial and confused personas.
- **Mock expensive tools.** Use `mock_tools` to avoid real API calls during simulation runs. Reserve integration tests for a separate suite.
- **Combine with assertions.** Simulation decorators produce `AgentResult` objects compatible with Attest's full assertion pipeline:
```python
from attest import expect
from attest.simulation import scenario, repeat, ADVERSARIAL_USER

@repeat(n=10)
@scenario(persona=ADVERSARIAL_USER, max_turns=3, seed=42)
def test_adversarial_with_assertions(persona):
    result = my_agent.run(input=persona.system_prompt)
    expect(result).output_not_contains("error").cost_under(0.05)
    return result

repeat_result = test_adversarial_with_assertions()
assert repeat_result.pass_rate >= 0.9
```
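The context-variable mock lookup described under "Mock Tool Integration" can be approximated with Python's `contextvars` module. Everything below, including the class and function names, is an illustrative sketch of the mechanism rather than Attest's internals:

```python
import contextvars
from typing import Callable

# Hypothetical context variable holding the active name -> mock mapping
_active_mocks = contextvars.ContextVar("active_mocks", default=None)

class SketchRegistry:
    """Context manager that activates a mock mapping for its block."""

    def __init__(self) -> None:
        self._mocks: dict[str, Callable] = {}
        self._token = None

    def register(self, name: str, fn: Callable) -> None:
        self._mocks[name] = fn

    def __enter__(self) -> "SketchRegistry":
        self._token = _active_mocks.set(self._mocks)
        return self

    def __exit__(self, *exc) -> None:
        _active_mocks.reset(self._token)

def call_tool(name: str, **kwargs):
    # Prefer a registered mock; otherwise fall through (a real tool would run here)
    mocks = _active_mocks.get()
    if mocks and name in mocks:
        return mocks[name](**kwargs)
    return None

registry = SketchRegistry()
registry.register("search_web", lambda q: {"results": ["mock"]})
with registry:
    print(call_tool("search_web", q="testing"))  # {'results': ['mock']}
```

Because the mapping lives in a `ContextVar`, it deactivates automatically when the `with` block exits, which is what lets scenarios swap tools without patching the agent itself.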