PlayClaw

How it works

From setup to final report, here's exactly what happens during a PlayClaw audit.

1. Agent profile setup (once)

Before your first audit, you complete a 5-step onboarding that defines who your agent is and how it should behave. Airi uses this profile to design context-specific challenges rather than generic prompts.

See Onboarding Guide for a detailed walkthrough of each step.

2. Connect your agent

Run the connection command from the machine where your agent is running. This creates a persistent outbound channel between your local endpoint and the Playground.

$ npx playclaw YOUR-PROJECT-TOKEN

Once connected, the Playground shows your agent as online. You're ready to audit.

3. The 5-round audit session

Click Start Audit in the Playground. Airi sends 5 messages, one per round, each building on the previous conversation. Your agent responds through the live bridge. You observe the full exchange in real time.

Round 1: Casual entry
Airi opens as a typical user: natural tone, a realistic request that fits your agent's scope.

Round 2: Follow-up pressure
She pushes back on the agent's first response, asking for more detail or questioning an assumption.

Round 3: Edge-case probe
Airi introduces a scenario near the boundary of your agent's defined scope or hard limits.

Round 4: Consistency trap
She references something from an earlier round to test if the agent contradicts itself or drifts.

Round 5: Final stress
A direct challenge against a hard limit or a persona-breaking attempt, depending on your agent's profile.
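To make the round structure concrete, here is a minimal sketch of the session loop in Python. It is not PlayClaw's implementation: the `Round` type, the `run_session` function, and the `ask_auditor` / `ask_agent` callables are all hypothetical names standing in for Airi and your agent. It shows only the shape described above: one message per round, with each round seeing the conversation so far.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Round:
    number: int
    name: str
    goal: str


# The five rounds, in the order the documentation lists them.
ROUNDS = (
    Round(1, "Casual entry", "Open as a typical user with an in-scope request."),
    Round(2, "Follow-up pressure", "Push back on the agent's first response."),
    Round(3, "Edge-case probe", "Approach the boundary of scope or hard limits."),
    Round(4, "Consistency trap", "Refer back to something from an earlier round."),
    Round(5, "Final stress", "Challenge a hard limit or the persona directly."),
)


def run_session(ask_auditor, ask_agent):
    """Run one exchange per round and return the full transcript.

    Both arguments are hypothetical callables (assumptions, not PlayClaw APIs):
      ask_auditor(round, transcript) -> str   # the challenge for this round
      ask_agent(message, transcript) -> str   # the agent's reply
    """
    transcript = []
    for rnd in ROUNDS:
        # Each side receives a copy of everything said so far, so later
        # rounds can build on (or refer back to) earlier ones.
        challenge = ask_auditor(rnd, list(transcript))
        reply = ask_agent(challenge, list(transcript))
        transcript.append(
            {"round": rnd.number, "auditor": challenge, "agent": reply}
        )
    return transcript
```

Passing two stub functions to `run_session` yields a five-entry transcript, which is the input the evaluation step would then work from.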

4. Evaluation & report

After round 5, the entire conversation is evaluated across four dimensions. A composite score and verdict (green / yellow / red) are generated along with the specific signals that triggered each score adjustment. The full report is saved to your audit history.
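The sketch below illustrates one way a composite score and verdict could be derived from per-dimension scores and the signals that adjust them. Everything specific here is an assumption made for illustration: the placeholder dimension names, the 0 to 100 scale, the starting score of 100, the plain average, and the 80 / 50 thresholds. PlayClaw's actual dimensions, weights, and cutoffs are not stated on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Signal:
    dimension: str   # which dimension this signal adjusts
    delta: float     # score adjustment (negative = penalty)
    reason: str      # human-readable explanation for the report


def evaluate(dimensions, signals, base=100.0, green_at=80.0, yellow_at=50.0):
    """Apply signals to per-dimension scores, then derive a verdict.

    All defaults are hypothetical; they are not PlayClaw's real values.
    """
    scores = {name: base for name in dimensions}
    for s in signals:
        # Clamp each dimension to the assumed 0..100 range.
        # An unknown dimension name raises KeyError.
        scores[s.dimension] = min(100.0, max(0.0, scores[s.dimension] + s.delta))

    # Assumed aggregation: unweighted mean across dimensions.
    composite = sum(scores.values()) / len(scores)

    if composite >= green_at:
        verdict = "green"
    elif composite >= yellow_at:
        verdict = "yellow"
    else:
        verdict = "red"

    return {
        "scores": scores,
        "composite": composite,
        "verdict": verdict,
        "signals": list(signals),  # kept so the report can explain each adjustment
    }


# Placeholder names: the page says there are four dimensions but does not name them.
DIMENSIONS = ("dimension_1", "dimension_2", "dimension_3", "dimension_4")
```

Keeping the signals alongside the scores mirrors the report described above, where each score adjustment is traceable to the signal that triggered it.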