Phase 4 — Run your first suite locally
Execute the suite containing your first goal; confirm verdict output (passed / failed / indeterminate), reason codes, and where artefacts land on disk.
restormel-testing run --suite <your-suite-id> --config restormel-testing.yaml # optional: --goal id1,id2 --json --artifact-dir ./out/run-1 restormel-testing report .restormel-testing/runs/run-…
Default artefact root: .restormel-testing/runs/run-<timestamp>/ with report.json, summary.md, traces, and JUnit. A failed goal includes a reproduction command in report.json includes factual judgeInvocationCount when rubrics ran; costEstimate is a
non-billing heuristic only.
Agent prompts
Optional: use these with a coding agent to implement this phase in your repo in a safe, gated sequence. Expand only when you need them.
- Run the suite locally; paste the summary block into the PR or team notes.
- Force a failure (e.g. break success criteria); confirm artefact paths and reproduction command.
- Compare wall-clock time and flake rate over three consecutive runs; adjust retries only if justified.