04Validation Playbooks

04 · VALIDATION PLAYBOOKS

Repeatable procedures, like a flight checklist

Each playbook is a numbered procedure with a clear trigger and a defined output. Run them the same way every time — that is what makes the result trustworthy.

PB-01Requirements

Executable Acceptance Spec

Frame → Constrain → Make testable → Sign off
TRIGGERBefore any generation.
PB-02Validation

Adversarial Eval Harness

Seed → Needle → Gate → Report
TRIGGEREvery model or prompt change.
PB-03Build

Tool Contract Pinning

Define → Pin → Test → Alert on drift
TRIGGEREvery external integration.
PB-04Release

Rollback Rehearsal

Trigger → Drill → Time → Document
TRIGGERBefore every release.
PB-05Validation

Source-Pinned Summarization

Retrieve → Cite span → Verify → Refuse-or-answer
TRIGGERAny factual synthesis task.
PB-06Build

Increment Bounding

Scope → Bound to one testable unit → Smoke → Merge
TRIGGEREvery build increment.