QE Framework automatically enforces spec definition · execution · verification when you ask AI to code, preventing "sloppy work" or "built the wrong thing" at the system level.
Why it's needed. When you tell AI to "fix this", one of three things usually breaks — the request is ambiguous so AI guesses, execution happens but there's no trace of what changed, or "all done" when really it's half-done. Even among human developers, we prevent this with spec → implementation → review. QE enforces the same discipline on AI.
How it's different. It's not just a prompt collection. Every task first lives in two files — TASK_REQUEST (what to do) and VERIFY_CHECKLIST (how to confirm it works) — before execution happens. Only after each checklist item is marked yes/no is the task considered complete. If something fails, a retry spec is automatically generated.
What's guaranteed. (1) You see what AI plans to do before it runs. (2) AI doesn't grade itself — there's a separate verify and supervise stage. (3) You can swap Claude or Codex in at any step, and since we don't call external APIs directly, dependencies stay isolated.
The fastest way to understand QE — imagine a studio workshop.
/Qinit — Install the framework into your project (auto-creates CLAUDE.md, .qe/ directory)/Qplan "one-line description of what you want" — AI suggests a roadmap and phases/Qgs Phase 1: short-name — Two spec documents are generated (approve them and execution starts immediately)/Qplan. Qplan auto-detects task scope and hands off directly to /Qgs for simple tasks (Micro scope) without building a roadmap. You don't have to judge the scope yourself — Qplan automatically splits it into Micro / Small / Full and picks the right path.
/Qplan judges the scope — "one bug, so Micro scale" — and proposes a one-line plan.
No roadmap or phases, it moves straight to the next stage.
/Qgs creates TASK_REQUEST (which file, which function, exactly how to fix it) and
VERIFY_CHECKLIST (what to check after the fix). You just glance at both files, say
"yep, do it that way", and approve.
/Qatomic-run executes the checklist in parallel. If there are multiple files, multiple AI workers (Haiku)
fix them at the same time. Hooks auto-check "am I doing this right?" each time.
/Qcode-run-task runs tests and checks the VERIFY_CHECKLIST one by one.
If any item fails, it auto-fixes → re-runs → re-checks (up to 3 times). Once all pass, you get a completion report.
Words worth knowing before reading the full Reference.
/Qplan.main, or chain destructive steps. Do not enable unless requirements are explicit, every step is reversible, and you're on an isolated feature branch. See USAGE_GUIDE §10 for the full pre-flight checklist. Always run /Qutopia off before ending the session./Qcontract and /Qverify-contract./Qcontractcreate / edit / list / approve author, modify, review, and lock contracts under .qe/contracts/active/. Approval is enforced by a two-layer defense: interactive gate (AskUserQuestion) + hash record in .qe/contracts/.lock./Qverify-contractEcontract-judge agent). A 3-hash cache (contract / impl / test) prevents re-judging unchanged content. Supports single-contract (<name>) and bulk (--all) modes. Auto-invoked by /Qcode-run-task at Step 4.10.Once the big picture clicks — the full system architecture unfolds across 10 sections.
View full Reference→