Staff Engineers
Keep agents aligned with architecture, invariants, and repo conventions.
Specification-driven engineering context
Spellbook turns requirements, architecture, tests, and local conventions into enforceable context for coding agents.
Without executable specifications, AI-generated code drifts from product intent, architecture boundaries, security rules, and team conventions.
Spellbook turns an agent run into a governed task record: spec, plan, code, tests, and evidence stay connected.
requirement pack: AUTH-FUNC-002 AUTH-SEC-004 - architecture - local conventions - context - session boundary
1. load invariants 2. inspect routes 3. update service 4. add tests 5. produce report
changed files: auth.routes.ts session.store.ts login.spec.ts me.spec.ts
npm test 18 passed 0 failed 0 skipped
review_ready requirements 7/8 forbidden shortcuts 0 human review required
Spellbook is for teams adopting coding agents in systems where architecture, security, compliance, and correctness cannot be left to prompt memory.
Keep agents aligned with architecture, invariants, and repo conventions.
Adopt AI codegen with governance, auditability, and measurable quality signals.
Standardize agent workflows across repos, stacks, and delivery gates.
Use AI speed without losing control of product intent and code structure.
Preserve evidence for requirements, tests, approvals, and releases.
The governed loop keeps agent work connected to executable requirements, architecture constraints, delivery gates, and review evidence.
Create the workspace contract, repo boundary, and allowed agent tools.
Capture product intent, domain truth, requirements, and acceptance checks.
Map components, ownership, integration patterns, and system constraints.
Load repo-specific rules for naming, errors, tests, logging, and layout.
Generate an implementation plan before code changes begin.
Run the agent in an isolated workspace with controlled permissions.
Execute required checks and attach results to the task record.
Compare the diff against requirements, architecture, and conventions.
Promote only changes that satisfy delivery gates.
Run post-merge or environment-level validation.
Track runtime behavior, failures, latency, cost, and quality signals.
Feed discoveries back into specs, requirements, and future plans.
A small authentication system is enough to show why prompt-first codegen breaks down.
Build login.
Intent: Users can register and log in. Domain: User, Session States: User: Active, Disabled Session: Active, Revoked, Expired Invariants: DisabledUserCannotLogin SessionHasExpiry PasswordHashNeverReturned OnlyActiveSessionMayAuthorizeProtectedRoute
Generated: POST /register POST /login POST /logout GET /me Evidence: 8 tests passed 4 requirements satisfied 0 forbidden shortcuts detected 1 review note created
Manifesto and docs now live as real offline pages with the same brutalist system and local theme state.
Why agentic codegen needs governed execution, not better prompt memory.
Preview task records, packs, gates, commands, and evidence surfaces.
Turn a rough software idea into structured project truth before agent execution begins.