The argument in one line.
AI agents produce better code when you encode strict, process-driven workflows as reusable skills that force them to mirror how senior engineers think through design, testing, and architecture.
Read if. Skip if.
- A developer using Claude Code or similar AI agents who wants to establish repeatable processes that keep autonomous coding assistants on track and improve output quality.
- An engineer integrating AI into your development workflow who needs practical patterns for steering agent behavior without relying on memory or context carryover.
- A software engineer with 3+ years of experience who's experimenting with AI-assisted coding and wants tactical skills to encode your domain knowledge into agent instructions.
- You're new to programming or haven't shipped production code — this assumes solid engineering fundamentals and experience debugging complex systems.
- You're looking for Claude Code setup tutorials or beginner onboarding — this is advanced applied technique, not foundational tooling instruction.
- You work primarily in non-code domains or don't use Claude Code as part of your daily workflow — these skills are specific to that tool and process.
The full version, fast.
Process matters more than ever when working with AI agents that have no memory, and structured skills are how you encode that process so agents walk the same path every time. The approach chains five composable skills: a grill-me skill that forces the agent to interview you down every branch of the design tree before writing a plan, a write-a-PRD skill that turns the shared understanding into a GitHub issue describing the destination, a PRD-to-issues skill that slices the destination into vertical tracer-bullet tickets with blocking relationships, a TDD skill that drives a strict red-green-refactor loop, and an architecture skill that spawns parallel sub-agents to propose deeper module designs. Treat agents like humans with weird constraints, refactor toward deep modules with thin interfaces, and code quality compounds.
Chat with this breakdown.
Modern Creator members can chat with any breakdown — ask for the hook, quote a framework, find the exact transcript moment. Unlocks at T2: refer 3 friends + add your own API key.
Create a free account →Where the time goes.

01 · Intro - process is the product
Sets up the argument: agents have no memory, so process documentation is the multiplier. Shows the skills repo README.

02 · /grill-me
Three-sentence skill forcing Claude to interview you about every branch of a design tree before touching code. Live 16-question session demonstrated.

03 · /write-a-prd
Five-step skill producing a GitHub issue PRD with problem statement, solution, user stories, and implementation decisions.

04 · /prd-to-issues
Converts the destination PRD into tracer-bullet GitHub issues with blocking relationships, safe for parallel agents.

05 · /tdd
Red-green-refactor loop for autonomous agents. Tests verify behavior through public interfaces, not implementation.

06 · /improve-codebase-architecture
John Ousterhout deep-module philosophy. Spawn 3+ sub-agents with radically different interfaces, compare, recommend hybrid, create GitHub RFC.

07 · Course pitch + outro
Claude Code for Real Engineers - 2-week cohort. Curriculum overview shown.
Lines worth screenshotting.
- The grill-me skill is only three sentences long and reliably produces sixteen or more clarifying questions on a complex feature — proof that skill length has no correlation to impact.
- The design tree framework from Frederick Brooks treats a feature design as a branching structure where every decision creates downstream dependencies that must be resolved before coding begins.
- Invoking grill-me after giving Claude some research forces it to walk every branch of the design tree and surface decisions the developer had not thought through.
- The write-a-PRD skill skips steps already completed by grill-me — it recognizes when a deep interview has already happened and jumps directly to the module sketching phase.
- PRDs are submitted as GitHub Issues, which makes them the source of truth for a Ralph loop that processes each issue until it is done.
- User stories in the PRD describe desired system behavior in language — the format is agile-inspired but the goal is durability so the PRD stays useful as the code evolves.
- Process is more important than ever when the engineers have no memory — strict, well-defined workflows are the only way to steer agents consistently toward useful output.
- The write-a-PRD skill chain (grill-me → PRD → GitHub Issue → Ralph loop) is a complete autonomous development pipeline that runs without human intervention once the spec is locked.
- Implementation decisions in a PRD should be non-prescriptive to stay durable — over-specifying the implementation creates conflicts when the code diverges from the document.
- A grilling session of 30 to 50 questions on a really complex feature is normal and worthwhile — the time invested upfront eliminates far more time lost to misaligned builds.
- Skills encode process so the agent has a strict path to walk every single time — this is the mechanism by which code quality improves as a skill library grows.
- The five daily skills together form a complete development loop: understand the problem, write the spec, verify the modules, execute the build, and review the result.
Steal the skill chain.
The five skills form a complete, repeatable workflow from vague idea to production-quality code, and each one is three to fifty lines of markdown.
- Build a grill-me skill first: three sentences that force the model to interview you before touching code. Ship nothing before this.
- Make write-a-prd output GitHub issues, not local docs. Issues become the memory the agent fetches on every task.
- Use prd-to-issues to break work into vertical slices through ALL layers, not horizontal layer-by-layer sprints.
- Wire tdd into your autonomous loop so every issue gets red-green-refactor by default.
- Run improve-codebase-architecture weekly. Shallow modules are the number one reason AI outputs degrade over time.
- Skills compound: the chain is the system, any skill in isolation is half the value.
Terms worth knowing.
- Claude Code
- Anthropic's command-line coding agent that runs in a terminal and edits, runs, and reasons about code in a repository. Used here as the primary interface for invoking AI engineering workflows.
- Skill
- A reusable instruction file that tells an AI coding agent how to perform a specific task or follow a specific process. Skills are invoked by name and inject their contents into the model's context to steer behavior.
- Agent
- An autonomous AI worker that reads instructions, plans steps, and executes code-related actions like editing files, running commands, or opening pull requests without step-by-step human input.
- Plan mode
- A Claude Code mode where the agent drafts a written plan before touching any files, letting the human review and adjust the approach before implementation begins.
- LLM
- Large language model — the underlying AI that generates text and code from prompts. In coding agents, it is the engine doing the reasoning and writing.
- Context window
- The fixed amount of text a language model can consider at once, including prior messages, instructions, and code. Once it fills up, older information must be summarized or dropped.
- Design tree
- A branching map of design decisions where each choice spawns further sub-decisions that must be resolved before committing to an implementation. Walking the tree forces every dependency between decisions to be made explicit.
- PRD
- Product requirements document — a written spec that defines the problem, the desired solution, and the user stories a feature must satisfy. Acts as the durable destination description for a piece of work.
- User story
- A short statement describing a feature from the user's point of view, typically framed as 'as a user, I want X so that Y.' Comes from agile methodology and is used to capture desired behavior in plain language.
- Cucumber language
- A Given-When-Then syntax for writing executable behavior specifications, popularized by the Cucumber testing framework. Used to express user stories in a structured, testable form.
- GitHub issue
- A tracked task or ticket inside a GitHub repository, used to record bugs, features, or work items. Issues can reference each other to establish parent-child or blocking relationships.
- Kanban board
- A visual workflow board where tasks move through columns like To Do, In Progress, and Done. Used here as a metaphor for a queue of independently grabbable issues.
- Vertical slice
- A small piece of work that cuts through every layer of a system — UI, logic, data, integrations — to deliver one end-to-end behavior. Contrasts with a horizontal slice that only builds one layer at a time.
- Tracer bullet
- A development approach where the team builds a minimal end-to-end path through the whole system first, then iterates. Borrowed from gunnery, where tracer rounds let the shooter see where shots are landing and adjust aim.
- Unknown unknowns
- Risks or problems you do not yet know exist. Good task ordering attacks these first so they surface early, while there is still time to change the plan.
- Blocking relationship
- A dependency between tasks where one issue cannot start until another is finished. Tracking these explicitly lets agents pick up only the work that is currently unblocked.
- Ralph loop
- An autonomous loop that repeatedly picks the next open task, runs an AI agent against it, commits the result, and moves on until the queue is empty. Named after a popular pattern for unattended agent runs.
- TDD
- Test-driven development — a practice where you write a failing test first, then write just enough code to make it pass, then clean up. Forces design decisions to be made through tests rather than guesswork.
- Red green refactor
- The three-step rhythm of test-driven development: write a failing test (red), make it pass (green), then improve the code without changing behavior (refactor).
- Interface
- In software design, the set of functions, types, or endpoints a module exposes to its callers — the contract — separated from the internal implementation that fulfills it.
Things they pointed at.
Lines you could clip.
“Skills don't have to be long to be impactful. You've just gotta choose the right words for the LLM at the right time.”
“If you have a garbage code base, the AI is gonna produce garbage within that code base.”
“The most successful way to get code quality up from agents is just to treat them like humans. Humans with weird constraints.”
“At your fingertips now, you have access to a fleet of middling to good engineers that you can deploy at any time. But the weird thing about these engineers is they have no memory.”
Word for word.
The bait, then the rug-pull.
Process is the product. Matt Pocock opens with a deceptively plain observation: AI agents have no memory, which means the only way to get reliable output is to encode senior engineering decision-making into the prompts themselves.
Named ideas worth stealing.
Design Tree
From Frederick Brooks: walk every branch of a design decision tree before committing to implementation.
Deep Module Philosophy
A deep module has a small interface hiding a large implementation. Makes code testable at boundaries and easier for AI to navigate.
Vertical Slice / Tracer Bullet
Each issue cuts through ALL integration layers end-to-end, not a horizontal slice of one layer. Flushes unknown unknowns first.
Skill Chain
- grill-me
- write-a-prd
- prd-to-issues
- tdd
- improve-codebase-architecture
Five skills chaining: idea to shared understanding to destination doc to journey to implementation to refactor.
How they asked for the click.
“Claude Code for Real Engineers. Two week cohort starting March 30, 40% off for 7 days.”
Pitched twice - near the top and at the end. Course landing page shown full-screen via picture-in-picture.









































































