The argument in one line.
The Claude Certified Architect exam guide is really a working syllabus for Claude Code, and five cross-cutting rules - hooks over prompts for anything high-stakes, structured errors, 4-5 tools per agent, separate-session review, and few-shot over instructions - capture roughly 80% of its value.
Read if. Skip if.
- A developer using Claude Code daily who wants to stop guessing at best practices and lock in a mental model that matches Anthropic's own.
- An engineer thinking about taking the Claude Certified Architect exam who wants a 39-minute orientation before paying for the official prep materials.
- A founder or team lead building agentic features in production who keeps hitting reliability issues and suspects the problem is architectural, not prompt-level.
- Anyone shipping MCP servers, sub-agents, or CI/CD-driven Claude workflows who wants to know which patterns the exam treats as correct.
- You have never used Claude Code or the Anthropic SDK - the video assumes you already know the basics of agents, tools, and prompts.
- You want a deep tutorial on writing hooks, MCP servers, or CI scripts - this is a conceptual map, not a hands-on build.
The full version, fast.
Anthropic released a five-domain Claude Certified Architect exam, and the official 40-page guide doubles as the best Claude Code syllabus currently in print. The video walks through every domain - Agentic Architecture (27%), Tool & MCP (18%), Claude Code Configuration (20%), Prompt Engineering (20%), and Context Management & Reliability (15%) - pulling out the highest-leverage idea from each. The closing five rules are the through-line: when a behavior must hold 100% of the time use a hook not a prompt, always return structured errors, cap each agent at four to five tools, review code in a fresh session that did not write it, and show two or three concrete examples instead of writing more instructions.
Chat with this breakdown.
Modern Creator members can chat with any breakdown — ask for the hook, quote a framework, find the exact transcript moment. Unlocks at T2: refer 3 friends + add your own API key.
Create a free account →Where the time goes.

01 · Anthropic just dropped a real certification
Cold open + pitch: a 40-page exam guide is the best Claude Code syllabus available, and the host has read it cover to cover.

02 · The 5 domains explained
Pie-chart breakdown of the exam's five domains and their weighting.

03 · Domain 1: Agentic Architecture (27%)
Why agent architecture is the single most important domain - how Claude thinks, coordinates, and enforces rules.

04 · The Agentic Loop
Request -> response -> tool_use -> execute -> repeat. The only field that decides when the loop ends is stop_reason.

05 · Anti-patterns to avoid
Three mistakes: parsing text for 'I'm done', setting hard iteration caps, and ignoring stop_reason.

06 · Hub-and-Spoke: coordinator + subagents
One coordinator agent decomposes the task, subagents run in isolated contexts, results merge at the end.

07 · The narrow decomposition mistake
Coordinators that decompose too narrowly miss whole branches of the problem - fix by giving broad goals, not narrow checklists.

08 · Live demo: spawning 3 subagents in parallel
Terminal demo: a single prompt spawns video/written/audio research subagents in Claude Code, each using its own tools and tokens, with a final synthesis.

09 · Prompts vs Hooks (most important concept)
Prompts are best-effort suggestions; hooks are deterministic scripts that physically block actions. The exam guide's most contentious section.

10 · When to use hooks over prompts
Prompts for style/tone/formatting; hooks for compliance, financial, and security. When money is on the line, hooks every time.

11 · Live demo: /hooks in Claude Code
Walk-through of /hooks in the terminal and the claude-code-guide agent as a second way to discover the right hook.

12 · Domain 2: Tool Design & MCP (18%)
Tool descriptions are the single highest-leverage thing you can fix - they decide which tool fires when descriptions overlap.

13 · Why tool descriptions matter most
Bad descriptions cause 40% misrouting; good descriptions with explicit do-not-use clauses drop misrouting to under 2%.

14 · The tool overload problem
Project vs user-level MCP, where to put .mcp.json, how to keep API keys in environment files.

15 · MCP server scoping: project vs user level
How project-level and user-level MCP differ; community vs custom servers; checking what is wired up with claude mcp list.

16 · tool_choice: auto, any, forced
An agent with 18 tools makes worse decisions than one with 5. tool_choice can be auto, any, or forced - force the first move, then loosen the leash.

17 · Domain 3: Claude Code Configuration (20%)
Most people dump everything into one CLAUDE.md. The guide splits it into three layers: user, project, and path-specific.

18 · The 3-layer CLAUDE.md hierarchy
Layer 1 user-level (~/.claude/CLAUDE.md), Layer 2 project-level (.claude/CLAUDE.md, version-controlled), Layer 3 path-specific (.claude/rules/*.md).

19 · Path-specific rules (.claude/rules/)
Rule files load only when their glob matches - testing rules for test files, API rules for the API folder. Keeps CLAUDE.md lean.

20 · Commands vs skills vs plan mode
Commands are reusable slash prompts; skills are scoped, isolated mini-agents with their own tool allow-list; plan mode is for ambiguous multi-file work.

21 · Claude Code in CI/CD pipelines
Using claude -p with --output-format json to make Claude Code run non-interactively inside a GitHub Action.

22 · The -p flag and --output-format json
How the -p and --output-format json flags turn Claude Code from a chat tool into something a pipeline can drive.

23 · Why you need a separate review session
A session biased by writing the code is bad at reviewing it. Always spin a fresh, stateless session for review.

24 · Domain 4: Prompt Engineering (20%)
When Claude is inconsistent, the instinct is to write more rules. The exam guide says: show 2-3 examples instead.

25 · Few-shot examples vs instructions
Two or three concrete input/output examples beat a paragraph of detailed instructions every single time. Claude learns the underlying pattern.

26 · Guaranteed JSON with tool_use
Define a tool with a JSON schema, set tool_choice to forced, and extract from the tool_use response. Eliminates syntax errors, not semantic ones.

27 · The validation loop
When the model misreads data, retry with specific feedback - the original document, the extracted field, the literal mismatch. Not just 'try again'.

28 · Domain 5: Context Management (15%)
Why Claude pays attention at the beginning and the end of context but goes fuzzy in the middle, and what to do about it.

29 · The lost in the middle effect
The first 40% of context is well-primed, the end has recency bias, and the middle drifts. Tool outputs push the important stuff into the fuzzy zone.

30 · 3 ways to fix context bloat
Pin a key-facts block at the top, trim verbose tool outputs, delegate messy work to subagents whose context never pollutes yours.

31 · /memory in Claude Code
/memory shows what is actually in scope right now - project memory, user memory, auto memory - so you can audit before a fresh session.

32 · When to escalate to a human
Three escalation scenarios: customer asks for a human (escalate immediately), policy is ambiguous (escalate with a structured handoff), or issue is straightforward (resolve, but still offer a human).

33 · Error propagation done right
Never return a generic 'failed'. Include what broke, what was tried, what partially worked, and what alternatives exist.

34 · The 5 rules that apply across every domain
Closing recap: hooks for high-stakes work, structured errors, 4-5 tools per agent, separate session for review, few-shot examples beat instructions.

35 · Interactive study prompts (X post resource)
A community-built prompt set turns Claude Code into an interactive instructor that quizzes you on each domain.

36 · Free study guide + where to go next
Mark plugs his own mega-guide and Skool community as deeper next-step resources.
Lines worth screenshotting.
- Anthropic's Claude Certified Architect exam weights its five domains at 27/18/20/20/15 - agent architecture is by far the biggest single chunk of the test.
- The only reliable signal that Claude is finished with an agentic loop is the stop_reason field; parsing text for 'I'm done' or capping iterations both break in production.
- Sub-agents do not communicate with each other - each runs in its own context, and the main agent only sees their final summaries when it stitches the answer together.
- The most common coordinator failure is decomposing the task too narrowly; the fix is to give broad goals and let the sub-agents pick their own subtasks.
- In Anthropic's own example, a prompt-based refund check fails 12% of the time and refunds the wrong customer; a hook drops that to zero because it physically blocks the action.
- Prompts are for style, tone, and formatting; hooks are for anything where one failure causes real harm - financial, compliance, or security.
- Tool descriptions are the highest-leverage thing in MCP work: ambiguous descriptions misroute 40% of the time, while good ones with explicit do-not-use clauses drop misrouting under 2%.
- An agent with 18 tools makes worse decisions than one with 5, because each extra option is a chance for the model to wander outside its lane.
- tool_choice has three modes - auto, any, forced - and the guide recommends forcing the first move so the agent always starts the same way, then loosening the leash.
- CLAUDE.md should be split across three layers: user preferences in your home directory, team rules in the project, and path-specific rules that only load when a matching file is open.
- Path-specific rules in .claude/rules/ are the underused unlock - testing rules only load for test files, API rules only load in the API folder, and your token bill drops.
- claude -p combined with --output-format json turns Claude Code from a chat into something a CI pipeline can call non-interactively, returning structured JSON.
- Never let the Claude session that wrote the code also review it - the writer is biased toward its own work, and a fresh session catches what the first one misses.
- Two or three concrete few-shot examples beat a full page of written instructions every time, because Claude learns the underlying pattern instead of memorising the examples.
- Forcing tool_use with a JSON schema eliminates syntax errors like malformed JSON but does not eliminate semantic errors - you still need a validation loop for wrong values.
- Claude pays high attention to the start and end of a context window and gets fuzzy in the middle; tool outputs piling up in the middle quietly degrade reliability over time.
- The three context-management fixes are: pin a key-facts block at the top, trim verbose tool outputs, and delegate messy work to sub-agents whose context never pollutes yours.
- If a customer asks for a human, escalate immediately - sentiment scoring will misread sarcasm and cultural tone, and the policy line is clearer than the model.
- When escalating to a human, send a structured handoff - customer ID, root cause, what was tried, recommended action - not a 'sorry, transferring' message.
- Generic 'failed' errors are useless; every error should include what failed, what was tried, what partially worked, and what alternatives exist so the next agent can act.
Five rules that decide how reliable your Claude agents will be.
Reliable Claude systems come from picking the right level of determinism for each problem - hooks where it must work, prompts where it just needs to read well, and structured handoffs everywhere in between.
- Check stop_reason after every Claude response - it is the only reliable signal that the loop is done.
- Do not parse the assistant's text for phrases like 'I'm done' or 'task complete' - they break.
- Do not cap iterations at a fixed number; you do not know in advance how deep a task needs to go.
- Sub-agents run in isolated contexts and never see each other's work; only the coordinator merges results.
- Give the coordinator broad goals, not narrow checklists, so it does not decompose the task too narrowly.
- Let each sub-agent pick its own tools and tokens within its narrow goal - that is what makes them precise.
- Anthropic's own example shows a prompt-based refund check fails 12% of the time and refunds the wrong customer.
- Use prompts for style, tone, and formatting - anything where 90% is good enough.
- Use hooks for compliance, financial, and security - anything where one failure causes real harm.
- Ambiguous overlapping descriptions cause misrouting in roughly 40% of cases.
- Adding explicit 'use INSTEAD OF X when Y' clauses drops misrouting under 2%.
- The description is the interface - fix descriptions before adding new tools or routing layers.
- An agent with 18 tools makes worse decisions than one with 5; less choice means better decisions.
- Force the first tool call so step one is always consistent, then loosen the leash for the rest.
- Use tool_choice = forced for extraction pipelines where the schema must be filled exactly.
- Layer 1 user-level stays in your home directory and is never shared via git.
- Layer 2 project-level lives at .claude/CLAUDE.md and is version-controlled for the whole team.
- Layer 3 path-specific rules in .claude/rules/ only load when an open file matches their glob.
- The session that wrote the code is biased toward thinking it is correct.
- Spin up a fresh, stateless Claude session for review - it catches what the writer never will.
- claude -p with --output-format json lets a CI pipeline run that review automatically.
- Two or three concrete examples beat a full page of written instructions every time.
- Claude does not memorise the examples; it generalises the underlying pattern.
- Show null for one missing field once, and Claude uses null for all missing fields after.
- The first 40% of context is well-primed and the end has recency bias - the middle drifts.
- Pin a key-facts block at the top so critical state stays in high-attention territory.
- Trim verbose tool outputs and delegate messy work to sub-agents whose context never leaks back.
- If the customer asks for a human, escalate immediately - do not try to resolve first.
- If policy is ambiguous, escalate with a structured handoff package, not a vague apology.
- If the issue is straightforward, resolve it but still offer the option of a human.
- Hooks beat prompts whenever a behaviour must hold 100% of the time.
- Always return structured errors - never a generic 'failed' message.
- Cap each agent at 4-5 tools to keep decisions sharp.
- Use a fresh Claude session to review code; the writer is too biased.
- Show two or three examples instead of writing more instructions.
Terms worth knowing.
- Claude Certified Architect
- A new Anthropic certification exam, pass/fail, organised around five weighted domains covering how to design and operate Claude-based agentic systems.
- Agentic loop
- The repeating cycle of request, response, tool_use, execute, and continue that drives every Claude agent - the loop ends when stop_reason indicates completion.
- stop_reason
- The field on a Claude response that signals why the agent stopped - 'tool_use' means it wants to call a tool, 'end_turn' means it is done. The only reliable end-of-loop signal.
- Hub-and-spoke architecture
- A coordinator agent at the centre delegates subtasks to specialised sub-agents that run in isolated contexts and return summaries the coordinator stitches together.
- Sub-agent
- A Claude agent spawned by a coordinator with its own context window, its own tool set, and no memory of what other sub-agents are doing.
- Hook
- A deterministic script Claude Code runs before or after an action; if the hook returns a block, the action physically cannot happen. The opposite of a prompt suggestion.
- MCP server
- Model Context Protocol server - a process exposing tools or data to Claude through a standard interface, scoped at either project (.mcp.json) or user level.
- Tool description
- The natural-language text attached to a tool that tells Claude when to call it; the single largest lever for reducing misrouting between similar tools.
- tool_choice
- An API parameter with values auto, any, or forced that controls whether Claude can decide to skip tools, must call some tool, or must call one specific tool.
- CLAUDE.md
- A markdown file Claude Code loads into context on every session; the exam guide recommends splitting it across user, project, and path-specific layers to avoid loading everything every time.
- Path-specific rules
- Rule files in .claude/rules/ that declare a glob pattern at the top and only load when an open file matches, keeping CLAUDE.md lean.
- Skill
- A scoped Claude-Code capability defined in its own file with a frontmatter declaring allowed tools, running in an isolated context so its messy work does not clutter the main conversation.
- Plan mode
- A Claude Code mode where the agent explores, reads, and proposes changes without modifying any file, so a human can approve or redirect before execution.
- CI/CD pipeline
- Continuous Integration / Continuous Deployment - an automated chain that builds, tests, and ships code without manual intervention; the guide covers how to drop Claude Code into this chain.
- claude -p
- The Claude Code flag that runs a single task non-interactively with no confirmation prompts, suitable for scripting and CI.
- Few-shot examples
- A prompting technique where you show the model two or three concrete input/output pairs instead of describing the rules in prose; Claude generalises the pattern.
- Lost in the middle
- The phenomenon where a long-context model pays close attention to the start and end of its context but degrades on information buried in the middle.
- Forced tool use
- Setting tool_choice to a specific tool so Claude must produce a tool_use block in that schema; eliminates JSON syntax errors but not semantic ones.
Things they pointed at.
Lines you could clip.
“Giving an agent 18 tools is like hiring a brand new employee and giving them access to every single system from day one.”
“Prompts are suggestions and hooks are laws.”
“Fresh eyes, even AI eyes, catch more.”
“Claude doesn't just copy paste your examples. It learns the underlying patterns behind them.”
“Two to three examples will beat a full page of instructions each and every single time.”
“It's infinitely better to start a brand new session with a summarized version of outputs from before versus pushing through a conversation even if you're at that million context window.”
“The description of the tool is really the interface of tooling.”
Word for word.
The bait, then the rug-pull.
Anthropic just shipped a real, pass-or-fail Claude Certified Architect exam, and the 40-page exam guide doubles as the best Claude Code syllabus in print. The host reads every page, then translates the five domains into plain English with diagrams and live demos so you do not have to.
Named ideas worth stealing.
The Five Exam Domains
- Agentic Architecture & Orchestration (27%)
- Tool Design & MCP Integration (18%)
- Claude Code Configuration & Workflows (20%)
- Prompt Engineering & Structured Output (20%)
- Context Management & Reliability (15%)
Anthropic's exam weighting tells you where to spend prep time - agent architecture alone is more than a quarter of the test.
The Agentic Loop
- Code sends request
- Claude responds with stop_reason
- If stop_reason = tool_use, execute the tool
- Feed result back, loop again
- If stop_reason = end_turn, stop
Every Claude agent - SDK, Claude Code, custom framework - runs this loop. The exit signal is the stop_reason field, not text in the response.
Hub-and-Spoke (Coordinator-Subagent) Pattern
- Coordinator decomposes the task
- Specialised sub-agents run in parallel, each in its own context
- Sub-agents do NOT communicate with each other
- Coordinator merges results at the end
The canonical architecture for complex multi-step work. Each sub-agent keeps its own tools and tokens; the coordinator only sees summaries.
Prompts vs Hooks
Prompts are best-effort suggestions (probabilistic, ~88% reliable in Anthropic's refund example). Hooks are deterministic scripts that physically block actions. Style/tone -> prompt. Money/compliance/security -> hook.
Tool Description Anti-Pattern
Vague overlapping descriptions cause ~40% misrouting between similar tools. Adding explicit 'use INSTEAD OF X when Y' clauses drops misrouting under 2%. The description IS the interface.
The 3-Layer CLAUDE.md Hierarchy
- Layer 1: user-level (~/.claude/CLAUDE.md) - personal preferences, never shared
- Layer 2: project-level (.claude/CLAUDE.md) - team rules, version-controlled
- Layer 3: path-specific (.claude/rules/*.md) - conditional, only loaded when matching files are open
Split CLAUDE.md across three layers so Claude only loads what is relevant to the current task. Most people dump everything into one giant file and waste tokens on every session.
The Five Cross-Cutting Rules
- If it has to work 100% of the time, use a hook not a prompt
- Always return structured errors - what failed, what was tried, what alternatives exist
- Cap each agent at 4-5 tools max
- Always review code in a separate Claude session from the one that wrote it
- Two or three concrete examples beat a paragraph of instructions
The closing distillation - five rules that apply across every domain. The unifying principle is determinism: pick the tool with the right level of predictability for the job.
How they asked for the click.
“Check out the first link in the description and maybe join me in my early AI adopters community.”
Soft, value-led. Framed as a community of learners with a course coming, paired with a free mega-guide so the ask does not read as purely commercial.








































































