Big Idea

The argument in one line.

The Claude Certified Architect exam guide is really a working syllabus for Claude Code, and five cross-cutting rules - hooks over prompts for anything high-stakes, structured errors, 4-5 tools per agent, separate-session review, and few-shot over instructions - capture roughly 80% of its value.

Who This Is For

Read if. Skip if.

READ IF YOU ARE…

A developer using Claude Code daily who wants to stop guessing at best practices and lock in a mental model that matches Anthropic's own.
An engineer thinking about taking the Claude Certified Architect exam who wants a 39-minute orientation before paying for the official prep materials.
A founder or team lead building agentic features in production who keeps hitting reliability issues and suspects the problem is architectural, not prompt-level.
Anyone shipping MCP servers, sub-agents, or CI/CD-driven Claude workflows who wants to know which patterns the exam treats as correct.

SKIP IF…

You have never used Claude Code or the Anthropic SDK - the video assumes you already know the basics of agents, tools, and prompts.
You want a deep tutorial on writing hooks, MCP servers, or CI scripts - this is a conceptual map, not a hands-on build.

TL;DR

The full version, fast.

Anthropic released a five-domain Claude Certified Architect exam, and the official 40-page guide doubles as the best Claude Code syllabus currently in print. The video walks through every domain - Agentic Architecture (27%), Tool & MCP (18%), Claude Code Configuration (20%), Prompt Engineering (20%), and Context Management & Reliability (15%) - pulling out the highest-leverage idea from each. The closing five rules are the through-line: when a behavior must hold 100% of the time use a hook not a prompt, always return structured errors, cap each agent at four to five tools, review code in a fresh session that did not write it, and show two or three concrete examples instead of writing more instructions.

Free for members

Chat with this breakdown — free.

Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.

Create a free account →

Chapters

Where the time goes.

00:00 – 01:39

01 · Anthropic just dropped a real certification

Cold open + pitch: a 40-page exam guide is the best Claude Code syllabus available, and the host has read it cover to cover.

01:39 – 02:03

02 · The 5 domains explained

Pie-chart breakdown of the exam's five domains and their weighting.

02:03 – 03:25

03 · Domain 1: Agentic Architecture (27%)

Why agent architecture is the single most important domain - how Claude thinks, coordinates, and enforces rules.

03:25 – 04:07

04 · The Agentic Loop

Request -> response -> tool_use -> execute -> repeat. The only field that decides when the loop ends is stop_reason.

04:07 – 05:18

05 · Anti-patterns to avoid

Three mistakes: parsing text for 'I'm done', setting hard iteration caps, and ignoring stop_reason.

05:18 – 06:38

06 · Hub-and-Spoke: coordinator + subagents

One coordinator agent decomposes the task, subagents run in isolated contexts, results merge at the end.

06:38 – 07:34

07 · The narrow decomposition mistake

Coordinators that decompose too narrowly miss whole branches of the problem - fix by giving broad goals, not narrow checklists.

07:34 – 08:49

08 · Live demo: spawning 3 subagents in parallel

Terminal demo: a single prompt spawns video/written/audio research subagents in Claude Code, each using its own tools and tokens, with a final synthesis.

08:49 – 10:00

09 · Prompts vs Hooks (most important concept)

Prompts are best-effort suggestions; hooks are deterministic scripts that physically block actions. The exam guide's most contentious section.

10:00 – 10:55

10 · When to use hooks over prompts

Prompts for style/tone/formatting; hooks for compliance, financial, and security. When money is on the line, hooks every time.

10:55 – 11:42

11 · Live demo: /hooks in Claude Code

Walk-through of /hooks in the terminal and the claude-code-guide agent as a second way to discover the right hook.

11:42 – 12:07

12 · Domain 2: Tool Design & MCP (18%)

Tool descriptions are the single highest-leverage thing you can fix - they decide which tool fires when descriptions overlap.

12:07 – 13:10

13 · Why tool descriptions matter most

Bad descriptions cause 40% misrouting; good descriptions with explicit do-not-use clauses drop misrouting to under 2%.

13:10 – 13:54

14 · The tool overload problem

Project vs user-level MCP, where to put .mcp.json, how to keep API keys in environment files.

13:54 – 15:28

15 · MCP server scoping: project vs user level

How project-level and user-level MCP differ; community vs custom servers; checking what is wired up with claude mcp list.

15:28 – 16:57

16 · tool_choice: auto, any, forced

An agent with 18 tools makes worse decisions than one with 5. tool_choice can be auto, any, or forced - force the first move, then loosen the leash.

16:57 – 18:51

17 · Domain 3: Claude Code Configuration (20%)

Most people dump everything into one CLAUDE.md. The guide splits it into three layers: user, project, and path-specific.

18:51 – 20:04

18 · The 3-layer CLAUDE.md hierarchy

Layer 1 user-level (~/.claude/CLAUDE.md), Layer 2 project-level (.claude/CLAUDE.md, version-controlled), Layer 3 path-specific (.claude/rules/*.md).

20:04 – 21:06

19 · Path-specific rules (.claude/rules/)

Rule files load only when their glob matches - testing rules for test files, API rules for the API folder. Keeps CLAUDE.md lean.

21:06 – 22:25

20 · Commands vs skills vs plan mode

Commands are reusable slash prompts; skills are scoped, isolated mini-agents with their own tool allow-list; plan mode is for ambiguous multi-file work.

22:25 – 23:40

21 · Claude Code in CI/CD pipelines

Using claude -p with --output-format json to make Claude Code run non-interactively inside a GitHub Action.

23:40 – 24:31

22 · The -p flag and --output-format json

How the -p and --output-format json flags turn Claude Code from a chat tool into something a pipeline can drive.

24:31 – 25:45

23 · Why you need a separate review session

A session biased by writing the code is bad at reviewing it. Always spin a fresh, stateless session for review.

25:45 – 26:26

24 · Domain 4: Prompt Engineering (20%)

When Claude is inconsistent, the instinct is to write more rules. The exam guide says: show 2-3 examples instead.

26:26 – 27:15

25 · Few-shot examples vs instructions

Two or three concrete input/output examples beat a paragraph of detailed instructions every single time. Claude learns the underlying pattern.

27:15 – 28:06

26 · Guaranteed JSON with tool_use

Define a tool with a JSON schema, set tool_choice to forced, and extract from the tool_use response. Eliminates syntax errors, not semantic ones.

28:06 – 29:23

27 · The validation loop

When the model misreads data, retry with specific feedback - the original document, the extracted field, the literal mismatch. Not just 'try again'.

29:23 – 29:45

28 · Domain 5: Context Management (15%)

Why Claude pays attention at the beginning and the end of context but goes fuzzy in the middle, and what to do about it.

29:45 – 30:37

29 · The lost in the middle effect

The first 40% of context is well-primed, the end has recency bias, and the middle drifts. Tool outputs push the important stuff into the fuzzy zone.

30:37 – 31:46

30 · 3 ways to fix context bloat

Pin a key-facts block at the top, trim verbose tool outputs, delegate messy work to subagents whose context never pollutes yours.

31:46 – 32:24

31 · /memory in Claude Code

/memory shows what is actually in scope right now - project memory, user memory, auto memory - so you can audit before a fresh session.

32:24 – 33:41

32 · When to escalate to a human

Three escalation scenarios: customer asks for a human (escalate immediately), policy is ambiguous (escalate with a structured handoff), or issue is straightforward (resolve, but still offer a human).

33:41 – 35:00

33 · Error propagation done right

Never return a generic 'failed'. Include what broke, what was tried, what partially worked, and what alternatives exist.

35:00 – 37:12

34 · The 5 rules that apply across every domain

Closing recap: hooks for high-stakes work, structured errors, 4-5 tools per agent, separate session for review, few-shot examples beat instructions.

37:12 – 38:07

35 · Interactive study prompts (X post resource)

A community-built prompt set turns Claude Code into an interactive instructor that quizzes you on each domain.

38:07 – 39:01

36 · Free study guide + where to go next

Mark plugs his own mega-guide and Skool community as deeper next-step resources.

Atomic Insights

Lines worth screenshotting.

Anthropic's Claude Certified Architect exam weights its five domains at 27/18/20/20/15 - agent architecture is by far the biggest single chunk of the test.
The only reliable signal that Claude is finished with an agentic loop is the stop_reason field; parsing text for 'I'm done' or capping iterations both break in production.
Sub-agents do not communicate with each other - each runs in its own context, and the main agent only sees their final summaries when it stitches the answer together.
The most common coordinator failure is decomposing the task too narrowly; the fix is to give broad goals and let the sub-agents pick their own subtasks.
In Anthropic's own example, a prompt-based refund check fails 12% of the time and refunds the wrong customer; a hook drops that to zero because it physically blocks the action.
Prompts are for style, tone, and formatting; hooks are for anything where one failure causes real harm - financial, compliance, or security.
Tool descriptions are the highest-leverage thing in MCP work: ambiguous descriptions misroute 40% of the time, while good ones with explicit do-not-use clauses drop misrouting under 2%.
An agent with 18 tools makes worse decisions than one with 5, because each extra option is a chance for the model to wander outside its lane.
tool_choice has three modes - auto, any, forced - and the guide recommends forcing the first move so the agent always starts the same way, then loosening the leash.
CLAUDE.md should be split across three layers: user preferences in your home directory, team rules in the project, and path-specific rules that only load when a matching file is open.
Path-specific rules in .claude/rules/ are the underused unlock - testing rules only load for test files, API rules only load in the API folder, and your token bill drops.
claude -p combined with --output-format json turns Claude Code from a chat into something a CI pipeline can call non-interactively, returning structured JSON.
Never let the Claude session that wrote the code also review it - the writer is biased toward its own work, and a fresh session catches what the first one misses.
Two or three concrete few-shot examples beat a full page of written instructions every time, because Claude learns the underlying pattern instead of memorising the examples.
Forcing tool_use with a JSON schema eliminates syntax errors like malformed JSON but does not eliminate semantic errors - you still need a validation loop for wrong values.
Claude pays high attention to the start and end of a context window and gets fuzzy in the middle; tool outputs piling up in the middle quietly degrade reliability over time.
The three context-management fixes are: pin a key-facts block at the top, trim verbose tool outputs, and delegate messy work to sub-agents whose context never pollutes yours.
If a customer asks for a human, escalate immediately - sentiment scoring will misread sarcasm and cultural tone, and the policy line is clearer than the model.
When escalating to a human, send a structured handoff - customer ID, root cause, what was tried, recommended action - not a 'sorry, transferring' message.
Generic 'failed' errors are useless; every error should include what failed, what was tried, what partially worked, and what alternatives exist so the next agent can act.

Takeaway

Five rules that decide how reliable your Claude agents will be.

WHAT TO LEARN

Reliable Claude systems come from picking the right level of determinism for each problem - hooks where it must work, prompts where it just needs to read well, and structured handoffs everywhere in between.

04The Agentic Loop

Check stop_reason after every Claude response - it is the only reliable signal that the loop is done.
Do not parse the assistant's text for phrases like 'I'm done' or 'task complete' - they break.
Do not cap iterations at a fixed number; you do not know in advance how deep a task needs to go.

06Hub-and-Spoke: coordinator + subagents

Sub-agents run in isolated contexts and never see each other's work; only the coordinator merges results.
Give the coordinator broad goals, not narrow checklists, so it does not decompose the task too narrowly.
Let each sub-agent pick its own tools and tokens within its narrow goal - that is what makes them precise.

09Prompts vs Hooks (most important concept)

Anthropic's own example shows a prompt-based refund check fails 12% of the time and refunds the wrong customer.
Use prompts for style, tone, and formatting - anything where 90% is good enough.
Use hooks for compliance, financial, and security - anything where one failure causes real harm.

13Why tool descriptions matter most

Ambiguous overlapping descriptions cause misrouting in roughly 40% of cases.
Adding explicit 'use INSTEAD OF X when Y' clauses drops misrouting under 2%.
The description is the interface - fix descriptions before adding new tools or routing layers.

16tool_choice: auto, any, forced

An agent with 18 tools makes worse decisions than one with 5; less choice means better decisions.
Force the first tool call so step one is always consistent, then loosen the leash for the rest.
Use tool_choice = forced for extraction pipelines where the schema must be filled exactly.

18The 3-layer CLAUDE.md hierarchy

Layer 1 user-level stays in your home directory and is never shared via git.
Layer 2 project-level lives at .claude/CLAUDE.md and is version-controlled for the whole team.
Layer 3 path-specific rules in .claude/rules/ only load when an open file matches their glob.

23Why you need a separate review session

The session that wrote the code is biased toward thinking it is correct.
Spin up a fresh, stateless Claude session for review - it catches what the writer never will.
claude -p with --output-format json lets a CI pipeline run that review automatically.

26Few-shot examples vs instructions

Two or three concrete examples beat a full page of written instructions every time.
Claude does not memorise the examples; it generalises the underlying pattern.
Show null for one missing field once, and Claude uses null for all missing fields after.

29The lost in the middle effect

The first 40% of context is well-primed and the end has recency bias - the middle drifts.
Pin a key-facts block at the top so critical state stays in high-attention territory.
Trim verbose tool outputs and delegate messy work to sub-agents whose context never leaks back.

31When to escalate to a human

If the customer asks for a human, escalate immediately - do not try to resolve first.
If policy is ambiguous, escalate with a structured handoff package, not a vague apology.
If the issue is straightforward, resolve it but still offer the option of a human.

34The 5 rules that apply across every domain

Hooks beat prompts whenever a behaviour must hold 100% of the time.
Always return structured errors - never a generic 'failed' message.
Cap each agent at 4-5 tools to keep decisions sharp.
Use a fresh Claude session to review code; the writer is too biased.
Show two or three examples instead of writing more instructions.

Glossary

Terms worth knowing.

Claude Certified Architect: A new Anthropic certification exam, pass/fail, organised around five weighted domains covering how to design and operate Claude-based agentic systems.
Agentic loop: The repeating cycle of request, response, tool_use, execute, and continue that drives every Claude agent - the loop ends when stop_reason indicates completion.
stop_reason: The field on a Claude response that signals why the agent stopped - 'tool_use' means it wants to call a tool, 'end_turn' means it is done. The only reliable end-of-loop signal.
Hub-and-spoke architecture: A coordinator agent at the centre delegates subtasks to specialised sub-agents that run in isolated contexts and return summaries the coordinator stitches together.
Sub-agent: A Claude agent spawned by a coordinator with its own context window, its own tool set, and no memory of what other sub-agents are doing.
Hook: A deterministic script Claude Code runs before or after an action; if the hook returns a block, the action physically cannot happen. The opposite of a prompt suggestion.
MCP server: Model Context Protocol server - a process exposing tools or data to Claude through a standard interface, scoped at either project (.mcp.json) or user level.
Tool description: The natural-language text attached to a tool that tells Claude when to call it; the single largest lever for reducing misrouting between similar tools.
tool_choice: An API parameter with values auto, any, or forced that controls whether Claude can decide to skip tools, must call some tool, or must call one specific tool.
CLAUDE.md: A markdown file Claude Code loads into context on every session; the exam guide recommends splitting it across user, project, and path-specific layers to avoid loading everything every time.
Path-specific rules: Rule files in .claude/rules/ that declare a glob pattern at the top and only load when an open file matches, keeping CLAUDE.md lean.
Skill: A scoped Claude-Code capability defined in its own file with a frontmatter declaring allowed tools, running in an isolated context so its messy work does not clutter the main conversation.
Plan mode: A Claude Code mode where the agent explores, reads, and proposes changes without modifying any file, so a human can approve or redirect before execution.
CI/CD pipeline: Continuous Integration / Continuous Deployment - an automated chain that builds, tests, and ships code without manual intervention; the guide covers how to drop Claude Code into this chain.
claude -p: The Claude Code flag that runs a single task non-interactively with no confirmation prompts, suitable for scripting and CI.
Few-shot examples: A prompting technique where you show the model two or three concrete input/output pairs instead of describing the rules in prose; Claude generalises the pattern.
Lost in the middle: The phenomenon where a long-context model pays close attention to the start and end of its context but degrades on information buried in the middle.
Forced tool use: Setting tool_choice to a specific tool so Claude must produce a tool_use block in that schema; eliminates JSON syntax errors but not semantic ones.

Resources

Things they pointed at.

00:00productEarly AI Adopters (Skool community) ↗

38:07productClaude Architect Study Guide (Gumroad) ↗

00:22linkAnthropic Claude Certified Architect exam guide

16:58toolWarp (terminal used in live demo)

15:40toolclaude mcp list (Claude Code command)

31:38tool/memory (Claude Code command)

38:07linkX post: I want to become a Claude code architect (community prompt set)

Quotables