I Turned Claude Opus 4.8 Into My Entire AI Operating System
A 29-minute walkthrough of the Four Cs framework for running your entire business through Claude Code.
May 29thA 68-minute screen-share where Cole Medin walks through the five-part system that turns prompting-and-praying into directing your coding agent.
You stop vibe coding the moment you sandwich the agent's work between heavy upfront planning and a verification harness that proves the work is actually done, then turn every failure into a permanent rule.
The conversation reframes Claude Code from a coding tool into an operating system you direct like a product manager. The core is a four-step loop — plan, build, verify, evolve — wrapped around two disciplines most people skip: heavy upfront planning that manages a scarce context window, and a verification harness that lets the agent prove its own work instead of just claiming it is done. Context matters because every model has a dumb zone (roughly 250k tokens on Opus) where it starts missing the obvious, so you load only what is needed and reset between tasks. For large jobs you chain multiple single-purpose agent sessions assembly-line style rather than trusting one session end to end. Security means assuming the agent will touch anything it can reach, enforced with hooks rather than prompts. And every bug becomes a permanent upgrade — a new rule, doc, or skill — so the system gets smarter every week.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
Cold open promise — be the director of your coding agents. Cole's background (engineer since age 8, Fortune 500, all-in on agentic AI), the two creators' history, and the framing that Claude Code is a second brain, not just a coding tool.

ClickUp / Brain 2 ad read.

The four-step loop — plan, build, verify, evolve — introduced as the system that replaces prompting-and-praying. Planning and validation are the two things people skip. Every loop ends with a chance to evolve the system; treat the agent like an employee you train.

Verification means 'prove to me it's done.' Cole's Excalidraw skill renders a PNG so Claude can inspect its own diagram for overlaps. Defining a harness — the wrapper of model plus tools plus context — and the silly video-game example of slowing the frame rate so an agent can verify like a user. Plus understanding code via slash-by-the-way sidecar conversations.

The PLAN.md north star — goal and success criteria, codebase and docs analysis, integration points, task-specific rules, granular task list, validation strategy — plus the workflow: prime, research with sub-agents, write the plan, have the agent ask questions, then execute. Why Cole skips plan mode for a custom planning skill he controls, and why he builds on raw Claude Code rather than OpenClaude or Hermes.

Attention is scarce. The million-token window is a false sense of security; every model has a dumb zone (around 250k on Opus, ~100-125k on Sonnet) where it starts missing the obvious. Context fills fast, MCP servers dump 20k tokens up front, and the needle-in-a-haystack problem worsens deeper in. Skills give procedures on demand rather than dumping everything at once.

Chaining agent sessions assembly-line style: planner hands off to implementer, implementer writes an execution report, reviewer validates. The Ralph loop as the foundational pattern. Why sub-agents struggle with handoffs, why agent teams are powerful but token-heavy, and Cole's Archon project for deterministic workflows that pick when the model works versus when code runs.

Assume the agent will touch anything it can reach. Three false senses of security: a prompt that says never delete, blocking delete SQL, and even blocking the delete command (it can still write and run a script). The real-world incident where a proactive agent emailed the whole list a discount code by mistake. Hooks as the actual permission layer — inspect every tool call before it runs.

System evolution as the most important habit: after every issue, add a rule, a reference doc, a plan update, or a new command so it never recurs. The floor keeps rising — doc to command to skill — and you start welcoming bugs. Finding edge cases up front by asking the agent 'how could this go wrong,' then engineering a test to break it. Debate panels and adversarial development for decisions.

Cole's three: skills (number one — any reusable prompt becomes a skill, and the skill-plus-CLI combo beats MCP for token efficiency), sub-agents (parallel research and context extraction), and hooks (security plus the second-brain memory loop). Nate's three: skills, status line, and routines. Closing: be the product manager for Claude Code — give it the why (intent engineering).

Where to find Cole Medin (YouTube, LinkedIn), and the free resource guide in the Skool community.
Reliable agent results come from wrapping the build between heavy upfront planning and a real verification harness, while protecting a context window that gets dumb long before it gets full.
“The main thing I wanna talk about today is how we can be the director of our coding agents.”
“Large language models have what's called the dumb zone. With Opus right now, it's usually around 250,000 tokens.”
“Without the verification checks, maybe it's 65 or 70, but now you can get something that is 92 on the first pass.”
“If you have the mindset that anything that the agent can read or can touch, you have to assume that it will.”
“It ended up sending an email to our entire list with a discount code, and it was not supposed to go out.”
“Every bug becomes a permanent upgrade, so once you have this system in place you actually almost welcome bugs.”
“You could think of yourself like the product manager for Claude Code — you don't have to describe how to build something, but it's important for you to shape the vision.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
The promise lands in the first ten seconds: by the end of this you will know how to be the director of your coding agents, not just a user pulling the lever. What follows is a 68-minute screen-share built around one Excalidraw diagram, where software engineer Cole Medin turns a year of building agentic systems into a five-part operating manual for Claude Code — and Nate Herk, by his own admission a non-coder, stress-tests every piece against the knowledge work the rest of us actually do.
The repeatable loop that replaces vibe coding. You plan heavily, delegate the build, verify with a real check, and every pass ends with an opportunity to improve the system so next time is better. Same result, on purpose, every time.
A single markdown document that defines what you are building, what success looks like, which parts of the system get touched, the rules, the step list, and crucially how the agent will know it is done. Its quality determines the agent's success.
Front-load context, fan out sub-agents to research, co-write the plan with the agent, and force it to ask questions so it is not assuming what you want before any building starts.
A harness that lets the agent prove its own work in tiers, with a fail-fix-rerun loop. The agent is allowed to mess up on early passes as long as it iterates to a clean final output. Without it a first pass is ~65-70 percent; with it, ~92 percent.
Because attention is scarce and the dumb zone is real, control what fills the window: reset between tasks, prime deliberately, let skills pull procedures on demand instead of dumping everything, and use sub-agents to fetch only the slice of context you need.
After every issue, ask 'what to fix?' and convert the lesson into a rule, doc, plan change, or command. The floor keeps rising — doc to command to skill — and the system gets smarter every week, so you start welcoming bugs.
Each layer is more robust than the last but none is sufficient. Assume the agent will touch anything it can reach, scope permissions and keys, and enforce with hooks that inspect every tool call.
To give an agent a tool, a CLI paired with a skill that explains how to use it is more token-efficient and controllable than an MCP server. The skill encodes how you want the CLI used in your workflow; Cole's Archon ships exactly this way.
“Don't forget that I broke all of this down into a free resource guide that you can access for completely free using the link in the description to join our free Skool community.”
Soft, value-first CTA repeated from a mid-roll mention; points to a free resource guide and the Cole Medin channel rather than a hard sell.
00:01
00:56
02:07
03:01
03:28
04:41
05:46
05:58
06:51
07:40
08:32
09:48
10:39
11:30
12:21
12:50
14:03
14:55
15:46
16:37
17:29
18:19
19:10
19:48
20:53
21:44
22:35
23:26
24:17
25:08
26:00
27:04
27:42
28:33
29:24
30:15
31:06
31:58
32:49
33:41
34:31
35:22
36:13
37:05
37:56
38:47
39:38
40:40
41:20
42:11
43:03
43:54
44:27
45:36
46:27
47:18
48:09
49:01
49:52
50:43
51:26
52:25
53:16
54:08
54:59
55:50
56:41
57:11
58:23
59:14
59:59
60:57
61:48
62:39
63:30
64:21
65:29
66:04
66:55
68:00A 29-minute walkthrough of the Four Cs framework for running your entire business through Claude Code.
May 29thA 5-minute video that proves its own thesis: one prompt, no filming, no editing, a finished YouTube video.
June 12thA 34-minute live walkthrough of one creator's AI operating system, built on the four Cs: Context, Connections, Capabilities, and Cadence.
June 10thA 13-minute same-day breakdown of Claude Opus 4.8 — what changed, why effort level is now the primary control surface, and five prompting adjustments direct from Anthropic docs.
May 28thA 27-minute live build that wires Claude Code into a context-aware AI that plans your day, runs research, and checks on your team, all in parallel.
March 5thA 14-minute demystification of agent loops for non-hardcore-coders: what they are, why the done-check matters most, and three live demos that prove loops get you closer — not perfect.
June 19th