How to Build Claude Subagents Better Than 99% of People
A 26-minute field guide to Claude Code subagents — when to use them, how to build them, and how to save money by matching model to task.
June 9thA 26-minute live benchmark that runs three real builds side-by-side and reads the session logs to settle the Claude Code vs Codex debate with actual numbers.
Choosing between Claude Code and Codex is not a feature comparison but a workflow-shape question, and the benchmark data shows each tool wins decisively on different task types.
Three real builds, two tools, full telemetry. Claude Code took 14:51 total across three tasks using 5.8M tokens at $11.05; Codex took 25:52 using 6.2M tokens at $7.11. Claude built the marketing dashboard in 1:57 using 283K tokens while Codex took 7:50 and burned 1.64M. Codex won the research PDF -- faster and leaner. The underlying cause is output-token discipline: Codex consistently writes 2-5x fewer output tokens than Claude, which is why it burns through subscription limits more slowly. The practical decision rule: Claude for front-end, planning, and custom workflow automation; Codex for research-heavy tasks, structured documents, and longer-running objectives.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
OpenAI comeback framing, promise of honest head-to-head across features, price, and three specific use cases.

Task delegation, file editing, customization via hooks/skills/sub-agents. Desktop, terminal, web versions. Opus/Sonnet/Haiku models.

GPT family models, gpt-codex-spark in preview. WorkTrees as the defining architectural choice. Included in every ChatGPT paid plan.

Both tools: local code editing, desktop app, VS Code extension, CLI, MCP, skills format, plugin marketplace, cloud delegation, hooks, sub-agents.

30 hook events vs 6. Auto-delegating sub-agents. /ultra-plan, /ultra-review, /loop. Channels integration. Agent SDK. Enterprise auth (Bedrock, Vertex, Foundry).

Native WorkTrees per thread. In-app browser. Computer-use QA. at-Codex GitHub PR integration. /goal. GPT image generation. OpenClaw/Hermes compatibility.

Claude: Pro $20, Max 5x $100, Max 20x $200. Codex: included in ChatGPT free through Pro $200. 1M token context (Claude) vs 256K (Codex).

Three identical prompts: research report PDF, landing page (Glaido), marketing analytics dashboard. Claude wins landing page and dashboard design; Codex wins PDF efficiency.

Raw numbers from JSONL logs. Codex: 25:52, 6.19M tokens, $7.11. Claude: 14:51, 5.8M tokens, $11.05. Output tokens always higher for Claude. Efficiency scatter plot.

Use Claude for front-end, deep planning, custom workflows, enterprise auth. Use Codex for research tasks, structured documents, /goal, GitHub PRs, image generation. Split workflow is valid.

Projects are files in folders -- not locked to either tool. CLAUDE.md becomes AGENTS.md. Closing thesis: which tool is best for this specific task.
The benchmark data splits cleanly: Claude Code wins on front-end quality and planning depth; Codex wins on token efficiency and research-heavy output -- and both tools are portable enough that you do not have to commit to just one.
“It is not a matter of which tool is best, it is a matter of which tool is best for the specific use case in front of you.”
“ClaudeCode right now has 30 different hook events. Codex right now has about six. If you want to fire automated behavior into every part of the workflow, ClaudeCode gives you about five x the granularity.”
“Claude has this way of planning the task tightly before it executes. And Codex tends to just grind through more iterations, which is why the input tokens stack up on its side.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
For months, Claude Code was the only coding agent worth talking about. Then OpenAI shipped Codex -- and the comparison videos started. This one actually runs the tests.
A task-type decision rule rather than a blanket preference for one tool.
Output tokens cost more and burn session limits faster. Codex writes 2-5x fewer output tokens than Claude per equivalent task. This explains why Claude users report hitting limits faster -- and it is measurable from JSONL logs.
“I broke all of this down into a resource guide that you can access for completely free, and you can find that in my free school community.”
Verbal mention only, no overlay shown. Low-friction -- no product pitch, just a free community link.
00:01
00:33
00:46
01:08
01:25
01:41
02:05
02:29
02:44
03:07
03:19
03:41
04:05
04:29
04:40
05:12
05:21
05:41
06:04
06:21
06:41
07:02
07:26
07:42
07:59
08:20
08:39
09:01
09:18
09:40
10:08
10:23
10:44
10:57
11:21
11:39
12:03
12:17
12:38
13:16
13:26
13:46
14:06
14:19
14:46
15:11
15:26
15:46
16:06
16:26
16:44
17:06
17:27
17:45
18:05
18:25
18:45
19:05
19:25
19:45
20:05
20:25
20:45
21:00
21:24
21:35
21:59
22:24
22:37
22:56
23:20
23:49
24:14
24:19
24:41
24:56
25:17
25:42
25:58
26:23A 26-minute field guide to Claude Code subagents — when to use them, how to build them, and how to save money by matching model to task.
June 9thA 7-minute essay arguing that the AI coding war is a free sample phase — and your $200/month is a 12–24 month exemption from real prices.
May 13thA 13-minute walkthrough of the six Claude Code skills Nate Herk says are the only ones businesses will actually pay for — plus how to package them into an AI-automation offer.
May 3rdA 34-minute live walkthrough of one creator's AI operating system, built on the four Cs: Context, Connections, Capabilities, and Cadence.
June 10thA screen-share walkthrough of Anthropic's dual model drop: Fable 5 for everyone, Mythos 5 for Glasswing partners only -- and why the host saw it coming.
June 9thA 29-minute walkthrough of the Four Cs framework for running your entire business through Claude Code.
May 29th