A 14-minute cost-routing playbook for the most powerful and expensive model Anthropic has ever shipped.
The unlock with Fable 5 is not switching to it but routing each stage of your build to the cheapest model that still clears the bar, because effort tier and model choice are two separate dials most users never touch.
Fable 5 is the most capable model Anthropic has shipped, but treating it as a daily driver will exhaust your credits in days. The central insight: Anthropic already auto-downgrades Fable to Opus 4.8 for sensitive requests, routing by risk. You apply the same mechanic, routing by cost. Use Fable at high or max only where decisions compound -- planning and final verification. Run all execution volume through Opus 4.8, Sonnet, or local models. Switch mid-session via /model so the spec stays in the thread. At medium effort Fable already beats Opus 4.8 at max, so you rarely need the top tier for anything but the plan.
Hook framing: Fable 5 is real, but so is the cost. The premise is set without benchmarks.

Anthropic announcement on screen: Fable 5 exits flat-rate subscription June 23 and becomes metered. The trap is forming a dependency before the meter hits.

Leaked system prompt compared against Opus 4.8. 80 percent identical. Five new rules around self-harm and life sciences. Filler-word instructions removed -- suggesting retraining.

Fable auto-downgrades to Opus 4.8 on cybersecurity and health requests. Steal that mechanic: route bulk work to cheaper models voluntarily.

Core framework. Fable medium beats Opus 4.8 max. Effort tier is a separate dial from model choice. Planning and shipping = high/max. Execution = medium or lower, Sonnet, or Opus.

Concrete mechanic: run Fable on max for planning, create a spec file, type /model to switch to Sonnet or Opus for execution, then re-invoke Fable to probe edge cases.
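The plan, execute, verify hand-off above can be sketched as a small routing table. This is a minimal illustration, not the presenter's tooling: the model labels (`fable-5`, `sonnet`) are placeholder strings rather than real API model IDs, the stage names are hypothetical, and the verify tier of `high` is an assumption, since the walkthrough only says to re-invoke Fable for edge cases.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    model: str   # illustrative label, not a real API model ID
    effort: str  # effort tier as named in the walkthrough


# Assumed mapping: plan on Fable at max, execute on Sonnet at medium,
# verify on Fable at high. The verify tier is a guess; adjust to taste.
ROUTES = {
    "plan": Route("fable-5", "max"),
    "execute": Route("sonnet", "medium"),
    "verify": Route("fable-5", "high"),
}


def route_for(stage: str) -> Route:
    """Look up the model and effort tier assigned to a build stage."""
    if stage not in ROUTES:
        raise ValueError(f"unknown stage: {stage!r}")
    return ROUTES[stage]


def model_switches(stages: list[str]) -> list[tuple[str, str]]:
    """Return (stage, model) pairs where the model differs from the previous stage."""
    switches = []
    previous = None
    for stage in stages:
        model = route_for(stage).model
        if model != previous:
            switches.append((stage, model))
            previous = model
    return switches


if __name__ == "__main__":
    for stage, model in model_switches(["plan", "execute", "verify"]):
        print(f"before '{stage}': select {model} (mid-session, via /model)")
```

Each entry returned by `model_switches` marks a point where you would change models in the same thread, so the spec written during planning stays in context for the cheaper model.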

Three worked examples. Marketing site: Fable high plan, Opus medium build, Fable low verify. 3D website: Fable x-high plan, Opus plus Sonnet agents build, Fable high verify. CRM: Fable max plan, x-high dynamic workflows, high-model verify.

Brief section against model loyalty. Fable plus Codex via OpenAI plugin extension. Use whatever clears the task at lowest cost.

Consolidated: Fable max/high planning, orchestrate execution with agents using skills/MCPs/CLIs, Fable verification, ship, iterate. The paradigm persists as model names change.

Comparison: Fable low ties Opus 4.8 max. Fable medium beats it. Fable max crushes it.
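One way to act on this comparison is to treat it as an ordinal ranking and pick the cheapest configuration that clears a required bar. The sketch below assumes a lot: the capability ranks simply encode the ordering stated above (tie, beats, crushes), and the `relative_cost` values are made-up placeholders, not real pricing. Swap in your own numbers before relying on the result.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Config:
    model: str            # illustrative label, not a real API model ID
    effort: str
    capability: int       # ordinal rank from the comparison; higher is better
    relative_cost: float  # PLACEHOLDER value, not real pricing


# Ranks encode only the stated ordering: Fable low ties Opus 4.8 max,
# Fable medium beats it, Fable max beats both. Costs are invented.
CONFIGS = [
    Config("opus-4.8", "max", capability=1, relative_cost=1.0),
    Config("fable-5", "low", capability=1, relative_cost=2.0),
    Config("fable-5", "medium", capability=2, relative_cost=4.0),
    Config("fable-5", "max", capability=3, relative_cost=10.0),
]


def cheapest_clearing(bar: int, configs: list[Config] = CONFIGS) -> Config:
    """Return the lowest-cost config whose capability rank meets the bar."""
    eligible = [c for c in configs if c.capability >= bar]
    if not eligible:
        raise ValueError(f"no configuration clears bar {bar}")
    return min(eligible, key=lambda c: c.relative_cost)


if __name__ == "__main__":
    for bar in (1, 2, 3):
        choice = cheapest_clearing(bar)
        print(f"bar {bar}: {choice.model} at {choice.effort}")
```

Because two configurations tie at the lowest rank, the placeholder costs alone decide between them, which is exactly the decision the routing argument asks you to make with your own billing data.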

Wrap. Benchmarks are manufactured; results are what matter. Fable short-circuits on cybersecurity. Opus is still more reliable day-to-day. CTA for free cheat sheet and community.
Picking the smartest model is only half the decision -- the effort dial is equally powerful and almost universally ignored.
“The average person will run out of credits by the time they say good morning to Fable.”
“Benchmarks don't matter. They can be doctored. They can be manufactured. They grab the headlines, but the only thing that matters are your results.”
“Fable five on low is still a very competent model.”
With great power comes an even greater token bill. Unlike most AI tool videos that lead with benchmarks, this one leads with the credit limit. The title is a warning, the hook is a price tag, and everything that follows is a routing strategy.
Effort and model are two separate dials. Most users pick a model and never touch the effort dial.
Anthropic auto-downgrades risky asks from Fable to Opus 4.8. Apply the same mechanic for cost: route high-stakes planning to Fable, route volume execution down to cheaper models.
Complexity of the project drives how high you go on planning; sub-agents always use cheaper models; verification scales with integration risk.
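The three rules above can be sketched as a small plan builder. The complexity and risk labels (`simple`, `moderate`, `complex`, `low`, `high`) are hypothetical names introduced here; the effort tiers they map to are taken from the marketing-site, 3D-website, and CRM examples, and the sub-agent effort of `medium` is an assumption based on the "medium or lower" guidance for execution.

```python
# Planning effort by project complexity, following the worked examples:
# marketing site -> high, 3D website -> x-high, CRM -> max.
PLANNING_EFFORT = {"simple": "high", "moderate": "x-high", "complex": "max"}

# Verification effort by integration risk (labels are hypothetical).
VERIFY_EFFORT = {"low": "low", "high": "high"}

# Models treated as "cheaper" than Fable for sub-agent work.
CHEAPER_MODELS = ("opus-4.8", "sonnet")


def build_plan(complexity: str, integration_risk: str,
               subagent_model: str = "sonnet") -> dict[str, tuple[str, str]]:
    """Map a project profile to (model, effort) pairs for each stage."""
    if complexity not in PLANNING_EFFORT:
        raise ValueError(f"unknown complexity: {complexity!r}")
    if integration_risk not in VERIFY_EFFORT:
        raise ValueError(f"unknown integration risk: {integration_risk!r}")
    if subagent_model not in CHEAPER_MODELS:
        # Rule from the notes: sub-agents always use cheaper models.
        raise ValueError("sub-agents should run on a cheaper model")
    return {
        "plan": ("fable-5", PLANNING_EFFORT[complexity]),
        "subagents": (subagent_model, "medium"),  # assumed effort tier
        "verify": ("fable-5", VERIFY_EFFORT[integration_risk]),
    }


if __name__ == "__main__":
    # Roughly the marketing-site example: high plan, Opus medium build, low verify.
    print(build_plan("simple", "low", subagent_model="opus-4.8"))
```

Keeping the sub-agent check as a hard error rather than a default makes the "always cheaper" rule explicit, so an expensive model cannot slip into the high-volume stage by accident.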
“Check out the first thing down below -- early adopters community and free cheat sheet in the second link.”
Double CTA pattern -- community link and free lead magnet. The community pitch appears twice, at 7:45 and 13:57.