Chat with this breakdown.
Modern Creator members can chat with any breakdown — ask for the hook, quote a framework, find the exact transcript moment. Unlocks at T2: refer 3 friends + add your own API key.
Create a free account →Where the time goes.

01 · Karpathy thread + the four problems
Opens with Karpathy workflow flip (80% manual to 80% agent). Frames story around failures. Names four agent failure modes: silent assumptions, overengineering, scope creep, no verification.

02 · Excalidraw: problems mapped to cost
Three-column diagram: Your Request / What the Agent Does / What You Get. Walks each failure mode with concrete cost examples (400-line OAuth, 200-line date formatter, 40-line diff, untested validation).

03 · The four principles (GitHub README)
Introduces the Karpathy Skills repo. Maps each principle to the problem it solves: Think Before Coding, Simplicity First, Surgical Changes, Goal-Driven Execution.

04 · Installation walkthrough
Two paths: Claude Code plugin (global, recommended) via /plugin marketplace add + /plugin install; and per-project curl with append support for existing CLAUDE.md.

05 · Live demo: ecommerce dashboard
Builds dashboard with guidelines active. Agent asks 3 clarifying questions. Output: 1 file, 120 lines. Without guidelines: 6-8 files, 500+ lines, unasked-for features.

06 · VS Code: clean diff
Shows actual code output. Every changed line traces to what was asked. No renamed variables, no reformatted comments, no drive-by refactors.

07 · Tradeoff + close
Guidelines bias toward caution, not speed. For trivial tasks they are overkill. For nontrivial work where wrong assumptions cost hours, they are the fix.
Your CLAUDE.md is a behavioral contract, not a hint.
The agent does not need better prompts — it needs explicit rules about what NOT to do.
- Install the plugin route (global, 10 seconds): /plugin marketplace add forrestchang/andrej-karpathy-skills then /plugin install andrej-karpathy-skills@karpathy-skills
- Or append to your existing CLAUDE.md with the curl append command — your rules stay on top, Karpathy principles go at the bottom
- The four failure modes (silent assumptions, overengineering, scope creep, no verification) are the ones that kill review time — name them in your own CLAUDE.md
- The Excalidraw diagram format (Your Request / What Agent Does / What You Get) is a reusable content frame for any AI tool breakdown video
- The tradeoff is honest and worth saying out loud: these rules slow down trivial tasks, so scope them to nontrivial work
Things they pointed at.
Lines you could clip.
“Coding agents are capable, but they behave badly. They make silent mistakes that look correct on the surface. They build too much when you need too little. They touch things they should not touch, and they do not verify their own work. This file corrects those patterns in about 50 lines of markdown.”
“Every line that changed traces back to what I asked for. There are no surprise edits, no renamed variables in other files, no reformatted comments, no drive-by refactors.”
“And then you end up reviewing a giant pull request that solves a problem you never actually had.”
Word for word.
The bait, then the rug-pull.
Andrej Karpathy posted a thread saying he now programs mostly in English — and then catalogued exactly how that breaks. Not broken syntax. Something worse: agents that guess silently, build too much, touch things they should not, and call it done without checking. One developer turned those observations into a single file. This is a breakdown of that file.
Named ideas worth stealing.
The Four Karpathy Principles
- Think Before Coding — surface ambiguity first, ask not guess
- Simplicity First — minimum code to solve the problem, no speculative features
- Surgical Changes — touch only what the request requires
- Goal-Driven Execution — define success criteria, loop until met
Four behavioral rules for AI coding agents derived from Karpathy X thread, packaged as a CLAUDE.md file by Forrest Chang.
The Four Agent Failure Modes
- Silent assumptions — picks one interpretation and commits without asking
- Overengineering — writes 200 lines when 30 would do
- Scope creep in edits — reformats, renames, refactors beyond the task
- No verification — says done without checking edge cases
Problem taxonomy from Karpathy thread, named and made concrete with cost estimates.
How they asked for the click.
“If you did, please like this video and subscribe to the channel, and I will see you in the next video.”
Standard end-screen CTA, no mid-roll asks. Clean close after the argument lands.









































































