Modern Creator Network
Ray Fernando · YouTube · 18:51

Two MCPs That Save 97% of Your Context Window

An ex-Apple engineer wires ref.tools and Exa AI into Claude Code, then benchmarks it against Cursor on a live Tailwind v4 refactor: Claude Code finishes at 2,800 tokens vs Cursor's 98,000.

Posted: 5 months ago
Duration: 18:51
Format: Tutorial (educational)
Channel: Ray Fernando
§ 01 · The Hook

The bait, then the rug-pull.

Documentation used to eat 100,000 tokens before a single line of code was written. Ray Fernando — ex-Apple engineer, decade-plus shipping production software — found the fix in two MCP servers that pull only the context the agent needs, precisely when it needs it. The proof is live on screen: a full Tailwind v4 design-token audit across a real codebase clocks in at 2,800 tokens — 1.4% of what Cursor burned on the same task.

§ · Stated Promise

What the video promised.

Stated at 00:02: "I'm gonna show you how to get two of my favorite MCPs set up." Delivered by 16:50.
§ · Chapters

Where the time goes.

00:00 – 02:05

01 · The context rot problem

Ray introduces the concept of context rot — LLMs getting dumb as the window fills with irrelevant documentation. Establishes why targeted doc-fetching MCPs beat brute-force token dumps.

02:05 – 04:37

02 · Installing ref.tools in Claude Code

One-command install via `claude mcp add`, API key walkthrough, and security warning about committing keys to public repos.
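The install is one command with the key passed along. A hedged sketch of the pattern (the endpoint URL, header name, and key below are placeholders, not ref.tools' real values; copy the exact one-liner, with your API key baked in, from the ref.tools dashboard's MCP page):

```shell
# Sketch only: URL, header name, and key are placeholders.
claude mcp add --transport http ref "https://api.ref.tools/mcp" \
  --header "x-ref-api-key: YOUR_REF_API_KEY"

# Verify the connection (also visible via /mcp inside a session):
claude mcp list
```

Ray adds it once at the user level, so it's available to every new project without re-adding.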

04:37 – 05:38

03 · Installing Exa AI in Claude Code

Same pattern as ref.tools. API key generation and paste into terminal. Both MCPs now available globally across any project.
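Exa's hosted endpoint follows the same pattern but, per the video, passes the key as a query parameter rather than a header. Again a sketch with a placeholder URL and key; the real command comes from the Exa dashboard after generating a key:

```shell
# Sketch only: URL shape, parameter name, and key are placeholders.
claude mcp add --transport http exa \
  "https://mcp.exa.ai/mcp?exaApiKey=YOUR_EXA_API_KEY"

# Both servers should now appear as connected:
claude mcp list
```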

05:38 – 12:43

04 · Live demo: Tailwind v4 refactor on Anime Leak

Haiku 4.5 runs a full codebase audit using both MCPs. Ray watches Claude Code research documentation, build a phased implementation plan, and hit /context — revealing only 2,800 tokens used.

12:43 – 15:00

05 · The token comparison: Claude Code vs Cursor

Cursor's plan mode finished the same task using 98,000 tokens. Side-by-side makes the 35x gap visceral. Ray notes Cursor asks clarifying questions; Claude Code doesn't.

15:00 – 16:50

06 · Setting up MCPs in Codex

Codex uses config.toml instead of JSON. Ray edits the file in Cursor, pastes the API key, verifies with `codex mcp list`.
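A sketch of what those config.toml entries might look like (table names, URL shapes, and keys are illustrative placeholders; copy the real snippet from each provider's Codex instructions, and note the key sits in plaintext):

```toml
# ~/.codex/config.toml -- sketch with placeholder URLs and keys.
[mcp_servers.ref]
url = "https://api.ref.tools/mcp?apiKey=YOUR_REF_API_KEY"

[mcp_servers.exa]
url = "https://mcp.exa.ai/mcp?exaApiKey=YOUR_EXA_API_KEY"
```

After saving, `codex mcp list` should show both servers alongside anything already configured.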

16:50 – 17:50

07 · Setting up MCPs in Factory Droid

Copy the MCP JSON block from Cursor's tools panel directly into .factory/mcp.json. Same pattern, different file path.
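The Factory Droid file uses the same mcpServers JSON shape Cursor does. A sketch with placeholder values (the real block, keys included, is what you copy out of Cursor's tools panel):

```json
{
  "mcpServers": {
    "ref": {
      "url": "https://api.ref.tools/mcp?apiKey=YOUR_REF_API_KEY"
    },
    "exa": {
      "url": "https://mcp.exa.ai/mcp?exaApiKey=YOUR_EXA_API_KEY"
    }
  }
}
```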

17:50 – 18:51

08 · Pro tip + CTA

Always call MCPs explicitly in your prompt. Use plan mode (shift-tab) before writing code. Closes with a pitch for his 1337 coaching intensive and a forward-looking take on agentic AI.

§ · Storyboard

Visual structure at a glance.

00:00 · hook · open: context rot problem
00:57 · promise · ref.tools website
04:35 · value · both MCPs installed + verified
05:38 · value · Anime Leak demo launch
11:54 · value · /context shows 2,800 tokens
13:04 · value · Codex CLI config
16:50 · value · Factory Droid config
17:50 · cta · plan mode pro tip
§ · Frameworks

Named ideas worth stealing.

00:15 · concept

Context Rot

The degradation of LLM output quality as the context window fills with broad, untargeted documentation — the agent gets dumb before it writes a line of code.

Steal for: any video explaining why vanilla web search MCPs hurt more than they help
01:31 · model

Targeted Agentic Search

  1. ref.tools — indexed docs, no context bloat
  2. Exa AI — high-quality search built for coding tasks
  3. Explicit MCP calls in prompt
  4. Plan mode before code mode

Pull only the context needed for the specific sub-task, not everything that might be relevant. Combine ref.tools (documentation precision) with Exa (coding-task search quality) and always call them by name in the prompt.

Steal for: CLAUDE.md rules section, any agentic coding workflow
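As a CLAUDE.md rules section, the model might be sketched like this (wording is illustrative, not taken from the video):

```markdown
## Documentation and search
- Use ref MCP for library/framework documentation; never paste full docs inline.
- Use exa MCP for coding-related web search instead of generic web search tools.
- Enter plan mode and gather the needed context before writing any code.
- Fetch only the docs relevant to the current sub-task, not everything that might apply.
```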
§ · Quotables

Lines you could clip.

00:34
I used to use up almost 100k tokens just feeding in tons of documentation.
relatable pain point, zero setup needed · TikTok hook
00:53
LLMs actually operate best when they have just the right information for just the right specific task.
clean thesis statement, punchy standalone · IG reel cold open
12:00
Our MCP servers and tool calls only use a total of 2,800 tokens — which is only 1.4% of the context window.
the money-shot stat, visually proven on screen · newsletter pull-quote
02:05
It's almost like as if I hired several developers to read the documentation and implement the code for us.
strong analogy, no context needed · TikTok hook
16:50
Make sure you use plan mode so that it gathers all that specific context before you start writing code.
actionable closer, works standalone as a tip · IG reel cold open
§ · Pacing

How they spent the runtime.

Hook length: 125s
Info density: high
Filler: 8%
§ · Resources Mentioned

Things they pointed at.

05:38 · product · Anime Leak app
15:00 · tool · Codex CLI
16:50 · tool · Factory Droid
17:50 · product · 1337 Intensive coaching offer
§ · CTA Breakdown

How they asked for the click.

17:50 · product
I do have a couple more spots that are open for my one three three seven intensive.

Soft close after delivering all value. Thirty-minute sessions over five days. Credentialed with Apple engineering background. Low pressure but well-timed after a high-value tutorial.

§ · The Script

Word for word.

Legend: HOOK = opening / re-engagement · CTA = the pitch · metaphor = analogy
00:00HOOKSo I'm gonna show you how to get two of my favorite MCP set up. And the two are ref dot tools and the other one's called Exa AI. And the reason why I like ref dot tools compared to every other MCP that you could use to search for documentation is context rot. So that's when the LLMs start getting really dumb as soon as they try to help you out with information. And if you don't know, whenever you're using a language model, there's a cutoff date for all the data that they have trained so far. And so that's why I choose these two different MCP servers because they're the most efficient. If you've ever used any other MCP server,
00:34HOOKall of a sudden, you're gonna start slamming in a whole bunch of tokens from documentation. I used to do that before. I used to use up almost a 100 k tokens just feeding in tons of documentation for the new AI SDK, my authentication, my database provider, all these extra rules. But the LLMs actually operate the best when they have just the right information
00:55HOOKfor just the right specific task. And so in comes ref dot tools. So this actually provides the actual context that's needed for these agents. And the way that it works is it'll grab public documentation
01:08HOOKand any private documentation that you also give it as well. PDFs, GitHub repos, any other sites. It will create them and indexes and actually make them available in Cloud Code, in Droid, in Codex, in Cursor, and those are the four that we're gonna go ahead and cover today as far as trying to get this installed for us to make it work. A lot of times when you're doing tool calling, what ends up happening is that
01:30HOOKthere's gonna be multiple requests and multiple steps that happen along the way. And ref dot tools is kinda made for this agentic search so it doesn't kill your context window. So this combined with the other one called Exa AI, this actually has a really high quality fast search for specifically coding tasks. And if you combine these two together, you're gonna be
01:53HOOKcooking. And I have shipped features starting in plan mode using these two MCP servers to help gather the right context. And by the time it's writing the code, it's almost like as if I hired several developers to read the documentation and and implement the code for us. How do we go ahead and configure them? So in ref dot tools, they actually make it really easy for you. All you do is just go ahead and once you've already created an account,
02:15uh, you just go to MCP and in Claude Code, all we're gonna do is install it via command line. So this API key here is what you're gonna actually put into this command. So once you hit copy, we're gonna open the terminal now. And then I'm just gonna go ahead and just paste this in here. Now it says added the MCP server with ref and has a local config headers of this. And so this means that the actual API key's here, and I think it's gonna be able to do that. So now if I run Claude
02:42and I run slash m c p, it's gonna try to connect. Now we actually have the ref dot tools actually connected for us. And we have to like quit an existing session like we did and try to log in again. And so you'll see it available once you do claude mcp, uh, space list. So because it's in my root user, it's gonna be available to any new project I start. So don't be afraid if you add this in, you don't have to keep adding this into every project. You just have to add it in once and once you add it in here, you're pretty much good to go. By the way, if you notice, we actually have the API key currently in the MCP server.
03:14You typically don't wanna check those types of things into your project as code because that's gonna be available to everyone. So if you have the repo that's public, anyone who clones that can use your specific key and eat up all your credits as an FYI. So the next one we wanna go ahead and get configured is called exa. The command they give us is claude m c p add exa and the same with our API key. So what we're gonna go ahead and do is hit copy here. Then I'm gonna go back to Claude Code in the terminal here. And what we wanna do is go ahead and place this thing called here, this is your API key.
03:45And so in this section here, what we do is just gonna go ahead and delete. And then I'm gonna grab the API key from the following API dashboard. If you've not generated a key, you can go into here and create a new key. So we can create one called, Claude code
04:01YOLOBRO. And let's go ahead and create the key. And then I'm gonna go ahead and copy this one. And then I'm gonna go into my terminal now. And then I'm just gonna go ahead and paste that key there. I'm just gonna go ahead and hit enter. It's gonna go ahead. It's now added the MCP server with the following command to the local config. And so now it's actually the file that's modified is in my local folder here. So you may notice that it could take a while for it to connect, and you can always just do Claude
04:28MCP list. And that'll basically show you what's actually
04:35currently active and available as far as MCP servers. So Haiku has just come out. It's the 4.5 model supposed to act really fast. And why don't we just try using the latest model for a task that I'm trying to do? And in this case, we're basically just gonna ask the model to see if you can use the latest Tailwind v4 and search through our code base. My code base isn't
04:54really fully utilizing the power of Tailwind v4. And I want you to take a look at my code at all the different places because I think I have a lot of stuff that I've hard coded in for Tailwind v3 and I need it to be all unified. So we're taking advantage of the new Tailwind v4 system.
05:12I want you to use ref and use exa MCPs first to read through the documentation to understand how this type of system works, as well as reading through all the different parts of my code to understand the current design system. And we should be using Tailwind v four along with shadcn. So right now we're basically having Claude Code look through my entire app. And if you don't know what my app currently does, it's called anime leak. It allows you to upload an image, any real image of the real world. And you'll actually start to see like basically
05:45anime start leaking into the photo, which is kind of fun. So you can see me right here with my sandwich got replaced with this nice hand drawn kind of really whimsical type of thing. So in here you can kind of see all the different generations I've had, and then it lets you like share them with friends and so forth. In my original actual design, everything was in light mode. And since I forked this project from my friend Mickey,
06:08who currently works over at Convex, it was in this specific color theme. So the landing page as you can see is kinda a little bit more dark theme and it has these different types of things that are going on. And we wanna have more of like a consistency for design hierarchy and everything here. So right now, Claude Code is actually searching through and it's gonna try to understand the code base fully. Now it's researching Tailwind Design Systems.
06:30And I don't know if it's actually fully utilizing the MCP's here. So now it's actually saying, yes, I want to use these different tools now to use the ref documentation. So the query is gonna do is Tailwind v four CSS variables design tokens customization, and yes, and don't ask again for this MCP. And the reason why I'm thinking Haiku is the right thing for this type of task is one, it's a lot cheaper. And for this type of task, I don't need a lot of intelligence. I just need a lot of work to be done. And so by handing this off to an agentic model like this, it's it's gonna be able to gather a lot of research
07:02in one side of it, but also look through the code and and look for specific patterns of things which is what I want to do. And then the utilizing of these MCP's will keep the documentation very light, but grab enough information that it needs to know to do this type of pattern matching which is gonna be really awesome here. So so while that's going, what I'm gonna go ahead and do is pull this up in cursor and kinda show you how I configured those there. We're gonna go to this little gear here and they have actually dedicated tools and MCPs.
07:26If you scroll down, these are all the MCP servers I have connected. You can just hit add custom MCP server and this is a JSON file that actually has everything configured here. And what you wanna go ahead and do is you'll go to the the website where it says use ref they're gonna give you this exact thing to paste into cursor. Add the cursor in one click. And if we go into here, it's actually gonna say, do you want to install the MCP servers?
07:49And you see ref mentioned here with the following URL and the API key. So this already has the API baked in and you hit install. Once you hit install, it's gonna actually show up just like it does here. Pretty simple. It's really nice. So we're gonna basically do the same thing with Exa. If we go back to Exa, and we go to
08:07the dashboard, and we go to Exa MCP. So the values that we wanna grab is gonna be this one here. See Exa, and then has this little bracket here and this bracket here. This is really really important because at the higher level which is MCP servers, that's your collection of all of your MCP servers. So we just get this value just like that. And you see that there's an additional thing that's added to my specific MCP server which is this thing here. It's the Exa API key equals
08:35and this is the part where you wanna grab that API key from Exa and then put it into this last section here. And so when you open your project, you just make sure to toggle on and this is green and that means you're pretty much good to go. And I'm gonna give the agent the same type of task to use the same MCP service as well. I'm just gonna copy this exact
08:53actual prompt and put this inside the cursor, so that we can kinda see them both kinda cook right here. So so now, what we can do as well is we can put the agent into plan mode. So what happens in plan mode is no code gets written, but then we'll have all this analysis done and we'll have MCP's be able to be utilized inside of cursor. And so cursor is gonna do all the work for us, which is really nice. If I do command e, this is gonna pop up a dedicated
09:18specific window for our agents, so that we can do this type of planning. Now, we get a straight up dedicated window to watch the agent really work. And to me, this is like a really nice way to work as you can kinda see it's a little more spread out. And as we are gonna start modifying files or working with anything in the code base, we have what I feel like is a much more elegant view,
09:39all self contained in one type of thing. So to get to this agent window, you have to hit command e. And you may have to enable it in the settings as well. So as you can kinda see it's narrowing down the scope as it discovers information to make sure that all the latest information is up to date. And so this is a lot more efficient than web search after web search after web search. Sometimes they'll run a parallel tool call to do a lot of web search and then you'll start to flood
10:03the context window and a lot of times these, you know, agents will have to start pruning information. And sometimes you can have really relevant information pruned out. And that's why I prefer these two MCP's. One of my favorite things about cursor is how fast it is at gathering context and information throughout my file system. Because if we go back to Claude Code right now, while it's still running in the terminal, it is still running all these different tasks. We're still on the research side and you can see all the different tool calls it's been doing to get all the relevant information that it needs for this task.
10:35And it takes a little while, right? So it's now auditing current code base for our code values, create a comprehensive design token system and global dot CSS. But this is actually kind of already giving us a plan here. So I have a comprehensive understand. Let me update the to do list and create a detailed implementation plan. So it actually created a plan for us without me actually asking for it, which is really nice. Right? So this is my current state. My global CSS is already using Tailwind v four correctly. However, there are opportunities to expand and unify the system. But as you vibe code stuff, the agents are gonna prefer what's in their training set, so they're gonna dump in a bunch of v three
11:07type of systems, but here's the gaps to address. So there's like missing semantic tokens, means hard coded values are scattered, magic numbers and components, and all this kind of stuff that we probably don't want. So there's a couple phases that it wants to do here. So it's like a unified design system starts here. Uh, phase two to extract the component tokens,
11:26HOOKand then three is to fix the hard code values. And then four is to create a tailwind config for advanced customization, which is gonna be really nice. So now, it's asking me, would you like me to proceed with implementing this plan? I recommend we start with phase one. I should be able to look at slash context. And we're gonna get a visualization right now of how our context is being used. So as of now, with these tool calls, can kinda see
11:50HOOKit only used like less than a thousand tokens for all these different tool calls, which is amazing. So our MCP servers and tool calls only use a total of 2,800 tokens, which is only 1.4% of the context window. And you can see the value in just these two mcps alone on a refactoring task which requires touching a lot of files, reading them in, and, you know, doing this type of comparison work. I'm super impressed with this type of system and I found extremely good consistency combining the two. And let's just go ahead and check and see what Cursor is doing. I mean, cursor probably finished way earlier and you can see the token efficiency in cursor. It used 98,000 tokens. So one of the cool things about cursor's plan mode, which is what we didn't really compare here in Claude Code, is the fact that it's now asking me a couple more questions. Like, should we handle semantic scope of brand colors and then the cream background? But the last thing we wanna go ahead and get set up here is getting this set up
12:43probably in Codex and also getting this set up in Factory Droid. So for Codex, we're just gonna go ahead and go to what the instructions say for Codex. So in use ref here, they have like all the MCP clients. You can say find your MCP client for use ref, and we can say Codex here, and they'll say Codex CLI. So in Codex CLI, they give us the actual documentation reference here. And so for here, it says Codex is configured in a config dot toml. So if I go to terminal,
13:11and I just open up any tab, codex m c p list. And we can see I currently have playwright installed. So I don't currently have the exa one installed. What I can do is I usually just copy this here
13:26and then go back to my terminal.
13:30You can even do this in cursor. It's probably just easier to just open up the terminal in cursor. So just in here, just type in cursor, cursor and then hit this. And this is actually gonna take us exactly to where this specific file is in our file system. So this one says to enable ref, update your TOML to include this. We're just gonna copy this. We're gonna go back in the cursor. We're gonna paste.
13:52And we still need to put our API key here from the use ref. So since we already had it configured in cursor, what I'm gonna go ahead and do is just scroll down to where it says tools MCP here on the side. And then I'm gonna go to ref and hit this little pencil. And so the pencil will already have ref listed with our API key. I'm gonna copy this API key and then in here in this config dot toml where we're at, I'm just gonna go ahead and
14:14just delete this here and then paste it in here. I'm gonna go ahead and hit save and then now we should have it currently saved. So if I type in codex MCP list, we should now be able to see that the MCP server is currently configured in here. So use
14:30ref MCP to look up the latest on tailwind v four and see if there are any gaps in my implementation.
14:41Okay. So we're just gonna hit enter. And so now it's gonna go ahead and send that task off and kinda do some research using the OpenAI Codex version. It's actually you can see here ref dot ref research documentation. This is actually now using the MCP that's built in. And so we're gonna do the same thing for Exa, and then we're gonna go back into here where it says Exa MCP.
15:01And then in here, we're gonna just type in like Codex. I'm gonna copy this URL since this already works inside of cursor. We're just gonna do the same thing and make it work inside of the config dot toml for Codex. And what I'm gonna go ahead and do is just paste this key right into here. So this is exactly the same key we had inside of cursor. And you can kinda see with that little question mark, that's a little parameter that we're gonna be passing in that shows the exa API key. I'm gonna open up a new terminal window and say codex MCP list.
15:27And now we should be able to see the exa MCP currently listed and status is enabled. If I paste this in, and then I should be able to get some more prompts or something back saying, hey, want to use this MCP and I just should be able to give us some approval. So for the factory droid, if you're using droid as an agent, what you wanna go ahead and do is kinda do something similar. I'm gonna show you real quick. So the command is cursor and then tilde
15:54slash dot factory, and it's gonna be under slash m c p json. So when I do that, what's gonna happen now is now we have this specific thing opened up inside of cursor. This allows us to actually edit the file and save it all in one place, which is really really nice. So all you have to do is get your configuration from cursor. And if you were getting it from cursor, all you do is go in here, hit tools MCP,
16:18go into this section, and all you have to do is literally copy this from this blue ref to here to this including this little blue line in that. And then you go back over here where it says MCP factory JSON right up there. And then you just paste it right in there and you hit save. Once you have these two MCP servers, I think what's gonna happen for your AI coding is that it's significantly gonna get better. But you actually have to call them specifically saying use ref MCP
16:45and then use exa MCP or use exa code MCP. And so the other pro tip that I wanna give you before we kinda close out the video is the fact that you want to make sure that you use them in plan mode. So in any type of planning mode, you can usually hit shift tab once or twice. You'll see that all of these agents have like a spec mode, a planning mode of some sort. Make sure you use those specific keywords so that it gathers up all that specific context you want before you start writing code. This is just getting started. If you want to learn a little bit more, I do have a couple more spots that are open for my one three three seven intensive. And so that's the elite if you don't understand what that means.
17:25CTAAnd I can sit down with you for thirty minutes for five days in a row, and we're gonna try to get either MCP server set up, your documentation workflow set up, maybe something that's kinda stopping you from getting the results that you want. And I've been doing this type of stuff for several years with AI coding, but as far as engineering, I used to work at Apple as an engineer and I've been doing it for over a decade. And I have a lot of industry experience shipping software,
17:50CTAhaving software that affects billions of people every single day. And if you're interested in that, feel free to check out the link in the description obviously. And I wanna leave you with this last thing. This AI coding thing is just starting to take off and we've thrown the first pitch of many hundreds of pitches to go in this really long baseball game of AI.
18:10CTAAnd I'm really excited for this era of AI coding because this is now unlocking a lot more capability. And the models of what we're doing today are just getting off the ground. And these type of tooling like the exa MCP and the ref MCP, these are gonna be really essential to keep the language model training data, like the intelligence of the models here, and the training data that is currently missing, it's just gonna pull that right in and try to use this type of logic. These tools are really magical now because now they're available twenty four seven and they don't require bugging a senior engineer every single day. My name is Ray Fernando, and I'm really excited to be with you. And I'll see you in the next livestream. Peace out, y'all.
§ · For Joe

Stop dumping docs. Fetch what the agent needs.

Context engineering playbook

The 35x token gap isn't about Claude Code being smarter than Cursor — it's about giving the agent a scalpel instead of a firehose.

  • Add ref.tools and Exa to your global Claude Code config once — they're available to every project automatically.
  • Call them explicitly in your prompt: 'use ref MCP' and 'use exa MCP' — the agent won't reach for them unprompted.
  • Always use plan mode (shift-tab) before letting the agent write code — gather context first, execute second.
  • The /context command in Claude Code shows token usage per tool call — use it to audit your own sessions.
  • Haiku 4.5 is the right model for large-codebase pattern-matching tasks: cheaper, fast, doesn't need intelligence — needs volume.
  • Don't commit API keys to public repos — the MCP config files store them in plaintext.
§ · For You

What this means if you're using AI coding tools.

If you vibe-code or use Claude/Cursor daily

Your AI assistant is probably burning through its memory reading docs it doesn't need — and getting dumber because of it.

  • Install ref.tools and Exa as MCP servers once — they let your AI fetch only the exact documentation it needs for each step.
  • Always start with plan mode before asking the AI to write code — it thinks before it acts instead of hallucinating from stale training data.
  • Tell the AI explicitly which tools to use: 'use ref MCP to look up the Tailwind v4 docs' — it won't do it on its own.
  • Check your context usage with /context in Claude Code — if MCP tool calls are eating thousands of tokens, something's misconfigured.
  • The 35x efficiency gap Ray shows isn't magic — it's just not flooding the AI's working memory with irrelevant information.