I Made Opus 4.8 and Fable 5 Build the Same App (RAW RESULTS)
Three identical one-shot prompts. Two models. The gap was not close.
June 11thA 13-minute head-to-head where two Claude models race to clone the same landing page — one burns $30 and 35 minutes, the other $2.70 and 5, and the gap in quality tells you exactly when the expensive model earns its keep.
Fable 5 earns its steep price premium specifically in visual-verification loops: when Claude in Chrome can see the live site, Fable 5 self-corrects across 35 minutes of autonomous iteration while Opus 4.8 stalls after 5, producing a result that would otherwise require a human an afternoon of manual feedback.
Fable 5 introduces a self-verifying coding loop: it implements code, opens the browser via Claude in Chrome, screenshots the result, compares it to the original, and iterates — all without a human in the middle. In a direct race to clone modash.io, Fable 5 took 35 minutes and $30.41 but produced a result that scraped real images, replicated animations, and matched section layouts closely. Opus 4.8 took 5 minutes and $2.70 in round two and barely changed from its generic first pass. The conclusion: use Fable 5 when fidelity matters and the task is complex enough to justify the cost; use Opus 4.8 everywhere else.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
Overview of Fable 5's key differentiator: end-to-end verification via vision. Introduces the implement-validate-prompt loop and Claude in Chrome. Sets up the test.

Two Claude Code sessions configured — Fable 5 and Opus 4.8 in separate folders. Same prompt sent to both: replicate modash.io from a full-page screenshot.

Opus 4.8 finishes in ~2:50. Colors roughly right, fonts wrong, testimonial and footer sections generic. Side-by-side comparison reveals the gaps.

Fable 5 finishes in ~4 min. Hero weaker, navbar off, customer section better, FAQ worse, footer closer. Mixed first impression.

Claude in Chrome activated. New prompt uses --goal flag: verify against live site, iterate until identical. Cost check: Fable at $1.60, Opus at $0.61 so far.

Fable 5 takes the Claude in Chrome slot and runs autonomously at x50 timelapse speed. The --goal flag keeps it iterating. Discussion of what a good objective goal looks like.

Opus 4.8 takes ~5 minutes in round 2 and barely changes. Generic cards unchanged. Confirms Fable 5's advantage is specifically in longer autonomous runs.

Section-by-section comparison. Customer section near-identical, resources cards very close, FAQ aligned, scroll-jacking animation implemented with a minor footer overlap bug. Images scraped from live URLs.

Fable 5: $30.41 / 31% context. Opus 4.8: $2.70 / 10% context. Recommendation: trim CLAUDE.md skills for Fable 5, use specific end-result prompts.
Fable 5's advantage is not raw intelligence — it is the number of self-correction cycles it will run without being asked.
“This is the golden phrase that made me consider that this model might be an exceptional model for copying other websites.”
“Honestly, for the price, I would just take Opus 4.8 for this specific example.”
“If I'd need to iterate with the OPUS 4.8 model, I'd probably be spending the whole afternoon sending it images of sections of the website that should be improved.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
The claim was simple: Claude Fable 5 uses vision to validate its own code. So the creator set up two Claude Code sessions side by side, gave them the same prompt, and let them race to clone a live SaaS landing page — one model spending freely, one keeping it cheap.
The core agentic workflow Fable 5 is designed to exploit — autonomous end-to-end verification without human checkpoints.
Use verifiable conditions as the goal (e.g., all tests pass, lint is clean) rather than subjective ones. Gives the model a binary stop signal.
“Make sure to check my latest video where I go through my specific workflow that works pretty well with better models.”
Soft verbal CTA with no on-screen graphic. Delivered as a recommendation rather than a pitch.
00:00
00:15
00:29
00:35
00:45
00:56
01:09
01:16
01:26
01:36
01:41
01:57
02:07
02:15
02:23
02:37
02:47
02:58
03:13
03:14
03:28
03:38
03:48
03:59
04:05
04:15
04:29
04:39
04:49
05:04
05:09
05:20
05:33
05:38
05:50
06:01
06:11
06:21
06:31
06:42
06:51
07:02
07:08
07:22
07:32
07:38
07:50
07:59
08:12
08:19
08:28
08:44
08:54
09:04
09:14
09:21
09:33
09:41
09:54
10:07
10:14
10:28
10:32
10:46
10:56
11:06
11:16
11:26
11:41
11:48
11:57
12:07
12:14
12:27
12:33
12:48
12:58
13:08
13:18
13:28Three identical one-shot prompts. Two models. The gap was not close.
June 11thHow Addy Osmani packaged 14 years of Google engineering judgment into 23 markdown files -- and what is actually worth stealing.
June 11thA 20-minute breakdown of the 10/80/10 system and loop engineering — the cost-efficient way to run the most expensive AI model on the market.
June 11thA head-to-head build of the same finance app in Fable 5, Opus 4.8, and GPT 5.5 - same prompts, same workflow, $200 in 24 hours.
June 11thA 14-minute cost-routing playbook for the most powerful and expensive model Anthropic has ever shipped.
June 11thA 17-minute field report from a solo indie developer who wired his AI agents directly into his simulator, browser, crash tracker, and code review system — and stopped babysitting them.
May 14th