Claude's New Small Business Plugin Is Wild (31 Built-In Skills)
A 20-minute installation guide and live demo of Anthropic's 31-skill Small Business plugin — from zero to a drafted job post and red-lined vendor contract.
May 30thA 17-minute practitioner verdict on whether Claude's fastest-ever release fixes what 4.7 broke.
The compute crisis that made Claude 4.7 painful forced Anthropic to fix the model's core weakness, long-session drift, so 4.8 is a qualitative shift back to the trustworthy agentic behavior last seen in 4.6, not just a benchmark step.
Anthropic grew 80x in one year, ran out of compute, and the degraded service is what made 4.7 feel broken. Opus 4.8 dropped 42 days later, backed by a $1.25B/month xAI compute deal and Anthropic's fastest model turnaround ever. Three side-by-side coding tests show a clear qualitative gap: 4.8 understands the full intent of a prompt, checks its own work before returning results, and holds the thread on long multi-step jobs without drifting off task. On benchmarks it beats GPT-5.5 on agentic coding 69 to 58, loses terminal bench 74 to 78, wins computer use. Same price as 4.7. For practitioners running real agentic builds, it's worth switching back.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
Hook via confession (gave up on Claude), Anthropic's own 'modest but tangible' wording sets credibility baseline, promise of three tests and a final verdict.

Anthropic grew 80x vs planned 10x; CEO quote; throttling at peak hours; 4.7 wasn't dumb, the service was buckling.

$1.25B/month, 220k GPUs, 300 MW; same week doubled Claude Code limits; headline: Higher limits plus a compute deal with SpaceX.

$4.75/M input, fast mode 3x cheaper; recap of how bad 4.7 felt: second-guessing loops, tokens draining, Reddit complaints.

Same prompt, two models. 4.7 built a functional demo (Heliocentric). 4.8 built the Orrery: planets, stats panel, surface landing mode, correct scale. 4.8 did more of what was asked and got there faster.

4.7's Compendia nailed both eras. 4.8's Omnipedia lived inside the 2001 era (Netscape button, 56k bar) and tied both versions with one idea: the old blue hyperlink becomes the new site's accent color. 4.7 faster; 4.8 better.

4.7 drift problem: loses thread, redoes old decisions, stops to ask keep going every few minutes. /goal command helped but only prolonged the drift. 4.8 fixes focus in the model itself.

Manager 4.8 splits a huge job into hundreds of parallel sub-agents, each self-checking before assembly. Early preview, higher plans only.

Agentic coding: 4.8 at 69.2 vs 5.5 at 58.0. Terminal bench: 5.5 wins 78 to 74. Computer use, reasoning, knowledge work, financial analysis: 4.8 leads all.

4.8 over 4.7 is not close. 4.8 edges 5.5 on most tasks. Wait on Gemini. CTA to free AI Accelerators community.
Infrastructure constraints, not model architecture, drove the worst of Claude 4.7's failures; the fix came in the model itself, not in the tooling layered on top.
“You don't go from a full on compute crisis to shipping your best model in six weeks unless something big changed in the middle.”
“4.8 fixes this in the model itself, not in some tool wrapped around it.”
“4.7 finished first, 4.8 finished better.”
“The chart says small steps, using it says huge.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
The title earns its punch: this creator actually cancelled GPT-5.5 after six weeks of defection, and the honest why runs deeper than benchmarks; it starts with Anthropic's own confession that 4.8 is only a modest improvement.
The implicit framework for evaluating AI models in real production use: a quick build, a hard combined test, and an unsupervised long job. Each axis reveals a different failure mode.
Manager-worker architecture for agentic AI: one orchestrator breaks work into parallel sub-tasks, each sub-agent verifies its own output before the manager assembles the result.
Per-session thinking effort control in Claude Code. Extra is the sweet spot for heavy agentic work per Anthropic. Ultra Code mode auto-sets effort high and enables workflow spawning.
“If you guys do wanna go deeper on actually testing these models head to head and figuring out how to put them to work in a real business, that is exactly what I do inside of my free community.”
Positioned after a strong earned verdict; the CTA carries weight because the creator spent the whole video being fair about 4.8's one benchmark loss before recommending it.
00:00
00:14
00:35
00:44
00:54
01:06
01:19
01:41
01:41
01:58
02:14
02:27
02:39
02:51
03:02
03:14
03:26
03:37
03:52
04:05
04:19
04:38
04:46
05:00
05:05
05:24
05:37
05:50
06:03
06:16
06:29
06:42
06:55
07:08
07:20
07:32
07:44
07:56
08:08
08:20
08:32
08:40
09:00
09:08
09:20
09:32
09:44
09:52
10:09
10:23
10:36
10:51
11:00
11:15
11:28
11:44
11:55
12:08
12:15
12:35
12:48
12:56
13:18
13:30
13:43
13:49
14:09
14:22
14:35
14:49
14:57
15:15
15:26
15:40
15:50
15:59
16:18
16:25
16:37
16:47A 20-minute installation guide and live demo of Anthropic's 31-skill Small Business plugin — from zero to a drafted job post and red-lined vendor contract.
May 30thA 12-minute walkthrough of how Claude now controls your computer and lets you run tasks from your phone while away from your desk.
March 25thA 14-minute walkthrough of the Noose desktop installer that finally lets non-technical users run one of the most capable open-source AI agents without touching a terminal.
June 6thHow the engineer who built Claude Code actually runs 15 parallel sessions — and the six-part system non-developers can copy today.
June 4thA 17-minute walkthrough of Max Hermes: the cloud-hosted Hermes agent that costs 95% less than Opus 4.7 and writes its own skill playbooks after every task.
June 2ndA 104-minute operational blueprint for becoming AI-first: audit your business, fix your data, build a four-layer stack, and deploy two working Claude Code systems end-to-end.
May 26th