Modern Creator
Alex Finn · YouTube

Claude Opus 4.8 actually blew my mind

A 12-minute field report on every change in the new model — benchmarks, pricing, Dynamic Workflows, Ultracode — plus a live one-shot 3D game demo and a concrete recommendations ladder.

Posted
2 days ago
Duration
Format
Tutorial
hype
Views
20.1K
1.2K likes
Big Idea

The argument in one line.

Claude Opus 4.8 is effectively a public preview of the unreleased Mythos model — matching its hallucination reduction and benchmark ceiling at the same price as Opus 4.7, making it the first AI release in recent memory where more capability did not cost more.

Who This Is For

Read if. Skip if.

READ IF YOU ARE…
  • You use Claude Code as your primary coding environment and want to know whether upgrading to Opus 4.8 is worth doing today.
  • You are on the $200/month Claude plan and deciding whether /fast mode and Ultracode are worth activating.
  • You run agent frameworks like Hermes or Open Claw and need to know the safest upgrade timing.
  • You want a quick benchmark comparison before committing Opus 4.8 to production workflows.
SKIP IF…
  • You want a technical deep-dive into how Dynamic Workflows work under the hood — this is a practitioner overview, not an engineering explainer.
  • You need a neutral comparison; the host is openly bullish on Anthropic and frames most findings favorably.
TL;DR

The full version, fast.

Claude Opus 4.8 tops every coding benchmark, matches the hallucination-reduction numbers Anthropic has been teasing under the Mythos codename, and costs exactly the same as its predecessor. The /fast mode dropped from 6x to 2x the cost of the standard mode. Two new features — Dynamic Workflows (parallel sub-agent swarms) and Ultracode (autonomous workflow activation) — are available now but practically gated behind the $200/month plan. The recommendation is to switch all tasks to Opus 4.8 immediately, default to High effort, and wait for official SDK releases before upgrading agent framework configs.

Free for members

Chat with this breakdown — free.

Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.

Create a free account →
Chapters

Where the time goes.

00:0000:27

01 · Intro

Hook on benchmark supremacy, promise to cover changes not in the official announcement.

00:2705:28

02 · The changes

Systematic walkthrough: benchmark leadership, price parity with 4.7, /fast mode repriced from 6x to 2x, 4x hallucination reduction matching Mythos, Dynamic Workflows (parallel sub-agents), and Ultracode mode.

05:2808:36

03 · Recommendations

Concrete settings ladder: switch to Opus 4.8 now, use standard context (not 1M), default High effort, hold agent framework upgrades 24 hours for official release, reserve /fast and Ultracode for $200 plan. Closes with a focus and discipline argument.

08:3612:42

04 · Product walkthrough

Live Claude Code benchmark: one-shot 3D FPS in Three.js with creative freedom. Output scores 9.1. Bonus tip: remote control setting in Claude Code for mobile monitoring.

Atomic Insights

Lines worth screenshotting.

  • Claude Opus 4.8 matches Mythos-level hallucination reduction at the same price as Opus 4.7 — the first model release in recent cycles where capability went up and price stayed flat.
  • The /fast mode repricing from 6x to 2x the cost of standard mode is more significant than the benchmark news for teams running high-frequency prompts.
  • Dynamic Workflows spawns tens to thousands of parallel sub-agents — a qualitative shift from sequential single-agent coding to concurrent task execution.
  • Filling the million-token context window to capacity degrades output quality; standard context with disciplined session resets performs better in practice.
  • Upgrading agent framework configs on day one of a model release is a known failure pattern — the 24-hour official SDK release cycle exists to absorb this instability.
  • Focus during AI-assisted coding is now the primary leverage point — the model waits idle for longer than it runs when the user is distracted.
  • Ultracode (autonomous Dynamic Workflow activation) is the ceiling of Claude Code's current capability, but economically inaccessible below the $200/month plan.
  • Early adopters of new model releases hold a measurable advantage — competitors are statistically unlikely to be using Dynamic Workflows in the first days of release.
Takeaway

Four settings decisions, one focus rule.

WHAT TO LEARN

Opus 4.8's gains are real — but whether you capture them comes down to a handful of settings choices and one behavioral discipline.

  • Switching to Opus 4.8 is a zero-downside move: same price, higher benchmark ceiling, 4x fewer hallucinations — there is no reason to stay on 4.7.
  • The million-token context window degrades output quality as it fills; standard context with disciplined session resets performs more reliably.
  • High effort is the correct default; Extra and Max are reserved for genuinely complex multi-file builds, not routine prompts.
  • Upgrading agent framework configs before official SDK releases causes crashes — waiting 24 hours for the official rollout prevents most instability.
  • /fast mode and Ultracode are practically reserved for the $200/month plan; activating them on lower tiers depletes rate limits faster than they generate value.
  • Sending a prompt and doom-scrolling while the model works is the single biggest productivity leak in AI-assisted coding workflows.
  • Early adopters of Dynamic Workflows hold a concrete advantage — most competitors are not yet using parallel sub-agent swarms.
Glossary

Terms worth knowing.

Dynamic Workflows
A Claude Code feature that automatically spawns tens to thousands of parallel sub-agents to work on a complex task simultaneously, rather than running a single sequential agent.
Ultracode
A Claude Code operating mode that gives the model full autonomy to invoke Dynamic Workflows whenever it judges them appropriate, without per-task user approval.
Mythos
An unreleased Anthropic model teased for several months, described as capable of advanced cybersecurity tasks. Opus 4.8 is characterized as a slightly weaker public proxy for Mythos.
/fast mode
A Claude API flag that prioritizes response speed at a cost premium. Repriced for Opus 4.8 from 6x to approximately 2x the standard rate.
Hermes / Open Claw
Third-party agent orchestration frameworks that wrap Claude models. These require separate official SDK releases before safely upgrading to a new model version.
Resources

Things they pointed at.

Quotables

Lines you could clip.

00:48
I actually believe this is Mythos but kinda a little bit weaker.
Bold claim with no hedging — immediately polarizing for the AI community.TikTok hook↗ Tweet quote
01:57
Their fast mode was six times more expensive, which is untenable.
Plain-language verdict on a pricing problem most power users felt but did not name.IG reel cold open↗ Tweet quote
04:00
Dynamic Workflows is now Claude Code's ability to tackle months of work in just a day.
Punchy capability claim, no jargon.TikTok hook↗ Tweet quote
08:15
The number one indicator of how successful someone will be in 2026 is their level of focus.
Contrarian framing — productivity insight inside an AI video.newsletter pull-quote↗ Tweet quote
11:43
Your competition probably isn't using the dynamic mode that sends out tens of thousands of sub agents.
Competitive urgency framing — early-adopter advantage claim.IG reel cold open↗ Tweet quote
The Script

Word for word.

metaphoranalogy
00:00Opus four eight is out, and it's an absolute smash home run. They implemented so many changes, and a lot of these were hidden. They weren't even in the announcement.
00:08In this video, we're gonna go over every single one of those changes. I'll tell you exactly what you need to do to take advantage of all these changes, and we'll even run some fun tests with it. If you stick with me until the end, you are going to be a master of the most powerful technology that's out there right now, Claude Opus 4.8.
00:26Let's go. Here we go. We are not a channel where we sit here for twenty minutes reading blog posts.
00:31Let's quickly go through all the changes, then we'll get straight into the product. First things you need to know, it smashes all the benchmarks. It beats Chad GPT five five and all the other models in all the benchmarks.
00:42Google's not even close. No one else is even close. This destroyed all the benchmarks.
00:46I actually believe this is a kinda watered down version of mythos. I'll go into that in a little bit.
00:52I'll go into that a little bit, but I actually believe this is mythos but kinda a little bit weaker. It's the same cost. This is mind blowing, and I think this is a result of all the compute Elon Musk just gave to Claude, but it is the same cost as Opus four seven, which is this is the first release in quite a bit of time where the price didn't go up.
01:13For both OpenAI and Anthropic, all their new releases, the price ticked up slowly and slowly and slowly. This is the first one in a while where the cost did not go up, which, again, just a few weeks ago, Elon sold to Anthropic tens of billions of dollars of compute. I think that is a direct result of that deal, which is really amazing.
01:33Here's a big one and also a big reason. I think Elon's helped out Anthropic a lot. Their fast mode is cheaper.
01:40A big reason I have been using Chad GPT five five inside Codex more the last couple weeks is their fast mode is dirt cheap. You get way better performance for not that much more money.
01:53Claud, their fast mode was six times more expensive, which is untenable. Their limits were already low, so their fast mode wasn't worth it.
02:02Now their fast mode is three times cheaper than it was before, which if we do the advanced algebra here, comes out to just two times more expensive than the regular mode if my math is correct there. So their slash fast mode is actually affordable if you're on the $200 plan, and we'll go into exact recommendations right after this, so stick around.
02:22Beats Chad g b t five five. I thought five five was the first model ever that beat Opus at coding, but this one came right back and beat it four times less hallucinations. So this is a big one, and this is a big reason why I think this is actually just mythos but watered down a little bit is this is one of the things they showed off with mythos was the reduction in hallucinations, about four times reductions.
02:46It's matching mythos and a lot of things they advertise it for. And for those who are new to the channel, kinda new to the AI world, mythos is this model that Claude has been advertising now for a couple months. They've been advertising as some sort of doomsday model that's outrageously good that can hack any website.
03:02They've been teasing us a bit. It appears we're getting closer and closer. One big thing to notice also in their announcement blog post, this wasn't in the tweet as well, they expect to bring mythos class models to all the customers in coming weeks, which means they're doing it.
03:18They're actually gonna release Mythos. Again, I think Elon Musk saved Anthropic from the dead here.
03:24They were starting to lose in every single facet because of their lack of compute, and now they're gonna release Mythos. I don't think it's any coincidence that they're increasing limits, making prices cheaper, and releasing super powerful models within weeks of buying tens of billions of dollars of compute from Elon Musk.
03:42Now from a new functionality perspective, here's the two big things, and we're gonna demo this when we get into the product.
03:49Dynamic workflows in Ultracode. What are these two things?
03:53Why are they so big and important? Dynamic workflows is now ClaudeCode's ability to tackle months of work in just a day.
04:04What does that mean? If you give Opus four eight a very complex task, a big, juicy, meaty, girthy complex task, it will now spin up between tens to thousands of sub agents to tackle that task.
04:20Say you were trying to implement a really big new feature feature or or one one shot a big app. Before, if you gave it to Opus, it would just have one agent go there and add some code, take some code away, add some code, do some research, add some code. Now it's going to take literally thousands of those agents, send them out.
04:39They're all gonna be touching different pieces of your code base, doing research, testing things out. And simultaneously, these tens of thousands of agents are gonna be writing code, testing, using the app, doing regression tests, a whole bunch of things.
04:54It's going to be really, really powerful, and it is now in Opus four eight. Again, this is going to allow you to do months of work in just one afternoon. And then you have ultracode mode, which is basically giving the keys of the kingdom to Claude code, to Opus four eight, and say, hey.
05:11Use dynamic workflows whenever you want. This is all you're only using this if you got that $200 a month plan. So this is Opus four eight.
05:18I'm gonna go into my exact recommendations on how to use it in a second, then we'll go into the product and demo it out. But, again, absolutely massive changes here. Let's go in the recommendations.
05:28Number one, switch all tasks to Opus four eight. There's no reason not to. There's no reason not to go in to Claude code right now, pull it open, and choose Opus four eight.
05:39Now you can choose the million context if you want. I find the million context, you don't absolutely have to use it. I find once you start to fill up that million context window, the performance actually degrades a good amount.
05:51So I'm actually an Opus four eight, which is the regular context type of guy. From a effort perspective, I'm recommending doing a high by default. And then when you are building out much bigger things, switching to extra or max.
06:06But I would, by default, stay in high, and then only switch to extra and max when you have to. Despite the fact that Papa Elon allowed Anthropic to have much more compute and capacity, It's still not as high capacity as ChadGBT.
06:19So I'm sticking with high for default then do an extra in max if necessary. When it comes to Hermes and Open Claw, I wouldn't move it to Opus four eight just yet.
06:28This is a big mistake a lot of people make as they try to force their agents into using the latest version Opus the moment it comes out. The issue is this invariably leads to errors, leads to crashes, and errors and crashes in Open Claw and Hermes are not the most fun to solve.
06:43I would wait until the official releases, which typically come within twenty four hours of the release of the model. So once it officially releases, then you switch it over, and you'll have way less crashes and and way less bad reliability.
06:57As for the slash fast mode and the new ultra code mode, I'm only using those if you're on the $200 a month plan. And even if you're on the $200 a month plan, I don't know if I'm using it for every single prompt.
07:09Like, ChadGPT codecs, I'm using fast mode for literally everything because they give you so much capacity. Claude, their limits still aren't as high as ChadGPT.
07:17So for me, I actually have extra usage on, which means once I get past limits, I just pay through the API. So I'm actually gonna be using fast and ultra code for almost everything. But for you, if you don't have the extra capacity, if you're not on $200 a month plan, then I wouldn't use these modes.
07:32Totally up to you. And here's the last recommendation before we get into the products and we do some cool demos. You need to lock the hell in.
07:39You need to lock the hell in. I've been working with a lot of people lately, watching how they vibe code, seeing what they do. One issue I'm seeing is AI is enabling a lot of people to get wildly distracted.
07:51They will send a prompt to their AI, and then they will go and doom scroll for an hour despite the fact that their AI finished the task, like, fifty minutes earlier. You cannot get distracted. If you can get into a flow state and lock the f in, you are going to get so much more done.
08:08I truly believe the number one indicator of how successful someone will be in 2026 is their level of focus. Do not allow this extra power to mean you can slack off more.
08:21Use this extra power to get more done. So really work on your focus. Put the phone away.
08:27Close social media. Close Twitter. Close YouTube.
08:30And just lock in, and you'll get so much more out of this tech. Now let's jump into the product and build some cool things out. I'm using Claude Co desktop.
08:37You can use the CLI or the extension to take advantage of Opus four eight right now. I'm going to run one of the world famous Alex Spin benchmarks on this model. This is a benchmark I've ran on every single model.
08:49Up until now, Opus four seven's actually been king with by far the best scores in all four of these tests. We're gonna run the three d first person shooter test here, see how it does, see how it compares to the other models. If you wanna run this benchmark yourself, I'll put the prompt for this down below so you can run your own world famous Alex Finn benchmark.
09:10I'm gonna hit enter on this, and I'm gonna send it off, and we're gonna see how it does. Basically, what we're gonna have it do is we're giving it creative freedom. We're saying build a three d first person shooter using three JS.
09:19Do whatever you want. Make it as creative as humanly possible. Add power ups.
09:24Do whatever you want. We'll see how good Opus does here. Side note, remote control is active.
09:29I there's actually a setting in Claude code not many people know about. You should be using this setting. It turns remote control on by default for every single chat.
09:39What this allows you to do is whenever you spin up a new chat in Claude, you can actually go on your phone, go into the code section in the top left. And as you can see here, that chat I just started is now on the screen. Create the stylistic three d first person shooter.
09:56So I can now go mobile whenever I want with every single chat I start. So make sure to turn that on. A little tip for you there.
10:01A little bonus tip. Go in the settings. Turn on remote controls active.
10:05The only thing I ask for for that tip is you tip me with a like down below. Subscribe if you learned anything so far. Turn on notifications.
10:13And I'm going to do a full boot camp on Opus 48 tomorrow in the Vibe Coding Academy. Make sure to join that number one AI community on planet Earth. Link down below.
10:22Best decision you'll ever make in your entire life. Alright. Looks like it's done.
10:25It even tested itself, which is sick. Let's see how this is. Neon assault.
10:29It's always neon themed. I have no idea why. First model that makes a non neon themed game, I'm gonna give it a 10 out of 10.
10:35Here we go. Let's engage. This is nice.
10:39This is nice. These graphics are very, very nice. Much I mean, if you're this is your first time watching my channel, you might think, what the hell is this guy talking about?
10:46This sucks. This isn't cyberpunk 2027. But if you compare this to the default apps that previous models have built, this is pretty nice with from the walls to the ground.
10:56These are the enemies. Oh, to the way the gun shoots, to the way you can see hit markers on the enemies.
11:02I assume these are even the power ups look nicer.
11:09Wave two. So they got combos. They got waves.
11:12This is for sure an upgrade and probably the best version of this we've seen yet. Oh, this is an enemy.
11:19Okay. This is probably a step above what four seven gave to me. Probably just a small step.
11:26So I'm gonna give it a 9.1. I'm gonna run the next three benchmarks probably on a livestream the next week. If you wanna see that, make sure to turn on notifications down below for that.
11:36Again, here's a reminder of my recommendations. You wanna be jumping on this now. When they release new technology, you have a distinct advantage if you start using it right away.
11:48Your competition probably isn't using Opus 40. They're probably not using the dynamic mode that sends out tens of thousands of sub agents. They're probably not using that.
11:56So if you go and you use this tech and you build out really, really cool things, you are going to have a distinct advantage over the rest of the field. So you wanna make sure today, carve off some time in your calendar, go on do not disturb mode, close out all the doom scrolls you got, the tickety tocks, the Twitters, all of that, and lock in and use this and build cool things because you have an advantage right now over everyone else if you take advantage of all these different features and functionality they just released.
12:26Let me know what you want next about Claw. Do you want tutorials and how to build really complex apps? You want deep dives into functionality?
12:32Do you want more benchmarking to see if it's the best? Let me know down in the comments. I'm super curious what you want.
12:38All my videos are based on your feedback. I hope this is helpful. See you in the next
The Hook

The bait, then the rug-pull.

The headline is blunt: every benchmark broken, same price as the model it replaces, and two new features hidden from the announcement post. Alex Finn opens with the claim that Opus 4.8 is not a new model category but a public release of Mythos — the doomsday-coded model Anthropic has been teasing for months — just dialed back slightly.

Frameworks

Named ideas worth stealing.

05:28list

Effort level ladder

  1. High (default)
  2. Extra (complex builds)
  3. Max (last resort)

Three-tier effort system in Claude Code. High is the recommended default; Extra and Max reserved for genuinely complex multi-file tasks.

Steal forAny team setting Claude Code policies for engineers
06:25concept

Agent framework upgrade timing rule

Wait 24 hours after a model release before updating Hermes or Open Claw configs. Official SDK releases absorb the instability that day-one API changes introduce.

Steal forOperations runbooks for teams running agent pipelines
CTA Breakdown

How they asked for the click.

VERBAL ASK
10:05subscribe
Subscribe if you learned anything so far. Turn on notifications.

Mid-video subscribe ask tied to a bonus tip (remote control setting), framed as an exchange of value. Repeated at outro with a community pitch.

MENTIONED ON CAMERA
Storyboard

Visual structure at a glance.

intro slide
hookintro slide00:00
changes begin
promisechanges begin00:27
Mythos screenshot
valueMythos screenshot03:15
dynamic workflows
valuedynamic workflows04:00
Claude Code UI
valueClaude Code UI05:28
recommendations
valuerecommendations05:40
benchmarks sheet
valuebenchmarks sheet08:36
FPS demo
demoFPS demo10:29
closing recs
ctaclosing recs11:28
Frame Gallery

Visual moments.

Chat about this