The argument in one line.
The best coding model is now the best general-purpose model, and whoever builds the most polished interface around it will dominate knowledge work and AI agents for the next decade.
Read if. Skip if.
- A software engineer or technical founder tracking the 2026 AI agent landscape who needs to understand which tools actually ship product polish versus hype.
- Someone building with or evaluating Claude, Codex, or open-source agents who wants insider perspective on their actual competitive positioning and capability gaps.
- A creator or knowledge worker curious whether AI agents will meaningfully change your workflow in the next 12 months, not just coding.
- You're looking for a technical deep-dive on model architecture or how to implement these tools — this is landscape analysis and product positioning, not a tutorial.
- You haven't used any AI coding assistant yet and want a beginner-friendly introduction — this assumes familiarity with Cursor, Claude, or similar platforms.
The full version, fast.
The 2026 AI agent wars reshaped rapidly: OpenClaw became the fastest-growing open-source project in history, Codex emerged as the product-polish leader for both vibe coding and knowledge work, Anthropic spread across 50+ new features while arguably losing ground on execution, and SpaceX's semi-acquisition of Cursor signaled that AI coding tools are now strategic infrastructure. The core thesis is that a great coding model is now the only model that matters — because coding ability is the best proxy for reasoning ability across all knowledge work. The OpenClaw and Hermes proactive-agent paradigm represents a different layer: agents that act without being prompted rather than responding to commands. Knowledge work is about to be disrupted more severely than coding, and the agent platform that wins the developer market will likely win that broader transition too.
Chat with this breakdown.
Modern Creator members can chat with any breakdown — ask for the hook, quote a framework, find the exact transcript moment. Unlocks at T2: refer 3 friends + add your own API key.
Create a free account →Who's talking.
Where the time goes.

01 · Cold open + thesis
Riley solo-frames the four-month AI-agent firehose: OpenClaw's Mac-mini shortage, Anthropic's 50+ feature blitz, OpenAI's Codex surge, SpaceX semi-acquiring Cursor. Names today's guest Ras Mic and previews the full tour.

02 · Q1 shipping frenzy
They name Q1 'Anthropic's quarter' — January through March of nonstop feature drops. Cowork, OpenClaw craze, sold-out Mac minis, two-to-three-month wait lists. Ras Mic flags that Opus 4.5 was the real inflection point.

03 · Opus 4.5 inflection + Anthropic going too broad
Andrej Karpathy's 'never felt this much behind as a programmer' tweet converted the skeptics. But Anthropic then sprawled — Claude Code vs Cowork vs Schedule vs Routine vs Dispatch vs Remote Control — five names for the same primitive. Riley argues OpenAI got focused exactly when Anthropic lost focus.

04 · Codex vs Claude UX war
Codex is winning the bubble. Free usage, raised limits, smoother in-app browser, design mode, terminal-to-GUI continuity. Claude bans heavy users; Codex courts them. 'A great coding model is a great general-purpose knowledge model' — so whoever wraps the best GUI around it wins.

05 · Cursor's bet
Cursor was the innovator — Composer, the agents tab, in-app browser, sandbox preview videos — but their $200 Ultra plan can't compete with subsidized Codex/Claude compute. They are not a model provider. Hence the SpaceX deal.
06 · SpaceX–Cursor acquisition deal
$10B now, $60B opt-in acquisition. xAI has the H100s and Colossus in-house; Cursor has the GUI and distribution. The bet: xAI focuses purely on the model, Cursor becomes its consumer surface.
07 · OpenClaw and the agentic personal computer
Ras Mic walks through his sponsor-vetting workflow on OpenClaw: every inbound email gets researched (scam check, company profile, funding) and the agent only escalates the vetted ones. He frames the 15-minute heartbeat as the real unlock — agents that ping themselves get something close to surprise agency.
08 · Reactive vs proactive work
Riley splits the world in two: 'super app' tools you talk to (Codex, Claude, Cursor — reactive) and OpenClaw-style agents that talk to you (proactive). He says he no longer opens ChatGPT or Claude directly — he iMessages his OpenClaw.
09 · Keep agents narrow
The dominant failure mode is giving one agent 40 skills and 20 connectors. Treat agents like new hires — would you give a new employee 40 jobs on day one? Use one orchestrator agent that delegates to narrow sub-agents.
10 · Memory layer matters
Markdown won. OpenClaw's native memory is just .md files (agents.md, user.md, memory.md). Ras Mic uses Super Memory as a portable graph so memory survives nuking his instance. Obsidian's bet on plain markdown is paying off.
11 · Computer use gets fast
Opus 4.7's image resolution bumped to roughly Mac-screen dimensions specifically for computer use. Browser-use harnesses are appearing; Vercept got scooped by Meta then ended up at Anthropic. The takeaway: 'enjoy this brief period where you can still watch the browser move.'
12 · Who wins the super app
Manus was first; Cursor was first on the agents tab. Both proved you don't actually win by being first — you win by being best. Genspark stalled around $300M. The real prize is post-launch growth velocity, and Codex/Claude have it.
13 · The problem with Google
Google has GDP-level money, infinite talent, and the smartest model on knowledge — but tool-calling is embarrassing and team silos are vicious. Gemini, Notebook LM, AI Studio, Antigravity, Stitch — five product surfaces where there should be one. They're one model train away from being a real contender.
14 · Prompting and skills that scale
LLMs predict tokens — they don't understand. The English you use is the lever. WhisperFlow > typing because speaking is faster than thinking-while-typing. Skills should be domain expertise specific to YOU, not a downloaded pack. Build one skill by reverse-engineering a successful agent run — recursive onboarding.
15 · Integrations beat prompts
Context management > prompt engineering. The best 'prompt' for OpenClaw might be a Linear connector + a ticket title. Codex's Chronicle screenshots your screen every few seconds to build context passively. Aravind: the user is never wrong — everything that goes wrong is on the tool.
16 · All the muxes (cmux, tmux, dmux)
A small detour into terminal multiplexers. CMUX is built on libghosty, gives you a sidebar of terminal tabs plus a browser. The companies all push GUIs, but focused work still feels better in a terminal — 'I don't even know why, but a terminal makes me work on one thing.'
17 · Bold predictions
Coding got better but not the rapture everyone predicted — good engineers still vastly outrun vibe coders. The real shock is knowledge work: Ras Mic ran a 27-page contract through Claude instead of paying a $1000-an-hour lawyer; his friend did the company books with Clark Max and the CPA assumed he was certified. Knowledge work is about to get rocked.
18 · Agent commerce + image-gen scams
Stripe just launched issuable agent cards with addresses, spend limits, and human-in-the-loop approvals. x402 (Coinbase) and Stripe's machine-payment protocol race for the agent-to-agent rail. Image gen got so good it's invisible to Ras Mic now — he NBA-photoshopped himself in one prompt and his family believed it. Voice-cloning + image gen = a new scam wave coming for Facebook-aged relatives. He set up a paraphrase code with family.
19 · Permission to build now
Closing arc. Information is so cheap that what matters now is agency — actually doing the thing. They both reject the doomer line: even if anyone can build, most people won't. You don't need permission, a $4K camera, or a $20 subscription. Just start. Sign-off + handshake.
Lines worth screenshotting.
- OpenClaw became the fastest-growing open-source software in history in early 2026 — causing a Mac Mini shortage as developers rushed to run it locally.
- Q1 2026 belonged to Anthropic — they shipped a new feature every single day from January to March, then lost the lead to OpenAI's Codex by Q2.
- Codex has overtaken Claude on product polish for everyday coding and knowledge work — the model quality gap that once made Claude dominant has been closed.
- SpaceX acquiring a stake in Cursor creates a well-capitalized non-lab competitor with distribution that neither Anthropic nor OpenAI has in the physical world.
- Knowledge work is about to be disrupted harder than coding — coding has absorbed AI shocks incrementally, but knowledge workers haven't fully felt the wave yet.
- OpenClaw's proactive-agent paradigm — taking action without being asked — represents a different category of tool than any chat-based or cursor-based coding agent.
- All major AI labs are converging on building the same super app: a single interface for coding, writing, web browsing, and agentic task execution.
- Anthropic spreading features across 50+ launches in one quarter is a sign of competitive pressure, not strategic clarity — shipping too many things signals you don't know which one wins.
- Agentic payments — an AI agent completing a purchase autonomously on your behalf — is the capability that will determine which agent platform controls consumer commerce.
Codex Wins by Courting the Users Others Ban
The 2026 AI agent race comes down to who wraps the best interface around the best model — and right now Codex is winning by subsidizing usage while everyone else throttles it.
- The framing is a four-month AI-agent firehose — Codex, Anthropic sprawl, SpaceX-Cursor, OpenClaw — map the whole board before picking a winner
- Q1 was Anthropic's quarter in volume — but volume without focus is a liability
- Opus 4.5 was the real inflection point, not the feature count
- Anthropic named five different primitives for the same underlying capability — naming confusion is product confusion
- OpenAI got focused exactly when Anthropic lost focus — that timing is the story of Q1
- Codex is winning the bubble: free usage, raised limits, in-app browser, design mode, GUI continuity
- A great coding model is a great general-purpose knowledge model — the GUI wrapper is the real product
- Cursor innovated first — Composer, agents tab, in-app browser, sandbox previews — but first does not mean winner
- A $200 Ultra plan cannot compete with subsidized compute from model providers
- xAI has the H100s; Cursor has the GUI and distribution — the deal separates model from surface
- Vertical integration is how you compete with OpenAI and Anthropic without being either
- A 15-minute heartbeat gives agents something close to surprise agency — they ping themselves rather than waiting
- Workflow example: every inbound email researched automatically, only vetted ones escalated to the human
- Split your stack: reactive tools you talk to versus proactive agents that message you — the second is the real unlock
- When you stop opening ChatGPT directly and start getting pushed alerts, the workflow has inverted
- One orchestrator agent delegates to narrow sub-agents — never give one agent 40 skills on day one
- Treat agents like new hires: narrow scope, clear handoffs, no context-switching between job families
- Markdown won — plain .md files are the native memory format and they survive nuking the instance
- Portable memory graphs that work across tools beat proprietary memory that locks you to one platform
- Image resolution bumped specifically to Mac-screen dimensions for computer use — the hardware assumption is now baked into the model
- Enjoy the brief period where you can still watch the browser move — automation speed will make that window invisible soon
- Being first proves the category but does not capture it — Manus and Cursor both demonstrated this
- The real prize is post-launch growth velocity, not launch-day press
- Five product surfaces — Gemini, Notebook LM, AI Studio, Antigravity, Stitch — where there should be one is an organizational problem, not a technical one
- One model train could make them a real contender — the capability is there, the org is the bottleneck
- LLMs predict tokens — they do not understand — so the English you use is the actual lever
- Build skills by reverse-engineering your own successful agent runs, not by downloading someone else's pack
- Context management beats prompt engineering — the best input might be a ticket title via a connector, not a crafted paragraph
- The user is never wrong — everything that goes wrong is on the tool to fix, not the user to work around
- Terminal multiplexers give you sidebar tabs and browser in one surface — but focused terminal work still feels better than GUI switching
- A terminal that forces single-task focus is a feature, not a limitation
- Coding got better but not the rapture — good engineers still vastly outrun vibe coders
- Knowledge work is the real shock: contract review and company books done without specialists are already happening
- Issuable agent cards with spend limits and human-in-the-loop approvals are live — agent commerce rails are not theoretical
- Voice-cloning plus image-gen creates a new scam wave — set a paraphrase code with family before you need it
- Information is so cheap that agency is now the scarce resource — most people will not build even when they can
- You do not need permission, expensive equipment, or a subscription — start with what you have
Terms worth knowing.
- Codex (OpenAI)
- OpenAI's AI-powered coding agent platform that can autonomously write, test, and run code in cloud sandboxes, competing with Claude Code and other AI developer tools.
- OpenClaw
- An open-source autonomous AI agent framework that runs locally on a computer, often on a Mac Mini, executing tasks like browsing, file management, and code execution with minimal supervision.
- Super app (AI)
- An AI platform that aims to consolidate multiple workflows — coding, writing, browsing, research, scheduling — into a single agent-driven environment, eliminating the need for separate specialized tools.
- Computer use
- An AI capability that allows a model to visually perceive and interact with a computer screen — moving the cursor, clicking buttons, typing — as a human operator would, enabling browser and desktop automation.
- Browser use
- A specific form of computer use where an AI agent navigates web browsers autonomously — filling forms, clicking links, reading pages — to complete tasks that require live web interaction.
- Agentic payments
- The ability of an AI agent to autonomously initiate financial transactions on behalf of a user — such as purchasing a subscription or completing a checkout — without requiring manual approval for each transaction.
- Knowledge work
- Professional work that primarily involves processing, creating, or applying information — including writing, analysis, research, strategy, and decision-making — as distinguished from physical or manual labor.
- Vibe coding
- A development style where a programmer guides an AI agent with natural language prompts rather than writing code manually, trusting the model to handle implementation details.
- Mac Mini (AI context)
- Apple's compact desktop computer that became a popular local AI server for running open-source agents like OpenClaw, leading to reported supply shortages in early 2026.
- Agent wars
- Informal term for the competitive race among major AI labs and startups to build the dominant AI coding and productivity agent, with OpenAI Codex, Anthropic Claude Code, and open-source alternatives as the primary contenders.
Things they pointed at.
Lines you could clip.
“Opus 4.5 was the inflection for me. It didn't happen right away — it took a week or two when we all collectively realized, like, woah, this is good.”
“I've never felt this much behind as a programmer — that Karpathy tweet was the final moment where the skeptics said fine, we'll use it.”
“A great coding model is a great general-purpose knowledge model. They're all just files in a file system.”
“Anthropic doesn't care about people in the bubble — they only care about perception outside the bubble. And they won that.”
“The way I frame it is it's a $10 billion acquisition with a $60 billion opt-in clause.”
“I'm more excited about this knowledge-work category than I am about the coding improvements. The models will get more intelligent — but this is where my time goes back to my family.”
“If you came in on day one and listed 40 things you could do, I'd be like — wait, what are you actually good at? Just say coding.”
“Markdown just won. The Obsidian founder bet his company on 'own your files' and now he's winning.”
“You don't have to be first — you have to be best. Cursor was first on the agents tab. Manus was first on the super app. Neither of them won.”
“The English you use is way more powerful than you think. The model doesn't think — it predicts the next token. So if you give it garbage, you get garbage.”
“I had a 27-page contract. The lawyer's quote was a thousand dollars. I ran it through Claude and asked 'where can they screw me?' — it broke down every page. That's the unlock.”
“I have a paraphrase with my family now — because my X account got hacked. If someone sounds like me and asks for money, ask them for the paraphrase.”
“Information is so cheap. What matters now is actual agency — searching for the information, trying to do something, figuring it out. It is the best time to be alive to build.”
Where the conversation goes.
Word for word.
The bait, then the rug-pull.
Riley Brown opens cold on a solo shot with one number on screen — 2026 — and a thesis: the agent wars have spun out so fast that nobody, not even people in the bubble, can keep up. He pulls in Ras Mic for an 80-minute audit of who's actually winning the super-app race, why Anthropic's lead is slipping, and what knowledge workers should brace for next.











































































