The argument in one line.
Claude Code and OpenAI Codex can build complete full-stack applications from a product spec in under 32 minutes using the /goal feature, which enables long-running agents to work autonomously through multi-phase task lists until verification confirms completion.
Read if. Skip if.
- You're a founder or product manager with a spec doc ready who wants to see how Claude Code handles complex multi-step builds compared to alternatives.
- A developer exploring AI coding agents for the first time and wants a concrete side-by-side comparison of how /goal actually performs on a real project.
- You're building a Next.js SaaS app and want to understand whether Claude Code's /goal feature can handle your full product roadmap without manual intervention.
- You're already shipping production apps with Claude Code or Codex — this is an introductory comparison, not an advanced optimization guide.
- You need to build something outside the Next.js stack or with custom infrastructure that doesn't fit standard SaaS patterns.
- You're looking for how to fix or debug broken AI-generated code — this focuses on the feature itself, not troubleshooting workflow.
The full version, fast.
Claude Code's new /goal command and Codex CLI's equivalent let an AI coding agent grind through a long-running task autonomously, looping turn after turn until a defined completion condition is met instead of returning control after each step. The mechanism pairs a measurable end state (typically a checked-off roadmap of 40-80 tasks tied to a PRD) with auto-mode permissions, so a small verifier model decides after each turn whether the goal is satisfied or another iteration is needed. To use it well, write detailed spec documents first�a PRD, a product roadmap, and a design.md�then phrase the goal with one verifiable end state plus constraints. In a head-to-head, both agents scaffolded a full Next.js content-repurposing app across 62 tasks in roughly 32 minutes, producing a working offline-first foundation ready for credential wiring and deployment.
Chat with this breakdown.
Modern Creator members can chat with any breakdown — ask for the hook, quote a framework, find the exact transcript moment. Unlocks at T2: refer 3 friends + add your own API key.
Create a free account →Where the time goes.

01 · Introduction and promise
Host introduces /goal as the feature that changes how everyone builds with AI, frames the video as both an explainer and a live Claude vs Codex head-to-head build.

02 · How /goal actually works
Documentation walkthrough of official Claude Code and Codex docs plus the host's own guide. Explains the loop: a small fast model checks the condition after every turn; if unmet, Claude starts another turn. Up to 4,000 characters in the condition.

03 · Project setup and goal submission
Shows the project folder structure: CLAUDE.md and AGENTS.md at root, docs folder with PRD, product-roadmap.md (62 tasks across 6 phases), and design.md generated via PLAID skill. Submits /goal to both Claude Code and Codex simultaneously in separate terminals.

04 · Goal completes -- terminal summaries
Both agents finish in about 32 minutes. Claude Code: all 62 roadmap checkboxes flipped, committed to local git, dev server running. Codex: same. Neither has live integrations -- Convex, Clerk, Polar, YouTube OAuth all stubbed.

05 · Codex app walkthrough
Browser demo of the Codex-built Content Engine app: landing page with red accents and Geist font, demo workspace, review queue with per-platform tabs, regenerate modal, billing page. Chris is impressed by the design quality driven by design.md guidance.

06 · Claude Code app walkthrough
Claude built Content Machine with the same design language but richer copy, testimonial section, more verbose landing page. Review queue functional. Billing and settings pages present. Claude's copy wins; Codex's tab UI wins.

07 · Summary and best practices
Three keys: right spec docs, right context, clear end condition. /goal shifts the paradigm from prompt-by-prompt to long-running agentic builds. CTA to school.com/aiapps community.
Lines worth screenshotting.
- The /goal command tells Claude Code to keep working after each turn — a small fast model checks whether the completion condition is met and restarts Claude if it is not.
- A good goal is bigger than one prompt but smaller than an open-ended backlog — it defines what to achieve, what to change, how to validate progress, and when to stop.
- The measurable end state for /goal should be a roadmap of 40-80 tasks that the agent can check off and verify as complete, not a vague description of a desired outcome.
- A product roadmap paired with a PRD gives Claude Code the structured context it needs to work autonomously for hours without drifting off-spec or repeating completed work.
- The /goal feature is an evolution of Ralph loops — it uses the same principle of loop-until-done but builds the loop natively into the CLI rather than requiring a bash wrapper.
- Running /goal with auto mode enabled gives the agent permission to make tool calls without requesting approval at each step, which is what enables multi-hour unattended runs.
- The condition text for /goal can be up to 4,000 characters — enough to specify not just the end state but also what must not change along the way.
- Two agents — Claude Code and Codex — submitted identical goals with identical spec docs built a complete 62-task Next.js app in 32 minutes, demonstrating that goal-based builds are now reproducible.
- The /goal feature shifts the human's role from prompter to reviewer — you submit the goal, wait for completion, and evaluate the result rather than managing each step.
- Constraints inside the goal condition — do not touch this file, preserve this API contract — prevent the agent from breaking existing functionality while completing new work.
- Both Claude Code and Codex implemented the /goal feature independently, which signals that long-running autonomous agent loops are the next standard paradigm for AI-assisted development.
- The weakest part of a /goal run is still the spec — a roadmap with vague task descriptions produces a working app with wrong decisions baked in, while a precise roadmap produces something reviewable.
The /goal Command Shifts AI Coding From Prompt-by-Prompt to Long-Running Agentic Builds
Chris Ashby's 62-task head-to-head shows that /goal with proper spec documents — PRD, roadmap, and design file — produces a complete full-stack application in 32 minutes, and that the quality difference between Claude Code and Codex comes down to copy quality, not architecture.
- A small fast model checks the completion condition after every agent turn — if unmet, the agent starts another turn without human input
- Up to 4,000 characters of condition specification — the condition should be specific enough to be machine-checkable, not a vague description of done
- Folder structure: CLAUDE.md and AGENTS.md at root, docs folder with PRD, product-roadmap.md with 62 tasks across 6 phases, and design.md generated via design skill
- The product roadmap with explicit checkboxes is the completion condition — the agent flips checkboxes as it completes tasks, and done means all 62 are checked
- Both agents completed in approximately 32 minutes — Claude Code committed to local git and started the dev server; Codex did the same
- Live integrations were stubbed in both builds — this is expected and correct behavior for a spec-driven scaffold build
- Same design language as Codex build but richer copy — Claude's writing quality is the differentiated output when both agents receive identical spec inputs
- Both apps had functional review queues and billing pages — the spec documents drove equivalent architectural coverage
- Three keys: right spec documents, right context, clear end condition — without all three, the loop either never completes or completes the wrong thing
- The paradigm shift: from prompt-by-prompt supervision to defining the outcome and reviewing the result
Terms worth knowing.
- /goal
- A slash command in Claude Code and Codex CLI that sets a completion condition and lets the agent keep working autonomously, checking after each turn whether the goal is satisfied before continuing or stopping.
- Claude Code
- Anthropic's command-line coding agent that runs inside a terminal, reads and edits files in a local project, and executes shell commands on the developer's behalf.
- Codex CLI
- OpenAI's command-line coding agent, a terminal-based counterpart to Claude Code that uses GPT models to plan and execute software tasks against a local codebase.
- PRD
- Product Requirements Document — a written spec that describes what an app should do, who it's for, and how features should behave, used as input context for AI coding agents.
- Product roadmap
- An ordered checklist of build tasks broken out of a PRD, used as a verifiable to-do list the agent ticks off as it completes each step.
- design.md
- A markdown file following an open-source Google format that captures visual direction — colors, typography, spacing, components — so AI coding tools produce consistent UI instead of generic defaults.
- Ralph loop
- An older pattern of looping a coding agent through a defined task list one item at a time so it completes each task and moves on without needing a new prompt at every step.
- Auto mode
- A setting in Claude Code and Codex that pre-approves tool calls so the agent can keep running without stopping to ask permission for each file edit or command.
- Plan mode
- A mode in Claude Code and Codex where the agent first drafts a written plan of what it will do and asks for confirmation before touching any code.
- CLAUDE.md / AGENTS.md
- Project-root instruction files that Claude Code and Codex read on startup to pick up house rules, conventions, and guardrails for that specific codebase.
- Next.js
- A popular React-based web framework used to build full-stack JavaScript apps with built-in routing, server rendering, and API endpoints.
- Convex
- A hosted backend platform that provides a real-time database and serverless functions, commonly wired into Next.js apps as the data layer.
- Clerk
- A drop-in authentication service that handles sign-up, sign-in, and user sessions for web apps via prebuilt components and APIs.
- Polar
- A payments and subscription platform aimed at developers and creators, used to handle billing, plans, and checkout inside SaaS apps.
- Playwright
- A browser-automation framework used by developers and AI agents to script and verify how a web app behaves in a real browser.
- Scaffold
- To generate the starting file and folder structure of a new app — boilerplate config, dependencies, and entry files — before any feature code is written.
- Stubs and mocks
- Placeholder code and fake data wired into an app where real services aren't connected yet, so the UI runs end-to-end before credentials and integrations are added.
- Environment variables
- Configuration values like API keys and service URLs kept outside the codebase and loaded at runtime, typically required before an app can talk to external services.
- OAuth
- An industry-standard authorization protocol that lets an app request limited access to a user's account on another service, like pulling videos from a user's YouTube channel.
- Magic moment
- The point early in onboarding where a new user first experiences the core value of a product, often used as a design target inside PRDs.
Things they pointed at.
Lines you could clip.
“They stole it directly from Codex.”
“You could legitimately build a full application in one go to a very high quality.”
“This is almost an evolution of the Ralph loop.”
“The creation process is getting condensed massively with these new features to the point where its even viable to build a full application in just a matter of hours sometimes.”
Word for word.
The bait, then the rug-pull.
When Claude Code dropped /goal, Chris from Build Great Products spotted the obvious: Anthropic took OpenAI's playbook. But instead of debating the politics, he did what builders do -- he ran both agents in parallel on the same 62-task product roadmap and let the output speak for itself.
Named ideas worth stealing.
/goal condition requirements
- One measurable end state
- A stated check Claude can verify (e.g., run test, check git status)
- Constraints that matter -- what must NOT change on the way there
The three components every strong /goal condition needs. Condition can be up to 4,000 characters.
Spec doc trinity
- PRD -- product requirements document
- product-roadmap.md -- 40-80 task checklist with verifiable milestones
- design.md -- visual direction (Google open-source format)
The three files Chris puts in every project before running /goal. Together they give the agent context, end condition, and design guardrails -- neutralizing model-specific quirks.
Good goal size rule
A good goal is bigger than one prompt but smaller than an open-ended backlog. It defines what to achieve, what to change, how to validate progress, and when to stop.
How they asked for the click.
“if you are building with AI and you wanna build real applications and real software and launch them to real paying customers, then I got a community helping people do just that over at school.com/aiapps”
Clean verbal CTA at the end; no mid-roll sponsor. Secondary CTAs for PLAID skill and the /goal guide woven into the tutorial naturally.









































































