Claude Opus 4.8 Built an Agentic OS in 15 Minutes
How one creator handed Claude a game plan, walked away, and came back to a working agentic OS.
May 31stHow the worker-plus-evaluator loop actually works, why most devs will write it wrong, and the good-condition pattern that makes it finish for real.
The /goal command turns Claude Code into an autonomous loop that runs until a second model declares the condition met from the transcript alone, which means vague conditions pass on hallucinations and specific proof-naming conditions do not.
/goal sets a condition and spins up worker agents (Sonnet or Opus) that loop until a separate Haiku evaluator declares the condition met from the text transcript alone. A vague condition will pass the evaluator even on a hallucination; conditions must name the proof method (toggle clicked, localStorage persists, screenshots reported). Cap turns with stop-after-N-turns to prevent runaway token burn. Use /goal for multi-turn feature work with a verifiable end state; skip it for simple tasks or subjective work.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
Goal command shipped 48 hours ago. Video covers how it works, live demos, and when to use it.

/goal, /goal clear, turn limit flag, token usage warning.

Flowchart: user sets condition, worker agents execute, evaluator (Haiku) reads transcript only, loop continues until condition met or goal cleared.

Circular elapsed countdown timer built in 9 minutes, passed evaluator on first turn.

Full new AI chat page with LLM agent, candidate matching, draft emails, built in 5 minutes with Dribbble reference.

Bad: dark mode works. Good: toggle clicked, data-theme flips, localStorage persists, CSS variable updates, screenshots reported. 4000-char cap.

Use: multi-turn, verifiable end state, migration/refactor, headless cleanup. Skip: simple tasks, subjective work, conditions Claude cannot prove. Use /loop for polling.

Subscribe for AI workflows and latest tech news.
The evaluator that grades your /goal reads only a text transcript and has no tool access, so the quality of your condition determines whether it finishes correctly or passes on a hallucination.
“It's important to note that this actual agent can't inspect the work. It just has the transcript.”
“The end state alone is not enough. Just saying dark mode is not it.”
“Write conditions that name the proof method.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
Forty-eight hours after shipping, the /goal command already has a clear failure mode baked in -- and the video opens by naming it before most developers have even typed the command once.
The evaluator can only read the transcript, so conditions must give workers something provable to produce.
A two-column decision matrix for routing work to the right Claude Code command.
“If you learned anything, you are obligated to like. Subscribe for more AI workflows like these and the latest news in the tech world.”
Relaxed and light -- the obligated-to-like framing is a low-pressure joke rather than a hard ask.
00:00
00:08
00:14
00:20
00:26
00:32
00:38
00:44
00:49
00:55
01:01
01:07
01:13
01:19
01:22
01:31
01:36
01:42
01:48
01:54
02:00
02:06
02:12
02:18
02:23
02:29
02:35
02:41
02:47
02:53
02:59
03:05
03:10
03:16
03:22
03:28
03:34
03:40
03:46
03:52
03:57
04:01
04:09
04:15
04:21
04:30
04:33
04:39
04:44
04:50
04:56
05:02
05:08
05:14
05:20
05:26
05:31
05:37
05:43
05:49
05:55
06:01
06:07
06:13
06:18
06:24
06:30
06:36
06:42
06:48
06:54
07:00
07:05
07:11
07:17
07:23
07:29
07:35
07:41
07:47How one creator handed Claude a game plan, walked away, and came back to a working agentic OS.
May 31stA 9-minute demo showing how native cron sessions turn any Claude Code skill into a self-healing, self-improving background agent.
March 7thA 12-minute walkthrough of three agentic workflows built around one principle: a second verification pass is what turns a fast answer into one you can trust.
June 1stA 15-minute first-look at Claude Code's new Workflow tool — the code-as-orchestrator pattern that kills the token tax on multi-agent pipelines.
May 22ndA 23-minute systems teardown comparing Claude Code default memory, MemSearch, and Hermes -- then synthesising the hybrid setup that beats all three.
May 16thA 21-minute walkthrough of spinning up three competing AI agents that debate your ideas and write their reasoning to a shared file.
January 23rd