Omnigent: The New Meta-Harness for EVERY Coding Agent
A 14-minute demo of the open-source tool that lets Claude Code, Codex, and Pi work together under one orchestration layer.
June 15thA 25-minute honest breakdown of loop engineering — what the AI coding elite actually mean by it, why it gets expensive fast, and how to build a harness that makes it reliable.
Replacing manual prompting with an automated loop only works when the loop is constrained by a deterministic harness that routes work to the right model at each step and keeps all state in an external database, not in any agent session.
The viral claim that the best AI engineers no longer prompt their agents contains a real insight but glosses over brutal economics. Running an LLM orchestrator that decides everything uses a million tokens on a simple app, and a single-session loop bloats context until the agent collapses. The practical answer is a two-layer system: deterministic YAML workflows (Archon) handle the process logic so agents only reason when they must, different models are assigned per step by cost, and an external Postgres database keeps all state so any run can be resumed. The video demos this end-to-end with four parallel GitHub-issue fixers, Neon database branches for isolation, and an open-source dashboard that makes every agent decision visible.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
Peter Steinberger's 8.3M-view tweet plus Boris Cherny's Fortune article set up the premise. Cole signals honest skepticism and promises a practical three-part breakdown.

Live demo of /loop, /goal, and /routines in Claude Code. Shows the agent writing its own /loop prompt and working through a PLAN.md task list one item per cycle.

Three problems: quality ceiling, token cost explosion (1M+ tokens for a simple app), and context bloat in same-session loops.

Archon YAML DSL: each step runs in its own isolated agent session. Bash steps are deterministic. Different models per node — Haiku for classify, Claude for implement, Codex for review. Human-in-the-loop gates.

One orchestrator Claude Code session dispatches four Archon workflows in parallel, each in a git worktree plus Neon branch. Validates PRs then launches four review workflows.

Agent Control Plane demo: TypeScript app, Kimi K2 orchestrator, Neon Postgres state store. ExcaliDraw diagram shows state-outside-model architecture. Live Kanban board built over 16 rounds.

Retool (sponsored) used to deploy the React dashboard to a cloud URL with permission groups, audit trails, and human-in-the-loop approval gates.

Closes with Agents Prompting Agents diagram and the reframe: call it harness engineering, not loop engineering.
Autonomous agent loops only pay off when the process logic is deterministic, the state is stored outside the model, and different models handle different cost tiers of work.
“I don't prompt Claude anymore. I write loops, and the loops do the work. My job is to write loops.”
“Loop engineering is really not that complicated. I don't even know if it deserves its own term.”
“You don't always need to spend the most per token for every step of your workflow.”
“I would just fold loop engineering into harness engineering. It doesn't quite deserve its own buzzword.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
A viral tweet from OpenClaw's creator and a Fortune headline about the head of Claude Code both landed the same week: the best AI engineers have stopped prompting their agents. They write loops instead. Cole Medin watched both, built the thing himself, and came back with a more honest report than either headline delivered.
The three built-in Claude Code slash commands that implement loop engineering natively.
The three structural problems that make naive loop engineering impractical for real production work without a harness.
Assign cheap small models to classify/explore steps and frontier models only to implementation and review. Reduces per-workflow cost without sacrificing output quality.
All task state lives in an external database. Each agent round starts by reading fresh state, so context windows never fill up regardless of loop length. Enables resume-after-crash and team observability.
“If you go now, you get free app imports through July 1 and bonus AI credits on all paid plans.”
Sponsor read integrated into a live workflow demo (Retool) so it demonstrates real utility rather than interrupting.
00:01
00:18
00:46
01:04
01:23
01:40
02:00
02:18
02:37
02:55
03:14
03:32
03:51
04:09
04:28
04:46
05:05
05:23
05:42
05:59
06:18
06:37
06:55
07:14
07:32
07:51
08:09
08:28
08:46
09:05
09:23
09:42
10:00
10:19
10:37
10:56
11:14
11:33
11:51
12:10
12:28
12:47
13:05
13:24
13:42
14:01
14:19
14:38
14:56
15:15
15:33
15:52
16:10
16:29
16:47
17:06
17:24
17:43
18:01
18:20
18:38
18:56
19:15
19:33
19:52
20:10
20:29
20:47
21:06
21:24
21:40
22:01
22:12
22:32
22:50
23:22
23:34
23:52
24:11
24:29A 14-minute demo of the open-source tool that lets Claude Code, Codex, and Pi work together under one orchestration layer.
June 15thHow a new viral tweet revealed the next tier of AI engineering: designing loops that prompt your agents, so you never have to.
June 9thA plain-English field guide to every loop type — heartbeat, cron, hook, and goal — with two live builds in Claude Code and Codex.
June 17thA step-by-step guide to turning one Claude Code session into a coordinated team of specialized agents that remember your preferences and improve over time.
April 1stAn 8-step agentic pipeline that takes you from naive AI slop to a pixel-near Linear replica, deployed to Vercel with an MCP server, in under 20 minutes.
June 8thA 28-minute field guide to the setup decisions that separate Claude Code power users from people still using it like a chatbot.
June 12th