Anthropic Just Dropped a Masterclass on Building Agent Harnesses (for Large Codebases)
Cole Medin turns Anthropic's high-level blog post into a working repo — seven concrete components for the AI Layer that wraps Claude Code.
May 21stA 14-minute demo of the open-source tool that lets Claude Code, Codex, and Pi work together under one orchestration layer.
The bottleneck in AI coding is no longer the model but the absence of a clean orchestration layer that assigns each sub-task to the agent best suited for it, and Omnigent solves that with a single open-source install.
Top engineers no longer commit to a single AI coding provider -- they route each job to the best agent for it. Omnigent is the open-source meta-harness that makes this practical: one install, one server holding your system prompts, MCP servers, and guardrails, and any number of coding agents (Claude Code, Codex, Pi, custom YAML configs) you can delegate to from a single UI. The Poly orchestrator ships ready to use: it sends implementation to Claude Code in an isolated git worktree and routes the finished diff to Codex for independent review. Human-in-the-loop policies are plain Python files that live next to agent configs. Sessions persist across terminals, browsers, and phones. It was built by Databricks and is already used in their daily engineering.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
Cold open on the Omnigent UI; frames the meta-harness concept via a live Poly run that delegates implementation to Claude Code and review to Codex.

The harness matters as much as the model. The Fable 5 ban proves you cannot bet on a single LLM; the meta-harness is the reliable alternative.

One curl command installs everything. Reuses existing CLI credentials. GitHub repo walkthrough and feature overview diagram.

Poly orchestrator in action: Claude Code runs implementation in a git worktree, then Codex reviews the diff. Ten-minute setup to first working workflow.

VS Code deep-dive into the Poly config YAML: executor, system prompt, sandboxing, guardrails, tools (sub-agents), and skills.

Building a custom guarded agent: Python policy file blocks git push --force and requires human approval before proceeding.

Debby orchestrator pits Claude and GPT against each other on a question, then synthesizes the debate into a final answer.

Cross-device demo: message sent from phone appears on desktop instantly via LAN. Server hosting options covered.

Meta-harnesses are where serious AI coding is heading. Subscribe CTA.
When you cannot control which LLMs stay available, the only durable bet is building a system that can swap models without rebuilding your workflow.
“The harness matters as much as or maybe even more than the model.”
“If the LLM can't get better, then we better make the system around the LLM more powerful.”
“At least at a very fundamental level, do your code review in a separate coding agent session from your implementation. Otherwise, the LLM builds up way too much bias.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
A tool released over the weekend by Databricks changes how you think about AI coding workflows: not one agent doing everything, but an orchestration layer that routes each job to the right model. This is the walkthrough.
Every orchestrator in Omnigent is composed of these three parts. The same structure applies recursively to each individual sub-agent.
Route implementation to one model in an isolated git worktree, then route the resulting diff to a different model for review in a separate session. Prevents the implementing model from rationalizing its own decisions.
The Debby pattern: use two models as adversaries and a third as the synthesizer to stress-test a question before committing to an answer.
“If you appreciate this video and you're looking forward to more things on harness engineering and AI coding, I would really appreciate a like and a subscribe.”
Clean single ask at the end, framed around topic interest rather than generic engagement bait.
00:01
00:16
00:27
00:38
00:50
01:01
01:12
01:23
01:34
01:45
01:56
02:07
02:18
02:30
02:41
02:52
03:03
03:14
03:25
03:36
03:47
04:02
04:10
04:21
04:32
04:43
04:54
05:05
05:16
05:27
05:38
05:50
06:01
06:12
06:23
06:34
06:45
06:56
07:07
07:18
07:30
07:41
07:52
08:03
08:14
08:25
08:36
08:47
08:58
09:10
09:21
09:32
09:43
09:54
10:05
10:16
10:27
10:38
10:50
11:01
11:12
11:23
11:34
11:45
11:56
12:07
12:18
12:30
12:41
12:52
13:03
13:14
13:25
13:36
13:47
13:58
14:10
14:21
14:32
14:43Cole Medin turns Anthropic's high-level blog post into a working repo — seven concrete components for the AI Layer that wraps Claude Code.
May 21stA 28-minute practical breakdown of seven tools that attack token waste at session startup, during input, and in model output.
May 27thTheo scraps cursor, plan mode, and Claude after five months — here is exactly what replaced them.
May 27thA no-fluff synthesis of Anthropic's official best practices and Boris Cherny's personal Claude Code workflow, distilled into 15 concrete sections.
February 28thA 33-minute first-take from a developer who spent $3,000 on inference in 24 hours — benchmarks, real demos, session math, and the hidden safety intervention that silently degrades the model without telling you.
June 11thA 13-minute teardown of Claude Code memory and a fix assembled from the best pieces of Hermes, MemSearch, and GBrain.
June 10th