The bait, then the rug-pull.
Before Mark Kashef says a word, you are looking at something most Claude Code users have never seen: a real-time dashboard showing a team lead delegating to a designer and a developer, messages streaming in, a Kanban board updating live. Instead of just looking at JSON files or markdown files, he says, I built a system to give me full surveillance. That dashboard is the promise of the entire video.
What the video promised.
stated at 01:15“I'm gonna walk you through how the agent team infrastructure works so you can leave this video understanding where to use it, when to use it, and how to actually surveil your agents.”delivered at 23:17
Where the time goes.

01 · Live dashboard demo
Shows the custom localhost surveillance UI: team lead + designer + developer Kanban board, real-time message streaming, history tab.

02 · Token cost warning
Do not use teams frivolously. Only for complex recurring tasks. Token burn is the main risk.

03 · Setup: enabling the feature flag
Set CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 in ~/.claude/settings.json. Cheat code: paste Anthropic docs into Claude/Warp and let it configure everything.

04 · Verifying access
Ask Claude do you know how to spin up agent teams? If it lists TeamCreate, TaskUpdate, SendMessage, TeamDelete tools, you are configured correctly.

05 · Model selection
Opus 4.6 for team lead, Sonnet 4.5 for teammates. Rule: create a team, use Sonnet for each teammate.

06 · Writing the invocation prompt
Magic words: create an agent team to review PR 142, spawn three reviewers. Define roles explicitly.

07 · How teams work: architecture
Team lead breaks down work, spawns teammates (each a full separate Claude Code session with its own context window), coordinates via shared task board.

08 · The messaging system
Agents DM each other directly via inbox JSONs. Frontend agent notifies backend; both report up to team lead. Full lifecycle: 80K tokens for a one-shot web page.

09 · Display modes
In-process mode (default, all in one terminal) vs. split-pane mode (each agent in its own TMux pane). Custom dashboard is the third option.

10 · Sub-agents vs. agent teams
Sub-agents = hub-and-spoke (report to you, never talk to each other). Agent teams = mesh (peer DMs, shared task board, can challenge each other).

11 · Four use cases for agent teams
Parallel code review (stuck-at-85% projects), cross-layer features (frontend/backend/database), debugging hypotheses (competing theories), research committees.

12 · Applied: building auth with three agents
UI agent scaffolds login page, API agent creates endpoint, database agent creates users table. Chain completes to a working auth system.

13 · The decision flowchart
Do agents need to talk to each other? If no, use sub-agents. If yes, is the task complex enough? If yes, use agent teams.

14 · Gotchas and pro tips
Token cost 100K-300K per run. Agents can overwrite the same file. No session resume. Lead sometimes codes instead of delegating. Use Delegate Mode, require plan approval, aim 5-6 tasks per teammate.

15 · File storage map and messaging protocol
~/.claude/teams/<name>/config.json (who is on the team), inboxes/<agent>.json, tasks.json. Protocol types: task_assignment, task_completed, shutdown_request, plan_approval_request.

16 · The surveillance skill demo
Dashboard is a Claude Code skill: say surveil my agents and it spins up localhost:3847 with live + history tabs, Kanban board, inbox threads, SQLite persistence.
Visual structure at a glance.
Named ideas worth stealing.
The Decision Flowchart
- Do agents need to talk to each other?
- If no: use sub-agents
- If yes: is the task complex enough to justify token overhead?
- If no: use sub-agents
- If yes: use agent teams
Two-question decision tree for choosing between sub-agents and agent teams. Default to sub-agents; upgrade to teams when communication between agents is the bottleneck.
Hub & Spoke vs Mesh
Sub-agents form a hub-and-spoke network (you are the hub). Agent teams form a mesh network where any agent can message any other, enabling competing hypotheses and cross-layer contracts.
Model Allocation Rule
- Team lead: Opus 4.6 (highest complexity, manages others)
- Teammates: Sonnet 4.5 (fast, saves tokens)
- Long sessions: Opus 4.6 1M for extended context
Assign most capable model to the lead; use cheaper/faster models for workers. Same logic as hiring: senior lead, junior executors.
Agent Surveillance Skill
A Claude Code skill that spins up a localhost dashboard (always port 3847) showing live agent activity via SQLite-backed streaming of inbox JSONs and task boards.
Lines you could clip.
“Unlike before with sub agents, they worked in parallel, basically would give you the TLDR of the result at the very end, but there's no interplay in between the different agents.”
“When in doubt, start with subagents. Upgrade to teams when you hit communication walls.”
“You could spend anywhere between 100 to 300,000 tokens just spinning up and executing an agent team.”
“It's literally like having a possible committee go through and vote on what the problem is to get to the bottom of it.”
“The team lead could end up coding instead of delegating. Now you could see it stepping on the toes of its subemployees.”
How they spent the runtime.
Things they pointed at.
How they asked for the click.
“If you want access to all the diagrams along with a prompt you can use to start your journey of creating your own skill for your own agent dashboard, check the second link in the description. If you want access to my skill itself, that is in my exclusive early AI adopters community.”
Two-tier offer: free diagrams + prompt in description, paid skill behind community paywall. Clean and non-pushy.
Word for word.
Steal the surveillance skill.
The real unlock here is not the agent teams feature itself — it is the idea of building a Claude Code skill that makes your agent fleet legible in real time.
- The decision flowchart (talk to each other? complex enough?) is a ready-made prompt template — steal it verbatim.
- Model allocation rule is concrete and actionable: Opus for the lead, Sonnet for every teammate. Use this in any agent prompt you write.
- The surveillance skill pattern (keyword phrase -> localhost dashboard always on the same port) is directly applicable to JoeFlow Sessions panel.
- The gotchas section (overwrite conflicts, no session resume, lead coding instead of delegating) is a standalone short-form video waiting to happen.
- Mark's two-tier CTA (free diagrams + prompt in description, paid skill behind community paywall) is a clean offer architecture worth borrowing.
When to actually use agent teams.
Agent teams cost real money (100K-300K tokens per run) — use them only when agents genuinely need to negotiate with each other.
- Ask one question before spinning up a team: do the agents need to talk to each other? If no, sub-agents are cheaper and simpler.
- The right use cases are: parallel code review on a stuck project, building features that span frontend + backend + database, and debugging where you want competing hypotheses.
- Start every new project with sub-agents. Upgrade to teams only when you hit communication walls.
- Use Opus 4.6 for the team lead, Sonnet 4.5 for workers — this alone can cut token cost significantly.
- Budget 5-6 tasks per teammate maximum; beyond that, returns diminish fast.











































































