Modern Creator Network
Mark Kashef · YouTube · 23:17

Claude Code Agent Teams Explained (Complete Guide)

A 23-minute illustrated walkthrough of how agent teams work in Claude Code, when to use them over sub-agents, and how to build a live surveillance dashboard to monitor your fleet.

Posted
3 months ago
Duration
Format
Tutorial
educational
Channel
MK
Mark Kashef
§ 01 · The Hook

The bait, then the rug-pull.

Before Mark Kashef says a word, you are looking at something most Claude Code users have never seen: a real-time dashboard showing a team lead delegating to a designer and a developer, messages streaming in, a Kanban board updating live. Instead of just looking at JSON files or markdown files, he says, I built a system to give me full surveillance. That dashboard is the promise of the entire video.

§ · Stated Promise

What the video promised.

stated at 01:15I'm gonna walk you through how the agent team infrastructure works so you can leave this video understanding where to use it, when to use it, and how to actually surveil your agents.delivered at 23:17
§ · Chapters

Where the time goes.

00:0001:15

01 · Live dashboard demo

Shows the custom localhost surveillance UI: team lead + designer + developer Kanban board, real-time message streaming, history tab.

01:1501:52

02 · Token cost warning

Do not use teams frivolously. Only for complex recurring tasks. Token burn is the main risk.

01:5203:17

03 · Setup: enabling the feature flag

Set CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 in ~/.claude/settings.json. Cheat code: paste Anthropic docs into Claude/Warp and let it configure everything.

03:1704:23

04 · Verifying access

Ask Claude do you know how to spin up agent teams? If it lists TeamCreate, TaskUpdate, SendMessage, TeamDelete tools, you are configured correctly.

04:2305:09

05 · Model selection

Opus 4.6 for team lead, Sonnet 4.5 for teammates. Rule: create a team, use Sonnet for each teammate.

05:0905:36

06 · Writing the invocation prompt

Magic words: create an agent team to review PR 142, spawn three reviewers. Define roles explicitly.

05:3607:10

07 · How teams work: architecture

Team lead breaks down work, spawns teammates (each a full separate Claude Code session with its own context window), coordinates via shared task board.

07:1008:54

08 · The messaging system

Agents DM each other directly via inbox JSONs. Frontend agent notifies backend; both report up to team lead. Full lifecycle: 80K tokens for a one-shot web page.

08:5410:48

09 · Display modes

In-process mode (default, all in one terminal) vs. split-pane mode (each agent in its own TMux pane). Custom dashboard is the third option.

10:4812:35

10 · Sub-agents vs. agent teams

Sub-agents = hub-and-spoke (report to you, never talk to each other). Agent teams = mesh (peer DMs, shared task board, can challenge each other).

12:3515:13

11 · Four use cases for agent teams

Parallel code review (stuck-at-85% projects), cross-layer features (frontend/backend/database), debugging hypotheses (competing theories), research committees.

15:1316:37

12 · Applied: building auth with three agents

UI agent scaffolds login page, API agent creates endpoint, database agent creates users table. Chain completes to a working auth system.

16:3717:31

13 · The decision flowchart

Do agents need to talk to each other? If no, use sub-agents. If yes, is the task complex enough? If yes, use agent teams.

17:3118:56

14 · Gotchas and pro tips

Token cost 100K-300K per run. Agents can overwrite the same file. No session resume. Lead sometimes codes instead of delegating. Use Delegate Mode, require plan approval, aim 5-6 tasks per teammate.

18:5621:40

15 · File storage map and messaging protocol

~/.claude/teams/<name>/config.json (who is on the team), inboxes/<agent>.json, tasks.json. Protocol types: task_assignment, task_completed, shutdown_request, plan_approval_request.

21:4023:17

16 · The surveillance skill demo

Dashboard is a Claude Code skill: say surveil my agents and it spins up localhost:3847 with live + history tabs, Kanban board, inbox threads, SQLite persistence.

§ · Storyboard

Visual structure at a glance.

live dashboard
hooklive dashboard00:00
prerequisites
setupprerequisites01:15
feature flag
valuefeature flag01:52
model selection
valuemodel selection04:23
team architecture
valueteam architecture05:36
messaging system
valuemessaging system07:10
sub-agents vs teams
valuesub-agents vs teams10:48
use cases
valueuse cases12:35
decision flowchart
valuedecision flowchart16:37
gotchas & pro tips
valuegotchas & pro tips17:31
file storage map
valuefile storage map18:56
surveillance demo
ctasurveillance demo22:20
§ · Frameworks

Named ideas worth stealing.

16:37model

The Decision Flowchart

  1. Do agents need to talk to each other?
  2. If no: use sub-agents
  3. If yes: is the task complex enough to justify token overhead?
  4. If no: use sub-agents
  5. If yes: use agent teams

Two-question decision tree for choosing between sub-agents and agent teams. Default to sub-agents; upgrade to teams when communication between agents is the bottleneck.

Steal forPrompt template, Claude Code cheat sheet, short-form explainer
12:00concept

Hub & Spoke vs Mesh

Sub-agents form a hub-and-spoke network (you are the hub). Agent teams form a mesh network where any agent can message any other, enabling competing hypotheses and cross-layer contracts.

Steal forAny content explaining agent architecture tradeoffs
04:23concept

Model Allocation Rule

  1. Team lead: Opus 4.6 (highest complexity, manages others)
  2. Teammates: Sonnet 4.5 (fast, saves tokens)
  3. Long sessions: Opus 4.6 1M for extended context

Assign most capable model to the lead; use cheaper/faster models for workers. Same logic as hiring: senior lead, junior executors.

Steal forAgent architecture guide, pricing explainer
21:40concept

Agent Surveillance Skill

A Claude Code skill that spins up a localhost dashboard (always port 3847) showing live agent activity via SQLite-backed streaming of inbox JSONs and task boards.

Steal forJoeFlow agent monitoring, any orchestration UI
§ · Quotables

Lines you could clip.

01:00
Unlike before with sub agents, they worked in parallel, basically would give you the TLDR of the result at the very end, but there's no interplay in between the different agents.
Clean contrast statement, no setup needed, lands the key product distinctionIG reel cold open
18:06
When in doubt, start with subagents. Upgrade to teams when you hit communication walls.
Standalone rule, zero context needed, highly quotableTikTok hook / Twitter pull-quote
18:33
You could spend anywhere between 100 to 300,000 tokens just spinning up and executing an agent team.
Concrete number, practical warning, builds credibilityNewsletter pull-quote
15:12
It's literally like having a possible committee go through and vote on what the problem is to get to the bottom of it.
Memorable metaphor, no setup neededIG reel cold open
18:33
The team lead could end up coding instead of delegating. Now you could see it stepping on the toes of its subemployees.
Humanizes the failure mode with workplace languageShort-form hook
§ · Pacing

How they spent the runtime.

Hook length75s
Info densityhigh
Filler3%
§ · Resources Mentioned

Things they pointed at.

09:24toolTMux
07:10toolVibe Kanban (open source)
§ · CTA Breakdown

How they asked for the click.

22:20link
If you want access to all the diagrams along with a prompt you can use to start your journey of creating your own skill for your own agent dashboard, check the second link in the description. If you want access to my skill itself, that is in my exclusive early AI adopters community.

Two-tier offer: free diagrams + prompt in description, paid skill behind community paywall. Clean and non-pushy.

§ · The Script

Word for word.

metaphoranalogy
00:00So in this video, I'm gonna walk through a deep dive on how the agent teams from Anthropix new Opus 4.6 release work in-depth. So as you can see on my screen here, I have my agent team running. And instead of just looking at JSON files or markdown files, I actually built a system on top of it to give me full surveillance as to what's happening in real time.
00:22So we can see here is I'm spinning up a web page, and you have a designer actively working on the task assignment, and all of the discourse between that agent to us and the agent to the team lead is fully transparent and something that you can audit along the way. And you can see that we go from designer to developer, and each one waits for its turn.
00:43Unlike before with sub agents, they worked in parallel, basically would give you the TLDR of the result at the very end, but there's no interplay in between the different agents. Here, we can actually click, see the entire discourse, the description, and if there are conversations between them, kind of like sending an email to each other or messages to each other on Telegram, we'll be able to audit that with this infrastructure as well.
01:05And what's cool is you can click on the history tab and go through any of the prior sessions we had along with the associated messages. You can audit what happened and why. So with that little teaser out of the way, I'm gonna walk you through how the agent team infrastructure works so you can leave this video understanding where to use it, when to use it, and more importantly, how to actually surveil your agents.
01:27And the reason why is this agent's feature is amazing, but it swallows up tokens like a vacuum in a very short amount of time. And last thing is you wanna make sure that you're not using this feature frivolously, not using it just because it looks cool, but because you actually have a complex task where you need a series of agents with ideally a sequence in order to execute on a recurring basis.
01:49And to enable the agents team specifically, what you have to do is enable the feature flag. Now if you are nontechnical, I have a cheat code for you.
01:58So instead of going to navigate this Claude settings dot JSON folder and worrying about what the heck you're doing, what you could do is literally go to the docs page on orchestrating these Claude code sessions and click on copy page. And then my cheat code is literally to give this to Claude code or to something like Warp, which is like a smart terminal to go and set everything that I need so I can just open Cloud Code and it's there.
02:22So as an example, I can go to Warp, then I could say, go read and double check that we have this installed correctly.
02:32I'll literally just throw that all in there. This will go through and just double check that our settings are where they need to be so you don't have to worry about putting this feature flag, all the agent team stuff, and you can even ask something like Cloud Code or Warp to help you with a proper prompt because the invocation of these teams is typically in this form where you say, I'm designing x, and I wanna be able to create an agent team or spawn an agent team.
02:59If you say something semantically similar to that, then that should suffice. If you wanted to make it even easier, then you could just give the URL right here and just say, read this document. It will come back with an understanding of the architecture, the feature flag, everything that you'll need, and you can just say, listen.
03:15Go and implement this all in my system. And once you're good and good to go from there, you can just restart Cloud Code. It should work in a brand new session.
03:23And then ideally, you should interview it to access and ask it. Do you have access to agent teams? Once it says yes, then it'll explain what tools it has.
03:33If it doesn't say that and it assumes that teams means sub agents, that means it's not properly installed. So as a quick preview, if I go to my prior session, if I just said, do you know how to spin up an agent team right here? Then it replies saying, yes.
03:48I know how to create a team, and it should lecture you on all the tools it has. So the name of the tools include these agent types, the task updates, the team delete.
03:59So once it's ready to clean up, the way teams work is once they spin them up, they persist until the actual goal is completed or the task is done, unlike sub agents where they spin up and die, spin up and die. And at the bottom here, I have a reference to the agent surveillance skill that I show you at the beginning of the video, and I'll even walk you through the TLDR of how you can approach building such a system yourself.
04:23And for step four, the goal is to assign the best model to the right agent at the right time. So in this case, the team lead always usually deserves Opus 4.6 just because it will be managing the other sub agents as well.
04:35So the other team members will report up to the team lead to get approval, to get direction, so that's why you wanna be able to equip it with the highest complexity possible. For the additional team members, that's where it might make sense to use something like Sonnet 4.5 or even Haiku depending on the task.
04:51And similar to invoking a spell, you have to say the magic words like we saw before to say something like create an agent team to review p r one four two. This could be an example of you actually reviewing an existing code base and looking for a team of reviewers. And these reviewers will be able to look at security, at performance, and validating test coverage in a way that it can finally discuss with each other.
05:14And similar to invoking a magic spell, you have to say those magic words like we saw before and say, create an agent team to review p r one four two. In this case, you could spawn up three different reviewers, one for security, one for code quality and performance, and one for actually validating the test coverage of the other two agents.
05:32So with all this out of the way, how do the teams work in-depth? So step one is you have the team lead looking around and understanding when it makes sense to spawn a brand new teammate. Even if the generation is in flight, meaning you've already assigned and hired three agents on top of your team lead, it could still decide that for this particular task or permutation, it might need to employ a new agent.
05:57Now the main thing to keep in mind is that as you spin up more and more agents, naturally, you will take more and more tokens. So if you're using something like Opus 4.6 for all of them, you will see your usage evaporate really quickly.
06:10The theoretical mental model of spinning up different agents is having them work on completely different areas of the build where it makes sense for them to communicate. So if you're using sub agents, then what would usually happen is if you created a front end sub agent and then a back end sub agent, they would both work on their separate tasks, but there would be no communication between them.
06:31So you could run the back end after the front end, but ideally, there's a contract between the front end and the back end. So to create that contract in a way that there's cohesion from day one, ideally, they should be able to work on their own context window, comma, but be able to message each other and ask each other on direction.
06:50Because if the front end is gonna implement a brand new framework or use a library, you wanna make sure that it's compatible with your back end. And this is where this feature becomes ingenious, where you have that cross communication and cross pollination of these agents in a way that you have full transparency as to what's happening.
07:06And in a way, this shared task list replaces a lot of the trend that I've been seeing over the past few months. We have things like Vibe Kanban, an open source framework where people can put different tasks, and it auto arranges those tasks.
07:20I found it to be buggy, sometimes slow, sometimes doesn't activate at all, and technically, this is built in. So if we take a look at the agent dashboard that I showed you, I didn't actually engineer anything new here.
07:32It created all these categories right here inspired by the categories that are in the JSON files themselves. So I'm purely just visualizing what's happening and streaming the events in real time. The TLDR of the messaging system is you have the team lead who's always in the know, and then when the front end agent, agent a or agent b, are finished, they communicate with each other.
07:53So agent a will say, okay. I'm done the front end. And then the agent b will say, okay.
07:57I'm updating my endpoints. And then the front end, because they're closest to the fire, will update the team lead that it's all done. So the full life cycle becomes you say, build this app and spawn a team to help me accomplish it.
08:10The lead then spawns. Typically, it creates three to five subemployees depending on the task. These become the teammates that all have their defined roles, and then they coordinate together to create a plan.
08:21Once the plan is approved or you've kind of put it into YOLO mode or bypass permissions mode to execute it, then you essentially get to the final result. And if we take a peek here at running that web page that I showed you earlier, Once it finishes, you could see all the tokens that it's taken.
08:38So I think it's around 80,000 tokens for this web page right here. Personalized web page. I didn't give it anything.
08:46Looks pretty clean in terms of layout and format, and it does work from what I could see. And it's essentially one shot it, so pretty impressive. In terms of watching the agents do the work, you obviously have options.
08:58Now some people like to use this framework that's called TMux, and TMux allows you to essentially have a terminal with a main window, and then you could see the sub terminals open up where you can audit and see what's happening with the agents. The reason I made this UI is, one, to make it universal, and number two, as a dev, when I was walking through and looking at it, I just noticed that things are moving so quickly that I'd have to constantly scroll through the history to properly monitor what's happening.
09:25So completely up to you. If you wanna actually install it yourself, then you can use something like Warp. And the way I would actually just install it is by asking it to install and update to my latest version.
09:37I ask it, what is TMux? And it walks through exactly what that is. And then at the very bottom, once you wanna install it, you could say, can you install it?
09:46Can I use it in something like cursor? Then it walks me through that it's probably not the best use case there just because of the pain management. So you can just spin it up on your own terminal itself, and you'll notice if it works if we do something like this.
09:58So if we go to terminal right here and I spin this up and let's zoom in just a tad and then we say tmux, you'll see right here at the very bottom, it has this green little footer.
10:16Then if we say this is my shortcut for YOLO mode. You'll be able to spin up the agent teams, and it'll spin up multiple panes. But for me, it was a pain to look at that, a bit of an eyesore.
10:28So that's why we gave birth to this bad boy. And this is structured in a way where it's a skill. So it's invoked just in time.
10:35It's basically memorize the structure of the dashboard and memorize where it has to reference all the different files. Now what's the difference between agent teams and sub agents? Because that's a very important thing.
10:46So in the worlds of sub agents, they report back to you, but they never talk to each other. So even though they can run-in parallel, because of that lack of communication, you'll see a lot of disfluency or different objectives or different goals for the sub agents not knowing what their other agent compatriots are up to.
11:06One interesting thing is that with OPUS 4.6, it's infinitely better now at spinning up sub agents on the fly without you even asking for it to do things like explore your code base to actively, proactively preserve your context window as much as possible.
11:21So there's still a huge place for sub agents in the mix here. It's just a matter of when to use them. And for now, what I can see, when it comes to prepping, exploring, researching, doing tasks that are very admin in nature, it seems like sub agents is the way to go.
11:37Even though they also take a decent number of tokens, if it comes to code exploration, it will preserve your core context window, which really matters if you wanna be able to execute a team, execute sub agents, and still get everything done without having to compact your conversation. So if you needed one more diagram to drive this point home, with sub agents, you are the monkey in the middle right here getting all the discourse, all the updates, and you're the one who has to manage what has to happen next.
12:04In the world of agent teams, the team lead is the monkey, maybe not in the middle, monkey at the top, and then they receive all the inputs and questions and need for guidance from all the other teammates. Now when does it make sense to spend the tokens to use agent teams?
12:21Now these are four use cases that I've tried personally. I'm gonna keep experimenting and sharing what I find, but for now, parallel code review. If you've already vibe coded something and you're at the 85% mark and you're stuck, there's something not working.
12:36I personally have a project where I'm creating a version of OpusClip. It's a clone of it. I've been working on it for two months.
12:43I've been stuck on a core set of features that no matter how much I try or intervene using my own dev background, I can't get it over the line consistently. So I had it review the code base, and it found three or four areas where it had a series of duplicate functions or functions stepping on each other's toes that really muddied my code base.
13:03So having not just one extra set of eyes on it, but three or four helps you really orient yourself and better understand where to go next. The next, like I showed you, are cross layer features. So if you wanna build a web application or something on Next.
13:18Js and you have a front end, you have a back end, you have a database, and you have a series of features, this is where it makes a lot of sense to try to one shot the 80% with the team, maybe drive it home yourself. And the last two are debugging any hypotheses.
13:33So if you've watched my prior video before this even existed, I tried to create my own agent team by having multiple sub agents share a markdown file that they would use as a diary. So now that we live in this world, we don't technically need that strategy anymore unless you wanna preserve tokens at all cost. We can still use that method to do nontechnical tasks, create an entire brainstorming network.
13:57You could create a RFP generation network, proposal network. You can do all kinds of nontechnical things using this feature. And when it comes to research, a lot of people default to using things like Google deep research or Perplexity.
14:11Behind the scenes, you could implement a research committee. We have different agents go and research different parts but communicate their findings in real time.
14:20So you can imagine things like scientific discovery will become increasingly more possible. And when it comes to sub agents, it's really useful for quick research. We can just run it in parallel, a quick code exploration to see where things are or where certain files are, file operations where you wanna use lower token costs.
14:38So maybe you spin up four or five sub agents that use Haiku to do a very basic task. But you know at the end of the day that those are just being executed. They're being checked off a to do list, but there's no value add of adding more agents to the network in terms of additive knowledge.
14:55So here's a tactical applied scenario of, let's say, debugging a code base. So you could have agent one in a agent team decide that there's a memory leak of some sort. And then agent two could say, no.
15:07It's a race condition. And if you have no idea what those words mean, don't worry about it. Maybe just exploring some concepts.
15:13And the agent three could act as the devil's advocate, where it could say, you know what? You're wrong and you're wrong. I actually think it's something completely different.
15:22And then they work together. They have their own consensus until they get to the final result. So it's literally like having a possible committee go through and vote on what the problem is to get to the bottom of it.
15:34Now this debate could also happen with sub agents, but like we said before, they couldn't directly fight each other. When it comes to spinning up an agent team related to building a feature like authentication, you could end up with a UI agent, an API agent, and a database agent.
15:51And the way it would work is the first UI agent would design the scaffolding for the page, and they would realize that it needs an API, an API to actually allow the login. So this goes to the API agent that creates the API and sends some form of response to double check that it works, and then it realizes it needs users table with an email and a hash.
16:12So it needs some support to bring this API to life so that when you actually log in, this actually does something. So then the database agent gets to work. It creates this new users table right here, and it says it's ready.
16:24So then it goes back to the API agent, then it can go back and say, listen. We're good to go. It's time to actually test this out.
16:32If you want a quick heuristic on when to use sub agents versus agent teams, if you have a brand new task, the easiest question you could ask yourself is do you need agents to speak to each other? If the answer is no, then you can use sub agents.
16:45Otherwise, you wanna ask the next question. Is the task complex enough to justify the overhead of token slash cost of tokens? If the answer is yes, then use agent teams.
16:56But there is a world here where you say no to all of them, and you just have a normal session with no agents at all. The mentality I want you to adopt is if you are a bootstrapped founder of a startup and you have a very fine amount of money. Now depending on what plan you're on, that could be a very good analogy.
17:14But if you are bootstrapped, you have to be very picky about when it makes sense to actually hire your first employee or employees unless you raise some funding. But unless you have raised funding for your Anthropic subscription, I doubt that's the case.
17:27So you just wanna be as selective and picky and responsible as possible. The pros, just starting off with that, is you can require plan approval for any risky changes. So you could still tell Claude that this is what you want in terms of a framework.
17:41You can ideally aim to have five to six different agents. Beyond that, from what I found, is diminishing returns. Same thing with sub agents.
17:50And then you could start all of them working on a research task before they all work on an actual execution task. And in terms of the gotchas, like I said, the token cost is very real, and it will add up very quickly. You could spend anywhere between a 100 to 300,000 tokens just spinning up and executing an agent team.
18:09The next thing is even if you have separated roles, I've found that agents can overwrite the same file. Now it might make sense for them to overwrite it, but once in a while, you'll notice that one will do a great job and the other will come in and change just a few things that make it either unusable or it added bloat unnecessarily.
18:28The last thing is unless you create a way to respin up the exact same agents with the same instructions, every time it shuts down, you won't be able to bring that back up. And last but not least, the lead of the team could end up coding instead of delegating.
18:41So, ideally, you want your head team lead to be looking at the big picture and looking at what everyone's doing and making sure it accomplishes the goal. But once in a while, you could see it coding itself. So now you could see it stepping on the toes of its subemployees.
18:57Now hopping back into the terminal, one thing you can do to really visualize this entire system of where the messages lie, where the inboxes exist, is ask it for an ASCII art to visualize how this feature and function works. Now when I say this feature, I already referenced above the agents team feature. So it walks you.
19:14It says this you are the human, and you have a team lead. These are the functions that come with a team lead. So you have team create, task create, task tool, send message, then it walks you through how they share a file system.
19:27And then teammate coder will read the inbox, claim tasks. Teammate research will read the inbox as well, and so will the tester. And then in terms of file storage, it breaks down exactly where all this exists, which is why I always get confused while other creators confuse you with different folders and JSON files in a black terminal because it can be intimidating if you're nontechnical.
19:49You can just ask it to visualize it. And you can see right here, we have a config JSON that tells you who's on the team. Then you have all the agent mailboxes, which is one of the many ways we created that visual to monitor everything.
20:02All I said was, hey. Can you help me create a skill that when I say the words surveil my agents, we'll spin up the agent team and then look at the inboxes, look at the JSON files, and just stream whatever's there onto a local host UI. And ideally, make it so that you can spin up the same local host every single time so you don't have to go and search for a port.
20:24And then when you get to tasks, all of these are also structured. So the shared tasks board where something's pending, completed, that basically, that Kanban essence is already native to this structure.
20:35So with all of that in mind, you can see right here, this is my skill, my dashboard database using what's called SQLite, a free database that can live on your computer.
20:44No need for super base or anything else. In terms of where the information flows, this shows you exactly that the team lead will send a message directly to the inbox of the researcher. So it's at dot claud slash teams slash dynamic name of your team, inboxes, researcher dot JSON, then the agent waits to see when the next message arrives.
21:06Once it reads it, it tells us that it's read it. So if you think of WhatsApp or Telegram, when you see those two check marks and they're blue, then it knows you've read it. In this case, you have the exact same system.
21:17And then you have the task life cycle where you create a task, update said task, and then you delete the task when it's completed. And then the protocol messages look like this. So, hey.
21:28Can you review my PR? Like an actual human. Then you have the overall protocol, the protocol types, and this breaks down my surveillance dashboard.
21:37So it always will spin up on local host 3847. You can see live and history tabs, and you have a Kanban board and the inbox threats. If you watch it in action, it's literally as simple as me saying spin up an agent team to build a web page and surveil them using your special skill.
21:53Again, I told it in the skill itself that it should surveil the team whenever I say those words. So it knows to load that skill right here, then organizes itself, comes up with the three agents and the sub builders. It walks through how the pipeline will be designed.
22:09It executes the tasks. And while this is all happening, it's streaming on the main web page that I showed you.
22:17So once this is all done, then we're good to go. And the best part is it auto shuts down the session. And when it auto shuts down the session, that's where you can see everything go in the history tab.
22:27So you can see right here, this is the ended session of the web build right here, and we can always go back and look at the messages, tasks, etcetera. And that's pretty much it. So, hopefully, now you have a better understanding of when to use sub agents, how they work, and where it makes the most sense versus using something like sub agents or nothing at all.
22:45If you want access to all the diagrams I showed you along with a prompt that you can use to start your journey of creating your own skill for your own agent dashboard, I'll make them both available to you in the second link in the description below. But if you want access to my skill that I spent a couple hours putting together to make sure it works at least nine out of 10 times, then you're gonna wanna check that out in my exclusive early AI adopters community.
23:07And last but not least, if you found this video helpful and educational, would be super helpful for you to leave a comment and a like on the video. It really helps the video, really helps the channel, and I'll see you all next
§ · For Joe

Steal the surveillance skill.

JoeFlow / agent orchestration playbook

The real unlock here is not the agent teams feature itself — it is the idea of building a Claude Code skill that makes your agent fleet legible in real time.

  • The decision flowchart (talk to each other? complex enough?) is a ready-made prompt template — steal it verbatim.
  • Model allocation rule is concrete and actionable: Opus for the lead, Sonnet for every teammate. Use this in any agent prompt you write.
  • The surveillance skill pattern (keyword phrase -> localhost dashboard always on the same port) is directly applicable to JoeFlow Sessions panel.
  • The gotchas section (overwrite conflicts, no session resume, lead coding instead of delegating) is a standalone short-form video waiting to happen.
  • Mark's two-tier CTA (free diagrams + prompt in description, paid skill behind community paywall) is a clean offer architecture worth borrowing.
§ · For You

When to actually use agent teams.

The practical decision rule

Agent teams cost real money (100K-300K tokens per run) — use them only when agents genuinely need to negotiate with each other.

  • Ask one question before spinning up a team: do the agents need to talk to each other? If no, sub-agents are cheaper and simpler.
  • The right use cases are: parallel code review on a stuck project, building features that span frontend + backend + database, and debugging where you want competing hypotheses.
  • Start every new project with sub-agents. Upgrade to teams only when you hit communication walls.
  • Use Opus 4.6 for the team lead, Sonnet 4.5 for workers — this alone can cut token cost significantly.
  • Budget 5-6 tasks per teammate maximum; beyond that, returns diminish fast.
§ · Frame Gallery

Visual moments.