Modern Creator
Cal Hyslop · YouTube

Your CoWork Agents Are Probably Underbuilt. Here's What's Missing.

A 9-minute teardown of the three structural gaps that make most AI agents flaky and exactly how to close them in ten minutes.

Posted
2 days ago
Duration
Format
Tutorial
educational
Views
1K
66 likes
Big Idea

The argument in one line.

AI agents do not fail because the AI is bad -- they fail because most people give the AI a task without a job description, no context about who they are, and no instructions for what to do when things go wrong.

Who This Is For

Read if. Skip if.

READ IF YOU ARE…
  • You have set up at least one Claude CoWork agent and found that it works inconsistently or stopped using it after a few runs.
  • You write prompts but have not written a standing job description with quality standards, edge-case rules, and fallback behavior.
  • You want your agents to sound like you and reflect your voice, audience, and non-negotiables without re-explaining every session.
  • You are a solo creator, educator, or professional using CoWork for recurring tasks like research briefs, content drafts, or morning summaries.
SKIP IF…
  • You are building multi-agent pipelines or API-integrated systems -- this video targets the CoWork project-based interface, not code-level agent orchestration.
  • You are brand new to CoWork and have not yet built your first agent -- there is a beginner video linked at the end that covers that ground first.
TL;DR

The full version, fast.

When an AI agent produces inconsistent or useless output, the root cause is almost never the model -- it is underspecified instructions. This video names three fixes: replace thin prompts with a proper job description that defines what good looks like and what to avoid; add a context anchor file that loads your voice, audience, and non-negotiables into every session automatically; and write a failure protocol that tells the agent exactly what to do when it cannot complete the task as specified. All three can be drafted with Claude's help in about ten minutes.

Members feature

Chat with this breakdown.

Modern Creator members can chat with any breakdown — ask for the hook, quote a framework, find the exact transcript moment. Unlocks at T2: refer 3 friends + add your own API key.

Create a free account →
Chapters

Where the time goes.

00:0000:31

01 · Hook -- the quiet quitter agent

Addresses CoWork users directly: you built agents that kind of work but stopped using at least one of them. Frames this as a setup problem, not a CoWork problem.

00:3201:43

02 · Origin story -- the broken grading system

University instructor built an AI grading assistant that hallucinated features that no longer existed. Root cause: gave the AI a job without a job description.

01:4402:37

03 · Preview -- three missing components

Sets up the three-part structure. Frames the vending machine vs. real agent contrast.

02:3804:18

04 · #1 Job Clarity

Distinguishes prompt (do this this time) from job description (responsibility, quality standard, edge cases). Shows a real morning brief agent instruction set with source list, format rules, topic thresholds, and null-state behavior.

04:1905:37

05 · #2 Context Anchor

Every session starts knowing agent instructions but not who the user is. A claude.md file loaded automatically carries voice, audience, frameworks, and non-negotiables into every session.

05:3807:11

06 · #3 Failure Protocol

Agents built only to succeed will guess when they hit edge cases -- confidently and wrongly. A failure protocol is fallback instructions built into the job description itself.

07:1208:19

07 · Action step -- build all three with Claude

Shows the exact prompt template to ask Claude to write a job description including failure protocol. Workspace overview: several agents, each with job description plus claude.md plus fallbacks.

08:2009:08

08 · CTA -- Portable AI Working Identity guide

Free guide for building the context anchor file. Final watch-next card.

Atomic Insights

Lines worth screenshotting.

  • The agent that stopped working after two runs was not broken -- it was under-instructed, and it was guessing.
  • A prompt tells the AI what to do this time. A job description tells it what it is responsible for, what good looks like, and how to handle the exceptions.
  • Confident wrong answers are worse than honest admissions of uncertainty -- build the failure protocol before the agent ships.
  • Every CoWork session starts knowing the agent instructions but not knowing who you are -- your voice, your audience, your non-negotiables are invisible unless you file them.
  • A context anchor loaded automatically into every project session is the single highest-leverage thing a CoWork user can build.
  • Agent inconsistency does not come from the agent failing -- it comes from the agent succeeding at the wrong thing because no one defined the wrong thing.
  • Adding a null-state instruction is one extra line and prevents the agent from fabricating content to fill a gap.
  • Briefing an AI agent is identical to briefing a new hire for a recurring weekly job they will do without supervision -- same mental model, same information required.
  • A failure protocol does not make an agent smarter -- it makes it honest, which is more useful than confident-sounding garbage.
  • The job description and the failure protocol are not separate documents -- the failure conditions are clauses inside the job itself.
Takeaway

Three things every recurring agent needs.

WHAT TO LEARN

An AI agent that guesses is not failing -- it is succeeding at something you never defined, which is why the fix is never about the model and always about the instructions.

  • A prompt and a job description are not the same thing: a prompt covers one session, a job description covers every session, including the edge cases you have not thought of yet.
  • Specificity in agent instructions is not pedantry -- it is the only way to close the gap between what the AI thinks you want and what you actually need.
  • A context anchor file is not a memory hack; it is the answer to why your agent sounds generic even when the output is technically correct.
  • Failure conditions should be defined before the agent runs, not discovered after it hallucinates three weeks in a row.
  • The right prompt template for building a job description with failure protocol built in: say the agent is supposed to do X for you every week, then ask Claude to help write a job description that includes what good output looks like, what to avoid, and what to do if it cannot complete the task.
Glossary

Terms worth knowing.

CoWork
Anthropic's Claude.ai Projects interface, which allows users to configure persistent AI agents with standing instructions that activate on every new session within a project.
Job description (agent)
A standing instruction set that defines what an agent is responsible for, what good output looks like, what mistakes to avoid, and how to handle edge cases -- distinct from a one-off prompt.
Context anchor
A persistent file loaded automatically into every project session, containing the user's voice, audience, frameworks, and non-negotiables so the agent does not start from zero each time.
Failure protocol
Explicit fallback instructions embedded in an agent's job description that specify what to do when the task cannot be completed as specified -- flag it, ask for clarification, or produce a partial output with a label.
Null-state instruction
A clause that tells the agent what to do when the expected input or output does not exist -- for example, if nothing is worth reporting today, say so instead of fabricating content.
Quotables

Lines you could clip.

01:09
I was giving it a job without a real job description.
Relatable self-diagnosis, zero setup neededTikTok hook↗ Tweet quote
01:28
The guesses were confident and wrong.
Five words, universal, instantly understoodIG reel cold open↗ Tweet quote
04:07
Write your agent instructions the way you'd brief a new hire on a job they're going to do every week without you watching.
Concrete, memorable analogy that reframes the whole tasknewsletter pull-quote↗ Tweet quote
07:00
The failure protocol didn't make the agent smarter, it made it honest.
Punchy contrast, counterintuitive framingTikTok hook↗ Tweet quote
06:15
Not from the agent failing, from the agent succeeding in the wrong thing.
Reframe of the failure problem -- sounds wrong until it clicksIG reel cold open↗ Tweet quote
The Script

Word for word.

metaphoranalogystory
00:00You've been using CoWork. You've built at least one agent, maybe a few, and they kind of work.
00:07But if you're honest, they all don't work the way you thought they would when you first set them up. And there's at least one agent you built, ran twice, and quietly stopped using.
00:21That's not a co work problem, that's a setup problem. And it's the same setup problem I had in my own workspace when I first started.
00:32I'm a university instructor. One of my courses is writing, and a couple of semesters ago, I built an AI grading system to handle part of the routine grading.
00:44I do the teaching, it lends a hand grading. But it was a mess at first. The AI kept hallucinating features that no longer existed on the platform.
00:56I had to screenshot my own screen to prove I was correct. And I almost quit several times. But at one point, I realized the problem wasn't the AI.
01:09It was me. I was giving it a job without a real job description. I was asking it to work right without telling it what looked right and I hadn't told it what to do when it wasn't sure, So it guessed.
01:26The guesses were confident and wrong. Once I fixed three things, it worked.
01:33Not perfectly, about 85 to 90% of the time.
01:38But it's been solidly running ever since with little intervention from me.
01:44The same three things that were missing from the grading system are missing from most co work agents I've seen. And in this video, I'm going to show you exactly what they are.
01:57And by the end of this video, you'll know exactly what to do about it. Let me start with what a well built agent actually looks like versus what most people actually have because the gap is actually more specific than you think.
02:15Most agents are built the same way. Someone opened co work, created a new project, wrote a few lines of instructions at the top, and then started using it. Co What they ended up with is a vending machine.
02:29You put something in, you get something back. Here's what each of the three things actually look like.
02:38The first one is job clarity, not prompt clarity, job clarity, and they're really quite different.
02:46A prompt tells the AI what to do this time. A job description tells the AI what it's actually responsible for, what good looks like, what mistakes to avoid, and how to handle the edge cases you haven't thought to prompt it on yet.
03:05Your co work agents need the same thing. Here's an example of my morning brief agent. I wrote, pull news from these five sources, summarize each item in exactly two sentences.
03:19The first sentence states what happened. The second states why it matters to a nontechnical professional.
03:26Flag anything relevant to AI workflow, productivity tools, or education technology.
03:33Do not flag product launches unless they're from a company with more than 10,000,000 users. If there's nothing worth reporting on a given day, say so.
03:44That level of specificity separates an agent that gives you information that it thinks you want versus what you really need.
03:54And here's the thing, you already know how to do this. You've been delegating work to people for years. You tell them what good looks like, what to watch out for, and how to handle the exceptions.
04:07Write your agent instructions the way you'd brief a new hire on a job they're going to do every week without you watching. That's job clarity.
04:20The second thing most people never build is a context anchor. The context here is that every session you use in co work starts with co work knowing your instructions, but not really knowing who you are.
04:38It doesn't know your voice unless you've told it your voice. It doesn't know your audience unless you've described your audience.
04:47It doesn't know what you've already covered, what you're looking toward, what you care about, or what your non negotiables are.
04:56Here's what this looks like in practice. Before building this, my LinkedIn agent created content that was technically fine, but it really didn't sound like me as of course you would expect.
05:11A context anchor fixes that. It's a file. I call mine claud dot m d that loads automatically in the background every time I open any session within this project.
05:25It contains everything any AI needs to do my specific work. If you don't have a file like this in your workspace, that's the single highest leverage thing you can build today.
05:40The third thing, and this is really the one most people are surprised about, is a failure protocol.
05:49Most agents are built to succeed. Almost zero are built to fail gracefully. When your morning brief agent, for example, can't find anything worth reporting, what does it do?
06:03When your research agent hits a paywall and can't pull a source. When your LinkedIn agent gets content that's too vague to work with, if you haven't told it what to do in those situations, it will do something.
06:17It will guess. That's where the inconsistency comes from.
06:23Not from the agent failing, from the agent succeeding in the wrong thing.
06:28A failure protocol is just a set of instructions for what to do when the job can't be done as specified.
06:37I'll give you a concrete example. My weekly research brief has a fallback instruction that says, if fewer than three credible sources are found on a given topic, do not synthesize from weak sources.
06:53Instead, flag that topic as low coverage for this week and move on. The failure protocol didn't make the agent smarter, it made it honest.
07:04And that is infinitely more useful than one that produces confident sounding garbage.
07:11Now, before we continue, if you found this video useful so far, please give it a like or comment. It really helps the channel.
07:21Also, subscribe if you would like to see more content like this. Here's what my workspace looks like now.
07:29Several agents. Each one has a real job description, draws from my Claude dot m d file, and has fallback instructions. That's the setup.
07:40So here's what to do next. Open whatever agent you want to improve, tell Claude what it's supposed to do, and ask it to help you write a proper job description.
07:52Something like this. This agent is supposed to do x for me every week, help me write a job description that includes what good output looks like, what to avoid, and what to do if it can't complete the task.
08:08That last part, what to do if it can't complete the task, is your failure protocol. You're not building it separately, you're building it into the job itself.
08:20You'll have a real instruction set including the fallbacks in about ten minutes and you can paste it straight back into the agent.
08:30If you want to build the context anchor I mentioned, the file that loads every time and makes every agent in your workspace smarter by default, I've put together a free guide that walks you through exactly how to do that.
08:46It's called the Portable AI Working Identity. Link is in the description. And if you're still in the early stages of co work and not sure exactly what you should build first, I've got a video that covers exactly that.
09:02Click this video here to see what it is.
The Hook

The bait, then the rug-pull.

You built the agent. You ran it twice. Then you quietly stopped. The video opens by naming that specific embarrassment -- not as a criticism, but as a diagnosis -- before a university instructor who once had the same problem explains the three-part fix he found.

Frameworks

Named ideas worth stealing.

02:38list

The Three Missing Components

  1. Job Clarity
  2. Context Anchor
  3. Failure Protocol

Three structural elements that distinguish a reliable recurring agent from a vending machine that produces inconsistent output.

Steal forAny AI agent setup checklist or agent audit workflow
02:38concept

Job Description vs. Prompt

A prompt is a one-time instruction. A job description is standing guidance: what the agent is responsible for, what good looks like, what to avoid, and how to handle exceptions. Same mental model as briefing a new hire.

Steal forFraming for a CoWork setup tutorial or onboarding guide
05:38concept

Failure Protocol

Embedded fallback instructions that define what the agent does when it cannot complete the task as specified. Prevents confident fabrication. Three options shown: flag and stop, ask for clarification, produce partial output with a label.

Steal forAgent instruction templates, AI delegation checklists
CTA Breakdown

How they asked for the click.

08:20link
I've put together a free guide that walks you through exactly how to do that. It's called the Portable AI Working Identity. Link is in the description.

Clean soft sell after the content is fully delivered. No urgency language. The guide name (Portable AI Working Identity) is specific enough to feel like a real product.

Storyboard

Visual structure at a glance.

open
hookopen00:00
origin story
credibilityorigin story00:32
job clarity
valuejob clarity02:38
context anchor
valuecontext anchor04:19
failure protocol
valuefailure protocol05:38
action step
ctaaction step08:08
Frame Gallery

Visual moments.

Watch next

More from this channel + related breakdowns.

31:08
Jack Roberts · Tutorial

Hermes Agent just got 10X Better (Agentic OS)

A 31-minute setup walkthrough that bridges Hermes AI agent and Claude Code into one shared operating system — with Pantheon personas, Obsidian memory, Apollo lead scraping, and Zapier-to-Gmail wired in by the end.

May 15th