Big Idea

The argument in one line.

Building your own agentic OS inside Claude Code beats installing Hermes off-the-shelf because every pre-built stack inherits someone else's architecture, assumptions, and scaling ceilings.

Who This Is For

Read if. Skip if.

READ IF YOU ARE…

Claude Code power users who have looked at Hermes or similar agentic frameworks but don't want to inherit someone else's assumptions
Builders who want to understand memory systems and identity layers rather than just installing a black box
Developers who have had off-the-shelf agent stacks fail in ways they couldn't debug and want a build-your-own path

SKIP IF…

Beginners to Claude Code who haven't built their own setup yet — this assumes comfort with skills, memory files, and agent architecture
Anyone happy with Hermes or another pre-built agentic OS who isn't looking to roll their own

TL;DR

The full version, fast.

Installing an off-the-shelf agentic OS like Hermes is fast to start but comes with three hidden costs: inherited architectural assumptions you didn't choose, failures you can't debug because you don't understand the layers underneath, and scaling constraints built for someone else's use case. The alternative demonstrated here is rebuilding only the parts you actually need — identity layer, memory system, and modular skills — inside your own Claude Code setup. The self-learning loop Hermes celebrates has no external validation, meaning the same model that writes a skill also grades it, which quietly overwrites your improvements with worse versions. Building from scratch produces a system you fully understand, can debug, and can evolve as the space changes — which is more valuable long-term than faster initial setup.

Free for members

Chat with this breakdown — free.

Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.

Create a free account →

Chapters

Where the time goes.

00:00 – 01:05

01 · Cold open + promise

Hermes velocity stat → 'I read through the issues' → thesis: rebuild don't install → what this video covers

01:05 – 03:08

02 · Cost #1 — Inherited assumptions

The self-learning loop grades its own homework. No external validation. Can silently overwrite your good work with no audit log.

03:08 – 03:47

03 · Cost #2 — Can't fix what you don't own

OpenClaw: 200+ CVEs filed since February, 386 malicious packages from one threat actor. You're debugging someone else's code.

03:47 – 05:08

04 · Cost #3 — Doesn't scale across clients

Paul Baier (nontechnical CEO) spent 100+ hours and $1,000+ testing OpenClaw. Hermes is single-tenant by design — separate install per client.

05:08 – 06:25

05 · What he rebuilt: Identity layer

Keeps user.md + memory.md from Hermes but adds per-client brand context folders — voice, ICP, positioning, visual identity — that share procedures across clients.

06:25 – 08:23

06 · Memory system

Keeps Hermes's capped injection (~1,300 char memory.md) but replaces keyword long-term search with MemSearch (semantic/meaning-based recall).

08:23 – 11:00

07 · Self-learning loop critique + skill systems

Hermes auto-generates new skills but ends up with 15 near-duplicate LinkedIn skills with no deduplication or version control. Solution: modular skill components that chain together.

11:00 – 12:56

08 · Build vs. buy trade-off + CTA

Honest framing: faster to start with Hermes, faster to scale with your own. Neither is right for everyone. CTA to AgenTek Academy.

Atomic Insights

Lines worth screenshotting.

Hermes reached 40,000 GitHub stars in 46 days — the fastest adoption of any agentic system ever recorded on GitHub.
Installing an off-the-shelf agentic stack means inheriting its architecture's assumptions, which only reveal themselves once you are already committed to the system.
Hermes's self-learning loop has no external validation step — the same model that writes a skill is also the sole judge of whether that skill is correct.
A security researcher found 386 malicious packages in OpenCLAW's skills marketplace from a single threat actor.
Multi-client agencies cannot use one Hermes installation — each client requires a completely separate install with its own memory and skills that never share.
When a self-learning agent creates skills automatically, you risk ending up with 15 nearly-identical variants of the same task with no way to know which one to use.
Searching memory by keyword fails when you cannot remember the exact words you used in a conversation six months ago — meaning search is the only viable long-term recall strategy.
A modular skill system keeps voice, ICP, and formatting in separate files so updating one propagates to every skill system that depends on it automatically.
Hermes is faster to start building with; a custom setup is faster to maintain and scale past the tenth skill.
Understanding every assumption in your agentic stack is worth the slower build time because it determines whether you can fix what breaks.

Takeaway

The modular OS beats the installed one.

Own your stack — the AI edition

Hermes is faster to start; your own setup is faster to scale — and the hidden costs of someone else's architecture only surface once you're already committed.

Use Simon's three-hidden-costs structure verbatim for any 'why I stopped using X SaaS' video — it works for any AI tool critique.
The self-validation problem ('grading your own homework') is a clean, quotable metaphor for any content about AI blind spots.
The modular skill system idea directly maps to Joe's own setup: voice.md, ICP.md, format.md as separate source-of-truth files that compose into skill systems.
Simon's multi-client identity layer (per-client brand context folders sharing procedures) is worth shipping inside JoeFlow's Sessions panel as a named feature.
The MemSearch upgrade (semantic vs. keyword recall) is a concrete next step for any memory system — worth researching for the JoeFlow stack.

Glossary

Terms worth knowing.

Hermes (agentic OS): An open-source agentic operating system for AI assistants — built on top of Claude Code — that adds persistent memory, an identity layer, and a self-learning skill loop to the base AI coding environment.
Agentic OS: A layer of configuration files, memory systems, and skill definitions placed on top of an AI coding tool that gives it a persistent identity, long-term memory, and reusable behaviors across sessions.
Identity layer: A set of files (typically user.md and memory.md) injected at the start of every AI conversation to tell the agent who it's working for, what the brand stands for, and what context matters most.
Self-learning loop: A mechanism in some agentic systems where the AI automatically writes and saves a new skill file after completing a task, so that behavior can be reused in future sessions without re-prompting.
Memory injection: The practice of loading a summary of past conversations or important facts into the start of a new AI session so the model retains context it would otherwise lose between chats.
Keyword search vs. semantic search: Keyword search matches stored memories by exact words; semantic (meaning-based) search retrieves memories by conceptual similarity, which is more useful when the original phrasing can't be recalled precisely.
MemSearch: A memory architecture for AI agents that retrieves stored information by semantic meaning rather than keyword matching, enabling more accurate long-term recall.
Skill system: A modular approach to AI agent skills where each skill does one job and larger tasks are handled by chaining multiple single-purpose skills together, rather than baking all logic into one monolithic prompt.
ICP (Ideal Customer Profile): A detailed description of the specific type of customer a business most wants to attract, used to focus messaging, content, and offers.
OpenCLAW: An open-source alternative to Claude Code's agentic stack that preceded Hermes, noted for accumulating a large number of reported security vulnerabilities after release.

Quotables