Hermes Agent has a NEW SuperPower (NotebookLM)
How Jack Roberts wired Google NotebookLM into the Hermes Telegram agent -- turning a free 300-source research platform into a 24/7 AI assistant that reads, acts, and automates from your phone.
May 18thA 41-minute field guide to the open-source AI agent framework — 21 concepts, zero jargon, one tutorial that starts from zero.
Hermes is not a smarter chatbot but a personal AI operating system that compounds its usefulness the longer you use it — and understanding its 21 building blocks is the difference between occasional help and continuous leverage.
Hermes is an AI agent that uses tools to take real actions rather than only generate text. It runs on your own machine and learns your context over time through three layered memory systems. The video covers 21 concepts in ascending complexity: choosing which model to wire in, how MCP servers standardize tool connections, how sub-agents parallelize work, and how cron heartbeats keep tasks running while you sleep. The critical operational insight is that 73 percent of every API request is fixed system-prompt overhead, making session hygiene the single biggest cost lever.
Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.
Create a free account →
The foundational distinction: a chatbot creates a plan, an agent executes it. Live demo shows Hermes booking a flight and rendering results as HTML.

Four-animal framework: Hermes is your dog (companion), Claude Code is your contractor (project-scoped), OpenClaw is your roommate, Antigravity is your IDE buddy. Use Hermes mobile; Claude Code at desk.

Same Hermes agent accessible from Telegram, Discord, WhatsApp, browser OS, or any of 22+ platforms. One intelligence routed through many interfaces.

Three deployment options: laptop (free), VPS (~$5/mo), serverless (cents/month). Start local, never pay until you must.

OAuth = one-click sign-in, revocable. API key = secret string, keep out of chat logs, rotate when exposed. Claude requires API key only.

Model-agnostic multi-brain strategy: Claude/Opus for reasoning, GPT for generalist volume, Grok for search and Twitter, DeepSeek for free high-volume tasks. OpenRouter as the single hub.

Local model via Ollama = 100% offline, private, no costs. Limited by laptop RAM. Paste your Mac specs into Hermes to ask which quantized model you can run.

Three memory layers: memory.md (persistent facts), SQLite full-text search of every session, Obsidian integration. Two weeks of use before it feels magical.

The character bible that gives the agent a consistent persona, values, tone, and communication style. What makes your Hermes different from everyone else's.

~70 built-in integrations. Demo: Granola meeting-notes MCP connected via OAuth and API key to answer questions about past meetings from anywhere.

Hermes can operate a real browser via Chrome DevTools Protocol — real cursor movements, not screenshots. Native computer use.

API = raw wiring, MCP = instruction manual around the wiring. A universal remote that tells the AI every button each app has — token-efficient and reliable.

Skills and Pantheon personas: assign specialist models to specialist tasks. Sub-agents inherit fresh context per task and report back.

Six slash commands: /q (queue next prompt), /background (run in parallel), /kanban (task board), /reset (clear session), /compress (summarize context), /model (swap brain).

Principle of least privilege: only grant minimum access the task needs. Never paste API keys into chat. No send-email permission until error rate is trusted.

Goal function: 20-turn North Star — Hermes won't stop until it decides. Super goals: structured multi-step project plans with human and agent tasks, progress bar, dashboard.

Parallel sub-agents each get fresh context, work simultaneously, report back. Tuesday-morning work done by Tuesday lunch. Hermes co-founder runs 12 parallel agents daily.

Heartbeat = zombie watchdog that restarts crashed agents. Cron = natural-language scheduled tasks (no syntax required). Daily morning briefing demo.

73% of every request is fixed overhead. One user spent 4M tokens asking about the weather by mistake. Use cheap model for volume, expensive for hard calls.

Hermes as the hub: Pantheon personas, Obsidian memory vault, terminal/gateway, tool workshop. GitHub backup daily. The everything system for your AI life.

Hermes (business brain, long-term memory) + Claude Code (precision builder) share context so neither operates in isolation.
Hermes is not a better chatbot — it is a personal operating system that gets smarter the longer you use it, and these 21 concepts are the mental models that unlock the difference between occasional help and daily leverage.
“A chatbot tells you how to book a flight. An agent will actually go ahead and book the flight and find it for you.”
“The more you use it, the better it understands you. Its whole design ethos is built to live with you and improve with you over time.”
“73% of every request is fixed overhead.”
“Some guy actually spent 4,000,000 tokens in two hours of light usage, 21,000 tokens just asking about the weather because he made a mistake.”
“Tuesday morning's work, done by Tuesday lunch.”
See every word as it's spoken — crank it to 2× and still catch all of it. The same dual-channel trick behind Amazon's Kindle + Audible.
Most AI tutorials assume you already know what an agent is. This one starts before that — at the moment you open the app, see a wall of tools and skills, and ask yourself what any of it is supposed to do for you.
A mental model for when to use which AI tool based on the relationship each requires.
Start local and free; only pay for always-on infrastructure when uptime genuinely matters.
The only two ways to connect any service to Hermes.
Model-agnostic strategy: treat your AI brain as a toolbox, not a single hammer.
The six slash commands that separate manual Hermes users from power users.
The principle of least access applied to AI agents.
Parallel sub-agents each with fresh context work simultaneously and report back.
The dirty secret of AI token costs: most of what you pay for is boilerplate, not your actual query.
“The next thing we need to do is leverage all of these and get one of these operating systems ourselves, which you can learn about in this video right here.”
Clean end-screen redirect to OS setup video. No product pitch, no newsletter. Pure next-step funnel within YouTube.
00:01
00:30
01:07
01:59
02:25
02:49
03:41
04:36
05:08
05:36
06:20
07:04
07:55
08:26
09:27
09:55
10:19
10:38
11:18
11:35
12:27
13:11
13:29
13:55
14:51
15:03
15:45
16:02
16:36
17:18
18:11
18:17
18:45
19:26
19:50
20:25
21:21
21:49
22:15
22:39
22:53
23:11
23:27
23:43
24:35
24:53
25:33
25:56
26:04
26:34
27:26
27:54
28:46
29:14
30:30
31:22
31:59
32:19
32:41
32:59
33:30
33:39
33:56
34:59
35:28
35:49
36:19
36:48
37:18
37:39
38:05
38:11
38:16
38:35
39:05
39:48
40:00
40:39
40:56
41:09How Jack Roberts wired Google NotebookLM into the Hermes Telegram agent -- turning a free 300-source research platform into a 24/7 AI assistant that reads, acts, and automates from your phone.
May 18thA 31-minute setup walkthrough that bridges Hermes AI agent and Claude Code into one shared operating system — with Pantheon personas, Obsidian memory, Apollo lead scraping, and Zapier-to-Gmail wired in by the end.
May 15thJack Roberts complete Hermes Agent mastery guide from memory systems through deployment in under 25 minutes.
May 24thA 21-minute walkthrough on running a three-model AI triad overnight — Opus plans, DeepSeek grinds, GPT-5.5 critiques — for 1% of the cost of going all-in on frontier models.
May 16thA 24-minute screen-recorded walkthrough that builds a personal Claude Design system from scratch and ships it into slides, websites, apps, and automated Routines.
April 20thJack Roberts turns the /goal feature into a multi-week agentic OS: AI sprints plus human handshakes, tracked in a mission-control dashboard.
May 21st