The argument in one line.
Hermes Agent lets you build a self-improving AI assistant on your own VPS that learns your workflows, automates recurring tasks via cron jobs, and scales to multiple specialized agents by storing memory, skills, and configuration as version-controlled markdown files.
Read if. Skip if.
- A complete Hermes beginner who wants a single end-to-end walkthrough: VPS provisioning, install, Telegram connection, cron setup, and the self-improving loop — all in one session.
- Someone who was put off by the Mac mini requirement and wants to learn that a cheap Hostinger VPS is enough to run a persistent, always-on Hermes instance.
- A creator or community manager who wants to automate daily AI news briefings, YouTube comment responses, and morning business summaries via scheduled Hermes crons.
- A builder who already understands what Hermes is and wants to see the five core pillars (memory, skills, soul, crons, self-improving loop) applied in a live working system.
- You are already running Hermes on a VPS with Telegram connected and crons active — this course is foundational and covers ground you have likely already covered.
- You want a deep dive into multi-agent orchestration or advanced skill development; this is a zero-to-one setup course, not a power-user optimization guide.
The full version, fast.
Hermes Agent is an open-source, self-improving AI assistant from Noose Research that runs on your own VPS and reaches you anywhere through Telegram, Discord, or Slack. It rests on five pillars: persistent memory files that carry context across sessions, reusable skills written as markdown recipes, a soul file that defines personality, cron jobs that turn it from reactive to proactive, and a self-improving loop where corrections become durable skills and memory. Set it up with a Hostinger VPS using the one-click Docker install, authenticate through your ChatGPT subscription to avoid API costs, and connect Telegram via BotFather. Immediately sync everything to a private GitHub repo as a backup, store API keys in the .env rather than chat, and let one agent grow before splitting roles into separate containers.
Chat with this breakdown.
Modern Creator members can chat with any breakdown — ask for the hook, quote a framework, find the exact transcript moment. Unlocks at T2: refer 3 friends + add your own API key.
Create a free account →Where the time goes.

01 · Live demo intro
Shows live Hermes in Telegram: cron jobs, voice replies, hyperframes video the agent self-generated. Hook and promise established.

02 · What is Hermes + comparison to Claude Code and OpenClaw
Open-source MIT from Nous Research, 140K GitHub stars. Mental model: Claude Code = desk work; Hermes = on the go. Comparison breakdown of all three tools.

03 · The Five Pillars
Memory (user.md + memory.md), Skills (YAML front matter + Skills Hub 520+ community skills), Soul (soul.md personality), Crons (natural language scheduling, fresh isolated sessions), Self-Improving Loop. Each shown as Excalidraw diagram.

04 · VPS setup on Hostinger
KVM 2 plan, Ubuntu 24.04 LTS, Docker one-click install vs root install. Nate strongly recommends Docker for agent isolation. Shows Claude Code VPS-agents management project pattern.

05 · Onboarding + Telegram connection
Choose OpenAI Codex (cheapest, uses existing ChatGPT subscription). BotFather bot creation. Set allowed user ID. Launch CLI. First voice message auto-builds user.md.

06 · GitHub backup + first cron job
Secure API key flow: hermes config set in VPS terminal, NOT chat window. Creates private repo. Auto-generated nightly-github-sync skill + midnight cron from natural language request.

07 · CLI vs Telegram mental model
CLI = cockpit (deep work, slash commands, context visibility). Telegram = remote control (quick tasks, voice, scheduled reports). Same agent, same brain — Telegram hides context management.

08 · Best practices + security
Treat agent like an intern: narrow API scopes, own accounts per agent, VPS firewall via Hermes research, nightly security audit cron. Context compression demo at 167K tokens.

09 · Scaling multiple agents
Each agent in own Docker container. Decision tree for when to spin up a new agent. Bad pattern: one mega-agent with all keys. Good pattern: Personal Manager Hermes routing to specialist agents.
Lines worth screenshotting.
- Hermes Agent's five core pillars are memory, skills, soul (personality configuration), crons (scheduled automations), and a self-improving loop that writes and updates its own skills.
- Claude Code is for sitting at your desk doing knowledge work; Hermes is for being on the go and having an agent that wakes up immediately and responds through Telegram.
- Hermes comes with 684 total skills and 91 pre-installed out of the box — Excalidraw diagrams, voice transcription, and video generation are available without any additional configuration.
- Hermes running on a $5 VPS via Hostinger is the cost-optimal setup for 24/7 availability without tying up your local machine.
- A daily AI news briefing, YouTube comment monitoring, school community engagement, and morning business summaries are all examples of production crons running inside one Hermes instance.
- When you ask Hermes to make a video and it cannot do it well the first time, it researches the correct tool, installs it, and produces a significantly better second result — the self-improvement is observable.
- Hermes can respond to YouTube comments using a transcript-aware agent that has full knowledge of the video's content and the creator's background.
- The installation is a one-command setup — the agent provisions itself with tools, creates its memory database, and connects to Telegram without manual configuration of each component.
- GitHub backup automation ensures that all of Hermes's learned memory, custom skills, and configuration is version-controlled and recoverable.
- Multiple specialized agents (one for business research, one for content, one for personal tasks) can be scaled from a single Hermes instance using the same infrastructure.
- Hermes's SQLite memory database is searchable in real time — even tasks that did not make it into long-term memory can be found by searching the full conversation log.
- Asking Hermes to do an audit of its own security setup is the correct way to verify that API keys are stored safely and that the configuration is hardened.
Your agent is only as good as its context files.
Nate's Claude Code VPS-agents project — one repo, one .env per agent, all passwords and IPs in one place — is the missing organizational layer most builders skip on day one.
- Create a Claude Code project called vps-agents right now. One subfolder per agent. .env with IP, root password, admin creds, API keys.
- Five pillars first, tools second. Before installing anything, write user.md and memory.md. The agent is useless without durable context.
- Pass API keys via 'hermes config set KEY value' in the VPS terminal, never in the chat window.
- First cron = nightly GitHub backup. Natural language request, Hermes builds the skill automatically. Do this on session one.
- CLI for building, Telegram for running. Use Claude Code to set up and debug; use Telegram when you're walking.
- One agent, one job. Start with your personal/manager Hermes. Spin off specialist agents only when you hit the decision tree criteria.
- Correction + memory = the actual training loop. When the agent gets something wrong twice, correct it on the spot and tell it to update the skill or memory.
Terms worth knowing.
- VPS (Virtual Private Server)
- A rented computer in a data center that you control remotely. It gives you a dedicated slice of CPU, memory, and storage to run apps, services, or agents around the clock.
- Hermes Agent
- An open-source personal AI assistant from Nous Research that runs on your own infrastructure, supports skills, memory, and scheduled automations, and can be reached through messaging platforms like Telegram.
- Skill
- A reusable instruction file (skill.md) that tells an AI agent how to perform a specific task consistently. Acts like a recipe the agent follows so the same job produces the same quality every time.
- Cron / Cron Job
- A scheduled automation that runs at a set time or interval, such as every morning at 6 a.m. Agents use crons to handle recurring work without being asked each time.
- Soul.md
- A markdown file that defines an agent's personality, tone, and voice. It shapes how the assistant talks so different agents on the same system can feel distinctly different.
- Memory.md / User.md
- Markdown files that store durable facts about you and your environment so the agent does not start from scratch each session. User.md holds personal style and preferences; memory.md holds project and context details.
- Agents.md / Claude.md
- A per-project context file (named agents.md for Codex, claude.md for Claude Code) that describes the project's goal and structure so an AI coding agent has the right working context loaded.
- Stateless
- A property of large language models where each session starts with no memory of previous interactions. Persistent files like memory.md exist to compensate for this fresh-start behavior.
- Progressive Disclosure
- A design pattern where an agent only loads the full contents of a skill when it actually needs it, using short metadata to decide. This keeps the working context lean and avoids bloat.
- YAML Front Matter
- A small block of structured key-value data at the top of a markdown file, fenced by triple dashes. Agents read it to learn what a skill is for and when to invoke it.
- Context Window
- The maximum amount of text, measured in tokens, that a language model can consider at once. Once the limit is hit, older content must be summarized or dropped.
- Auto-Compaction
- An automatic process where an agent summarizes earlier parts of a long session to fit within the context window. It preserves continuity but may lose detail.
- Context Rot
- The degradation of an AI's response quality as a session's context grows too long or cluttered. Important instructions get diluted and the model starts missing or confusing details.
- CLI (Command Line Interface)
- A text-based way to interact with software by typing commands in a terminal. Many agentic tools expose their fullest controls and slash commands through a CLI.
- Claude Code
- Anthropic's terminal-based coding assistant that lives next to your code and is driven interactively by a developer. Often used as a daily driver for hands-on knowledge work.
- OpenClaw
- An open-source agent project, created by Peter Steinberger, designed for running AI assistants on personal infrastructure with messaging integrations. Often compared to Hermes Agent for similar use cases.
- Codex
- OpenAI's coding agent tooling and accompanying CLI workflow. Uses an agents.md file to describe project context, similar to how Claude Code uses claude.md.
- NemoClaw
- An enterprise-focused agent stack built by NVIDIA on top of the OpenClaw open-source project. Targets larger organizations rather than individual hobbyists.
- Open-Source Model
- A large language model whose weights are publicly released so anyone can run it locally or on their own server. Examples include Qwen and Llama, useful when privacy or cost is a concern.
- Docker / Docker Container
- A lightweight, isolated environment that packages an application with everything it needs to run. Multiple containers can live on one server without sharing files or credentials.
Things they pointed at.
Lines you could clip.
“Hermes agent is one of the most powerful AI agents that I've ever played with.”
“Cloud Code is still my daily driver. That's where I do 90% of my knowledge work throughout the day.”
“If you guys have ever seen the movie Memento — that's kind of like how agents work.”
“CLI is the cockpit. Telegram is the remote control.”
“This isn't a tool you finish setting up. It's a teammate that you keep using and you keep training.”
Word for word.
The bait, then the rug-pull.
Nate Herk opens with his live Hermes instance already running — voice replies in Telegram, cron jobs firing on schedule, a YouTube comment bot that knows his entire channel history. The promise: you can have this too, no Mac mini required, on a $6/mo VPS, in under an hour.
Named ideas worth stealing.
The Five Pillars of Hermes
- Memory (user.md + memory.md)
- Skills (skill.md with YAML front matter)
- Soul (soul.md personality)
- Crons (natural language scheduling)
- Self-Improving Loop
The five architectural concepts that make Hermes more than a chatbot — together they create a persistent, improving, proactive AI teammate.
Memento Mental Model for AI Agents
AI agents wake stateless every session, like the movie Memento. The context files (user.md, memory.md, CLAUDE.md) are the sticky notes on the wall that let it function.
CLI = Cockpit, Telegram = Remote Control
CLI gives full visibility: slash commands, context window, session management. Telegram is the mobile interface — same agent, less visibility. Don't vibe-code complex apps from Telegram.
Agent Scaling Decision Tree
- Needs different secrets/tools? -> new agent
- Needs separate long-term memory? -> new agent
- Is it ongoing repeated work? -> new agent
- One-off task? -> keep in personal main
Simple rule for when to spin up a new Hermes vs. adding to your existing one. Start with one, scale naturally when you hit criteria.
Least Privilege for Agents
Give each agent only the credentials and tools needed for its specific job. Marketing agent doesn't need QuickBooks access. Finance agent doesn't need social posting.
How they asked for the click.
“The link for the free resource guide is down in the description. You'll go into the free school community, click on classroom, click on all YouTube resources.”
CTA delivered twice: once mid-video before setup section, once at end. Free resource guide positioned as a companion doc you can hand directly to your Hermes to execute the setup. Smart integration.







































































