Big Idea

The argument in one line.

Hermes Agent lets you build a self-improving AI assistant on your own VPS that learns your workflows, automates recurring tasks via cron jobs, and scales to multiple specialized agents by storing memory, skills, and configuration as version-controlled markdown files.

Who This Is For

Read if. Skip if.

READ IF YOU ARE…

A complete Hermes beginner who wants a single end-to-end walkthrough: VPS provisioning, install, Telegram connection, cron setup, and the self-improving loop — all in one session.
Someone who was put off by the Mac mini requirement and wants to learn that a cheap Hostinger VPS is enough to run a persistent, always-on Hermes instance.
A creator or community manager who wants to automate daily AI news briefings, YouTube comment responses, and morning business summaries via scheduled Hermes crons.
A builder who already understands what Hermes is and wants to see the five core pillars (memory, skills, soul, crons, self-improving loop) applied in a live working system.

SKIP IF…

You are already running Hermes on a VPS with Telegram connected and crons active — this course is foundational and covers ground you have likely already covered.
You want a deep dive into multi-agent orchestration or advanced skill development; this is a zero-to-one setup course, not a power-user optimization guide.

TL;DR

The full version, fast.

Hermes Agent is an open-source, self-improving AI assistant from Noose Research that runs on your own VPS and reaches you anywhere through Telegram, Discord, or Slack. It rests on five pillars: persistent memory files that carry context across sessions, reusable skills written as markdown recipes, a soul file that defines personality, cron jobs that turn it from reactive to proactive, and a self-improving loop where corrections become durable skills and memory. Set it up with a Hostinger VPS using the one-click Docker install, authenticate through your ChatGPT subscription to avoid API costs, and connect Telegram via BotFather. Immediately sync everything to a private GitHub repo as a backup, store API keys in the .env rather than chat, and let one agent grow before splitting roles into separate containers.

Free for members

Chat with this breakdown — free.

Sign in and you get 23 free chat messages on us — ask for the hook, quote a framework, find the exact transcript moment, generate a markdown action plan. Bring your own key when you want unlimited.

Create a free account →

Chapters

Where the time goes.

00:00 – 03:19

01 · Live demo intro

Shows live Hermes in Telegram: cron jobs, voice replies, hyperframes video the agent self-generated. Hook and promise established.

03:19 – 07:36

02 · What is Hermes + comparison to Claude Code and OpenClaw

Open-source MIT from Nous Research, 140K GitHub stars. Mental model: Claude Code = desk work; Hermes = on the go. Comparison breakdown of all three tools.

07:36 – 16:20

03 · The Five Pillars

Memory (user.md + memory.md), Skills (YAML front matter + Skills Hub 520+ community skills), Soul (soul.md personality), Crons (natural language scheduling, fresh isolated sessions), Self-Improving Loop. Each shown as Excalidraw diagram.

16:20 – 25:30

04 · VPS setup on Hostinger

KVM 2 plan, Ubuntu 24.04 LTS, Docker one-click install vs root install. Nate strongly recommends Docker for agent isolation. Shows Claude Code VPS-agents management project pattern.

25:30 – 33:00

05 · Onboarding + Telegram connection

Choose OpenAI Codex (cheapest, uses existing ChatGPT subscription). BotFather bot creation. Set allowed user ID. Launch CLI. First voice message auto-builds user.md.

33:00 – 43:13

06 · GitHub backup + first cron job

Secure API key flow: hermes config set in VPS terminal, NOT chat window. Creates private repo. Auto-generated nightly-github-sync skill + midnight cron from natural language request.

43:13 – 46:20

07 · CLI vs Telegram mental model

CLI = cockpit (deep work, slash commands, context visibility). Telegram = remote control (quick tasks, voice, scheduled reports). Same agent, same brain — Telegram hides context management.

46:20 – 50:30

08 · Best practices + security

Treat agent like an intern: narrow API scopes, own accounts per agent, VPS firewall via Hermes research, nightly security audit cron. Context compression demo at 167K tokens.

50:30 – 58:22

09 · Scaling multiple agents

Each agent in own Docker container. Decision tree for when to spin up a new agent. Bad pattern: one mega-agent with all keys. Good pattern: Personal Manager Hermes routing to specialist agents.

Atomic Insights

Lines worth screenshotting.

Hermes Agent's five core pillars are memory, skills, soul (personality configuration), crons (scheduled automations), and a self-improving loop that writes and updates its own skills.
Claude Code is for sitting at your desk doing knowledge work; Hermes is for being on the go and having an agent that wakes up immediately and responds through Telegram.
Hermes comes with 684 total skills and 91 pre-installed out of the box — Excalidraw diagrams, voice transcription, and video generation are available without any additional configuration.
Hermes running on a $5 VPS via Hostinger is the cost-optimal setup for 24/7 availability without tying up your local machine.
A daily AI news briefing, YouTube comment monitoring, school community engagement, and morning business summaries are all examples of production crons running inside one Hermes instance.
When you ask Hermes to make a video and it cannot do it well the first time, it researches the correct tool, installs it, and produces a significantly better second result — the self-improvement is observable.
Hermes can respond to YouTube comments using a transcript-aware agent that has full knowledge of the video's content and the creator's background.
The installation is a one-command setup — the agent provisions itself with tools, creates its memory database, and connects to Telegram without manual configuration of each component.
GitHub backup automation ensures that all of Hermes's learned memory, custom skills, and configuration is version-controlled and recoverable.
Multiple specialized agents (one for business research, one for content, one for personal tasks) can be scaled from a single Hermes instance using the same infrastructure.
Hermes's SQLite memory database is searchable in real time — even tasks that did not make it into long-term memory can be found by searching the full conversation log.
Asking Hermes to do an audit of its own security setup is the correct way to verify that API keys are stored safely and that the configuration is hardened.

Takeaway

Your agent is only as good as its context files.

Steal this system

Nate's Claude Code VPS-agents project — one repo, one .env per agent, all passwords and IPs in one place — is the missing organizational layer most builders skip on day one.

Create a Claude Code project called vps-agents right now. One subfolder per agent. .env with IP, root password, admin creds, API keys.
Five pillars first, tools second. Before installing anything, write user.md and memory.md. The agent is useless without durable context.
Pass API keys via 'hermes config set KEY value' in the VPS terminal, never in the chat window.
First cron = nightly GitHub backup. Natural language request, Hermes builds the skill automatically. Do this on session one.
CLI for building, Telegram for running. Use Claude Code to set up and debug; use Telegram when you're walking.
One agent, one job. Start with your personal/manager Hermes. Spin off specialist agents only when you hit the decision tree criteria.
Correction + memory = the actual training loop. When the agent gets something wrong twice, correct it on the spot and tell it to update the skill or memory.

Glossary

Terms worth knowing.

VPS (Virtual Private Server): A rented computer in a data center that you control remotely. It gives you a dedicated slice of CPU, memory, and storage to run apps, services, or agents around the clock.
Hermes Agent: An open-source personal AI assistant from Nous Research that runs on your own infrastructure, supports skills, memory, and scheduled automations, and can be reached through messaging platforms like Telegram.
Skill: A reusable instruction file (skill.md) that tells an AI agent how to perform a specific task consistently. Acts like a recipe the agent follows so the same job produces the same quality every time.
Cron / Cron Job: A scheduled automation that runs at a set time or interval, such as every morning at 6 a.m. Agents use crons to handle recurring work without being asked each time.
Soul.md: A markdown file that defines an agent's personality, tone, and voice. It shapes how the assistant talks so different agents on the same system can feel distinctly different.
Memory.md / User.md: Markdown files that store durable facts about you and your environment so the agent does not start from scratch each session. User.md holds personal style and preferences; memory.md holds project and context details.
Agents.md / Claude.md: A per-project context file (named agents.md for Codex, claude.md for Claude Code) that describes the project's goal and structure so an AI coding agent has the right working context loaded.
Stateless: A property of large language models where each session starts with no memory of previous interactions. Persistent files like memory.md exist to compensate for this fresh-start behavior.
Progressive Disclosure: A design pattern where an agent only loads the full contents of a skill when it actually needs it, using short metadata to decide. This keeps the working context lean and avoids bloat.
YAML Front Matter: A small block of structured key-value data at the top of a markdown file, fenced by triple dashes. Agents read it to learn what a skill is for and when to invoke it.
Context Window: The maximum amount of text, measured in tokens, that a language model can consider at once. Once the limit is hit, older content must be summarized or dropped.
Auto-Compaction: An automatic process where an agent summarizes earlier parts of a long session to fit within the context window. It preserves continuity but may lose detail.
Context Rot: The degradation of an AI's response quality as a session's context grows too long or cluttered. Important instructions get diluted and the model starts missing or confusing details.
CLI (Command Line Interface): A text-based way to interact with software by typing commands in a terminal. Many agentic tools expose their fullest controls and slash commands through a CLI.
Claude Code: Anthropic's terminal-based coding assistant that lives next to your code and is driven interactively by a developer. Often used as a daily driver for hands-on knowledge work.
OpenClaw: An open-source agent project, created by Peter Steinberger, designed for running AI assistants on personal infrastructure with messaging integrations. Often compared to Hermes Agent for similar use cases.
Codex: OpenAI's coding agent tooling and accompanying CLI workflow. Uses an agents.md file to describe project context, similar to how Claude Code uses claude.md.
NemoClaw: An enterprise-focused agent stack built by NVIDIA on top of the OpenClaw open-source project. Targets larger organizations rather than individual hobbyists.
Open-Source Model: A large language model whose weights are publicly released so anyone can run it locally or on their own server. Examples include Qwen and Llama, useful when privacy or cost is a concern.
Docker / Docker Container: A lightweight, isolated environment that packages an application with everything it needs to run. Multiple containers can live on one server without sharing files or credentials.

Resources

Things they pointed at.

16:30productHostinger VPS ↗

57:30productGlaido voice-to-text ↗

06:40toolNous Research / Hermes Agent ↗

03:20toolOpenClaw

18:00toolHermes Skills Hub ↗

35:00linkGitHub personal access tokens ↗

Quotables