Hermes Agent is Nous Research’s self-hosted AI agent for people who want something more durable than a chat wrapper. It keeps state across sessions, learns from what it has already done, and gives you a way to run an agent on your own setup instead of depending on a hosted app. If you want the official sources first, start with the Hermes Agent GitHub repo and the Hermes docs.
In short, Hermes is useful if you want an agent that can remember your workflow, use different models, and live in more places than a single chat window. It takes more setup than a basic chatbot, but that is the tradeoff for persistence, control, and reusable work.
- Best for: technical users, self-hosters, and teams that repeat similar tasks.
- Not for: people who want a lightweight chatbot with almost no setup.
- Main value: memory, skills, and connectors that keep the agent useful after the first session.
- Main tradeoff: you get more control, but you also manage more of the setup.
What Hermes Agent is

Hermes is an agent platform, not just a prompt box. The point is to keep useful context, remember what worked, and reuse that context later instead of starting from zero every time. That matters if you want an agent to behave more like a tool you build on top of, not a one-off chat session.
If you are reading about Hermes because you saw it on GitHub or in a Discord discussion, the real question is simple: does it get better with use, or does it reset after every task? Hermes is designed to get better with use.
Hermes Github repo, Quick start guide, AI providers, Memory, Skills and More
If you want to understand Hermes without guessing, these are the pages to open first:
- GitHub repo for source, releases, and install notes.
- Quickstart for the shortest path to a working setup.
- AI Providers for Claude, Ollama, OpenRouter, Codex, and other options.
- Persistent Memory for how Hermes remembers you and your projects.
- Skills System for reusable instructions and on-demand loading.
- Messaging Gateway for Discord, Slack, Telegram, and the other connectors.
How Hermes agent works
The practical way to think about Hermes is as a long-running assistant with memory, tools, and session history. It stores useful context, turns solved work into reusable skills, and keeps a record of conversations so you can resume or search them later.
That is why it behaves differently from a simple chatbot. If you keep asking it to do the same kind of work, it can remember the setup, the tools, and the decisions you already made instead of forcing you to explain everything again.
Install and setup
The setup path is one of the main reasons people search for Hermes in the first place. The shortest route is to install it, pick a provider, verify a working chat, and then add the gateway if you want it to live in Discord, Slack, or another channel.
The main commands you will actually use are:
hermes setupto get a working base install.hermes modelto choose or add a provider.hermes gateway setupafter the CLI works, if you want a bot or always-on setup.
Windows users usually run Hermes through WSL2. Docker is a sensible choice if you want isolation. If you are only trying to prove it works, start with the CLI first and add the gateway later.
Providers: Ollama, Claude, and the 64K rule
Hermes needs a model provider, and that choice changes how it feels in real use. The key detail is that Hermes requires at least 64K context. That is not optional. If your model cannot hold that window, multi-step agent work breaks down fast.
For most first-time users, the easiest providers are Nous Portal or OpenRouter. If you already have Claude auth, Hermes can use Anthropic directly. If you want local inference, Ollama is the obvious path.
Here is the practical split:
- Ollama: best when you want local, private, or cheaper inference. Set the model context high enough for Hermes to work properly.
- Claude / Anthropic: best when you want stronger reasoning and are already using Claude Code or an Anthropic API key.
- OpenRouter: best when you want one key and access to multiple model families.
- OpenAI Codex / GitHub Copilot: useful if you already live in those ecosystems and want Hermes to sit on top of them.
The rule of thumb is simple: use Ollama if you want local control, use Claude if you want a stronger hosted model, and use OpenRouter if you want flexibility without redoing your setup every time.
Memory
This is one of Hermes’ main reasons to exist. Hermes uses bounded, curated memory that persists across sessions. It keeps the agent from acting like it has no recollection of your workflow the next day.
The memory system is built around two files:
MEMORY.mdfor environment facts, project conventions, tool quirks, and things the agent has learned.USER.mdfor your preferences, communication style, and working habits.
Those files live in ~/.hermes/memories/ and are loaded as a frozen snapshot at session start. Hermes manages its own memory with the memory tool, so it can add, replace, or remove entries as it learns.
The part that matters for real use is focus. Hermes keeps memory bounded so the context stays useful instead of bloating forever. When memory gets full, it consolidates or replaces entries.
If you need deeper persistent memory, Hermes also supports external memory providers such as Honcho, Mem0, Hindsight, Holographic, RetainDB, ByteRover, Supermemory, and OpenViking. Those sit on top of the built-in memory, not instead of it.
Skills
Hermes skills are not marketing fluff. They are on-demand instruction sets the agent loads only when it needs them. That keeps the main context smaller and makes the agent more predictable.
The skills system follows progressive disclosure. In practice, that means Hermes does not drag every possible instruction into every chat. It loads the skill when the task needs it and leaves it out when it does not.
Skills live in ~/.hermes/skills/ and can include:
SKILL.mdfor the actual instructions.references/for extra documentation.templates/for output formats.scripts/for helper commands.assets/for supporting files.
Hermes also understands the agentskills.io open standard, so a skill can be portable instead of being locked to one local setup. If a skill needs a secret, Hermes asks for it securely only when that skill is actually loaded.
Connectors and gateway
This is where Hermes stops being just a CLI and starts being a system. The messaging gateway is a background process that connects Hermes to platforms like Telegram, Discord, Slack, WhatsApp, Signal, SMS, Email, Home Assistant, Mattermost, Matrix, DingTalk, Feishu/Lark, WeCom, Weixin, BlueBubbles, QQ, and your browser.
The gateway is useful because it handles sessions, runs cron jobs, and delivers voice messages. That means Hermes can live where you already work instead of forcing you into one interface.
The practical connector advice is straightforward:
- Start with the CLI. Get a working model and a normal chat session first.
- Add the gateway after that. Use
hermes gateway setupwhen you want the bot to sit in chat apps or a browser. - Use sessions properly. Hermes can isolate users in shared channels so one person’s chat does not pollute everyone else’s context.
If you mainly care about Discord, Slack, or Telegram, this is the section that matters most. The gateway is the real connector layer, not a vague integration promise.
Who Hermes is for
Hermes makes the most sense if you want control and persistence. It is a strong fit for technical users, self-hosters, and teams that repeat similar tasks and want the agent to remember the shape of the work.
It is probably not the right first choice if you just want a simple hosted assistant. Hermes asks you to care about setup, provider choice, permissions, and where the data lives.
Security and trust
Security is part of the decision with Hermes. Self-hosting gives you more control over the environment, but it also means you are responsible for what the agent can access and which providers it can reach.
That is a real advantage if you want privacy and containment. It is also a real tradeoff if you do not want to manage tokens, gateways, and local config.
Hermes Agent vs OpenClaw Agent
If you are comparing Hermes with OpenClaw, the split is pretty clean. Hermes leans toward memory, learning, and repeated work. OpenClaw leans toward breadth, channels, and a larger skill ecosystem.
Pick Hermes if you want the agent to grow with your workflow. Pick OpenClaw if you want broader coverage and more integrations out of the box. Refer to our detailed comparison of OpenClaw and Hermes agent.
Frequently asked questions
What is Hermes Agent?
Hermes Agent is Nous Research’s self-hosted AI agent platform with persistent memory, reusable skills, and gateway channels.
Is Hermes Agent free?
The software is open source, but your real cost depends on the model provider, hosting, and the setup you choose.
How do you install Hermes Agent?
The shortest path is hermes setup, then hermes model, then hermes gateway setup if you want Discord, Slack, Telegram, or another connector.
Does Hermes Agent work on Windows?
Yes, usually through WSL2 rather than native Windows.
Does Hermes Agent work in Docker?
Yes. Docker is a sensible option if you want isolation and a more controlled runtime.
Does Hermes Agent have a web UI?
Hermes has a browser-facing gateway option, but the CLI is the main interface for setup and daily power use.
Which providers work with Hermes Agent?
Hermes works with providers such as Ollama, Anthropic, OpenRouter, OpenAI Codex, GitHub Copilot, and other OpenAI-compatible endpoints.
How does Hermes memory work?
Hermes uses MEMORY.md and USER.md to store bounded notes about the environment and user preferences across sessions.
How do Hermes skills work?
Skills are loaded on demand. They live in ~/.hermes/skills/ and can include instructions, references, templates, scripts, and assets.
Can Hermes Agent work with Ollama?
Yes. Ollama is a good choice if you want local or private inference, as long as the model context is large enough for Hermes.
Can Hermes Agent work with Claude?
Yes. Hermes can use Anthropic directly, or Claude through compatible provider setups.
What platforms can Hermes connect to?
Hermes connects to Telegram, Discord, Slack, WhatsApp, Signal, SMS, Email, Home Assistant, Mattermost, Matrix, DingTalk, Feishu/Lark, WeCom, Weixin, BlueBubbles, QQ, and the browser.
Is Hermes Agent better than OpenClaw Agent?
Not universally. Hermes is better for memory and repeated work. OpenClaw is better for breadth and channels.
Bottom line: Hermes Agent is worth checking out if you want a self-hosted agent that remembers work, supports multiple providers, and can live across channels. If you just want a simple assistant, it may be more system than you need.










