
Please critique my planned Hermes Agent setup
After briefly experimenting with Hermes Agent on my Macbook (with a local Gemma 4 model), I'm about to set up a more robust instance on a VPS. I'd appreciate your input on my config - both the choices I already pinned down and the ones I'm still deciding on (such as the specific LLMs).
For context: My use cases are still pretty open ended and exploratory, and will evolve over time. While I have a lot of experience with coding agents (Claude Code in particular) and I've built my own agentic pipelines, I'm fairly new to more general, always-on agents like Hermes. Never used OpenClaw etc. I envision using Hermes for extended research tasks (say into YouTube stats and patterns, app opportunities, gamedev, various niches, etc.) as well as daily briefings, using both my calendar and Todoist as well as current news. I want Hermes to be able to generate and share markdown reports and potentially other files (images, videos, PDFs) as well. I'd like to be able to forward emails to Hermes to have it act on them. I'm less likely to use it for coding directly, but will reassess this later.
Below is a high level list of my planned config, with some more details below:
- VPS: Hetzner CPX22 (2 VCPU, 4 GB RAM, 80 GB SSD
- LLM Provider: OpenRouter
- LLMs: TBD, see below
- Memory Provider: ByteRover
- Web Search Provider: TBD; Tavily or Firecrawl
- Long Term Data Storage: Obsidian Vault and Google Drive (details below)
- Messaging: Telegram
More details on some of these choices and open questions below.
VPS:
Hetzner because of competitive pricing. (I've mostly used DigitalOcean in the past, but their 4 GB instance is 2.5x the cost.)
I believe CPX22 with 4 GB RAM and 2 VCPUs is the right sweet spot for my needs?
LLMs:
This will likely require some experimentation. For custom apps, I've mostly used different flavors of Gemini (e.g. Gemini 2.5 Pro, Flash, and Flash Lite, depending on use case). So that's definitely a contender. Flash might be a good default model.
DeepSeek V4 also seems attractive, primarily because of the low cost.
Open to open source models like Qwen, Gemma 4, or Kimi 2 as well.
I'll read through more of the recommendations in this subreddit, but let me know if you have any particular recommendations for combos that have worked well for you.
Memory:
After researching the officially supported options, I landed on ByteRover. I like the file based approach and git semantics, as well as the tiered search. At least on paper, it seems more than suitable for my needs. I'd just use the local setup, with backup to Github.
I considered Hindsight, as the idea of a knowledge graph sounds compelling, and I've had great results with Postgres and pgvector for my own apps. But realistically, this is overkill for my needs right now.
Web Search:
Firecrawl and Tavily seem like the most popular options. Tavily seems to have the more generous free tier, but Firecrawl seems more commonly suggested. Any thoughts on the trade-offs here? Any alternative recommendations?
Data Storage:
A combination of Obsidian and Google Drive.
I already use Obsidian as my personal note taking app. I would set up a separate vault for Hermes Agent and sync this to a Github repo. I envision using this for more detailed, longer term data. Things like research reports, daily briefings, etc.
I want to explore ByteRover's "swarm" feature as well. It sounds like it can perform federated searches across its own memory and my Obsidian vault, which sounds compelling.
Google Drive is already my main cloud storage for personal and business related files. I would give Hermes read-only access to specific folders that might be needed for certain tasks. I would only give it write access to a dedicated "Hermes" folder.
I realize there are several ways to set up Google Drive support. I lean towards using the official Google Workspace CLI; see below.
Google Workspace:
I would create a dedicated Hermes account under my existing Google Workspace domain. That way, I can cleanly provision access to Google Drive, Email, Calendar, etc.
The official Google Workspace CLI sounds like the cleanest solution. That way, I can not only access Google Drive, but also Email etc. The CLI comes with agent skills that should make the Hermes integration pretty straightforward and robust. I should even be able to leverage ModelArmor to scan incoming emails to prevent prompt injection.
Other Integrations:
Telegram for messaging. (Perhaps Discord in the future.)
Todoist; haven't looked into plugins / MCPs / APIs yet.
Please let me know if you have any feedback or suggestions for improvement on this config. Thanks! Looking forward to getting deeper into Hermes Agent and uncovering more use cases over time. 😄
(Edit: Added a section for Web Search.)