
Hermes Agent Ships Security-Guidance Plugin With 25 Pattern-Matched Vulnerability Rules
Hermes Agent ports Anthropic's security-guidance plugin: 25 rules catch unsafe deserialization, command injection, and XSS at write time. Zero LLM tokens.
Deep-dives into AI infrastructure, prompt engineering, and the tools we build.

Hermes Agent ports Anthropic's security-guidance plugin: 25 rules catch unsafe deserialization, command injection, and XSS at write time. Zero LLM tokens.
Every component, plugin, and cron job in a production Hermes Agent setup, mapped and explained for replication from scratch.

Hermes Agent crossed a threshold this week: agents fixing agents, resurrecting decade-old phones, and running production workloads on 1M+ visitor sites.

Community desktop app turns Hermes into a 20-specialist multi-agent system with task decomposition, visual Skill Store, and orchestration.

GBrain's 30-page knowledge graph wired into AxDSan's Mnemosyne: a 113-char digest for ambient awareness, and relevance-gated page injection on first turn. Zero MCP calls.

Bridging OpenRouter's async video generation webhooks to Hermes Agent with a Cloudflare Worker signature translator and localhost tunnel.

GBrain runs at 250ms per CLI-spawned operation. Mnemosyne runs at 2ms in-process. The difference is architecture, not implementation quality.

Mnemosyne's sleep consolidation cycle scored +70.8pp on BEAM multi-hop reasoning with 0.076ms SQLite reads. Zero cloud dependencies, zero cold start.

Prompt caching cuts LLM input costs by up to 90% and TTFT by 80%, but only exact-prefix matches count. How production agents structure prompts for maximum cache hits.

A 3-layer memory stack earned 183 likes and 270 bookmarks. Hermes Desktop open-sourced. OpenClaw users migrating. Roundup of Hermes Agent on X.

Instrumenting the pre_gateway_dispatch hook to log every skill injection decision — score, skip reason, and top-3 matches — so we can tune the threshold with data instead of guessing.

semantic-skills v2 pre-loads skills into user messages via pre_gateway_dispatch hook. System prompt stays at 226 tokens, cache-friendly. No first-turn tool call needed.



A deep dive into the three-tier system prompt architecture inside Hermes Agent — how it works, why it costs 6,000+ tokens per session, and how we cut that by 95%.