Deploy AI agents with one API call.
Deploy a fully harnessed AI agent for every customer. Bring your system prompt, skills, MCP tools, and connectors, and get infrastructure, orchestration, memory, self-learning, and more out of the box.
Problem
Building a good AI agent is hard
A production agent is more than a model in a loop. Before customers trust it with real work, it needs every layer below — and on a bare SDK, each one is a project you build and maintain yourself, none of which makes your product different.
Tool integration
Hand-rolled MCP clients, tool schemas, and per-customer credential injection — with allowlists to scope as your tool surface grows.
Skills
Domain playbooks crammed into the system prompt burn context on every turn. On-demand loading needs a skill format the model actually follows.
Automations
Cron parsing, job queues, retry backoff, and run state that survives a worker dying mid-job — for every schedule a customer creates.
Memory
Context compaction inside a session, persistence across them, and per-tenant scoping — thread checkpoints alone can't do it.
Self-learning
Turning corrections into future behavior takes idle-time consolidation and feedback loops you'd design from scratch.
Infrastructure
Container-per-customer isolation, persistent volumes, SSE fan-out with replay, secret vaulting — plumbing measured in months.


Product
Deploy agents in one API call
Stop harness-engineering. Make one API request with your system prompt, skills, MCP tools, and connectors, and Herm deploys an isolated AI agent for each of your customers — with the memory, automations, self-learning, and persistent filesystem every agent application needs built in.
- One API call per customer
- Durable threads and runs
- Tokens and tool calls over SSE
- Learns every customer
- Scheduled background runs
- Markdown playbooks on demand
- Web, browser, shell built in
- Your tools, open protocol
- A persistent workspace
- Credentials, never in the sandbox
- Human approval mid-run
- A container per customer
- Improves between sessions
- Parallel isolated workers
- React to workspace changes
- Runs survive crashes
Herm vs. building it yourself
The runtime is the moat you shouldn't build
Every capability below is mandatory before customers trust an agent with real work, and none of it differentiates your product. Building it yourself means months of runtime engineering and a maintenance treadmill. Herm ships the whole list out of the box in a single API call.
- Agent + infrastructure in one API call
- Durable execution — runs survive crashes and deploys
- Session management and compaction
- Observational memory across sessions
- Per-customer container isolation
- Credential management for connectors
- Steering and human-in-the-loop approvals
- Streaming over SSE
- Built-in tools: web search, browser, file search
- Skills from files, URLs, or inline markdown
- Automations and scheduled runs
- Self-learning and dreaming
- Persistent goals (Ralph Wiggum loop)
- New harness features without a rewrite
Herm: the agent and the infrastructure in one API call, built on the Hermes open-source harness.
Book a demo
Frequently Asked Questions
Everything you need to know about Herm — the API, the pricing, and what you can build.
Contact Support
What is Herm?
Herm is an API for deploying customer-facing AI agents built on the open-source Hermes harness. Bring a system prompt, skills, tools, and connectors, and a single API call gives you a dedicated agent for each of your customers, plus an endpoint to chat with it over SSE. We run the containers, the harness, and everything in between.
What is Hermes?
Hermes is the open-source personal agent from Nous Research with 185k+ GitHub stars. It ships with session management, observational memory, self-learning, skills, scheduled automations, isolated subagents, real sandboxing, a persistent filesystem, and full browser and web control. It's the same harness powering products like Higgsfield Supercomputer.
Herm lets you create Hermes instances programmatically — the agent and the infrastructure in a single API call — so you focus on the engineering that's unique to your product.
Do I need to set up any servers or infrastructure?
No. One POST /v1/deployments call provisions an isolated, container-sandboxed AI agent with a persistent filesystem, session management, and credential management. There's no harness to build, no Docker to wrangle, and no EC2 instance to babysit.
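As a rough illustration of that single call, here is a minimal Python sketch that builds a POST /v1/deployments request without sending it. Only the path comes from this page. The base URL, the Bearer-token header, and every JSON field name (`system_prompt`, `skills`, `mcp_servers`, `connector_secrets`) are assumptions made for the example, so check the docs for the real schema.

```python
import json
import urllib.request

# Hypothetical base URL; this page only names the path POST /v1/deployments.
BASE_URL = "https://api.example.com"


def build_deployment_request(api_key, system_prompt, skills=(), mcp_servers=(), secrets=None):
    """Build (but do not send) a POST /v1/deployments request."""
    # Every field name here is an illustrative assumption, not the documented schema.
    payload = {
        "system_prompt": system_prompt,
        "skills": list(skills),
        "mcp_servers": list(mcp_servers),
        "connector_secrets": secrets or {},
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/deployments",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Bearer auth is assumed; the page does not describe authentication.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it. The shape of the response is not described here beyond a deployment ID and an SSE events endpoint, so the sketch stops before parsing it.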
How do I integrate it with my product?
Bring your system prompt, your MCP servers, your skills (from files, URLs, or inline markdown), and your connector secrets. The API returns a deployment ID and an SSE events endpoint you can wire straight into your chat UI. Your customers get memory, automations, self-learning, and image & video input without you building any of it.
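To give a sense of what wiring the SSE events endpoint into a chat UI involves, the sketch below parses a Server-Sent Events stream into (event, data) pairs following the standard SSE line format. The event name `token` in the usage example is hypothetical; this page does not list the event types Herm emits.

```python
def parse_sse(lines):
    """Yield (event, data) tuples from an iterable of decoded SSE lines."""
    event, data = "message", []  # "message" is the SSE default event type
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event; skip events with no data.
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        else:
            field, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]  # strip the single optional leading space
            if field == "event":
                event = value
            elif field == "data":
                data.append(value)


# Usage with a hand-written stream ("token" is a made-up event name):
sample = ["event: token\n", "data: Hel\n", "\n", ": ping\n", "data: lo\n", "\n"]
events = list(parse_sse(sample))
```

In a real client the lines would come from the streaming HTTP response body, decoded as UTF-8, and each pair would be dispatched to the UI as it arrives.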
How is this different from LangChain Managed Deep Agents or Claude Managed Agents?
All three give you an agent and infrastructure in one API call. Herm additionally exposes automations, persistent goals (the Ralph Wiggum loop), steering, dreaming, and image & video input — with no provider lock-in, so you choose the model.
What can my customers' agents do?
Anything Hermes can do: browse the web, search, manage files, run your MCP tools, follow your skills, and schedule recurring work. A customer can say “every week, look at our top-performing video from last week and make five variations” and the agent handles it in the background.
What happens to my customers' files between sessions?
Every deployment has a persistent filesystem. Files, memory, skills, and automations survive across sessions, so the agent picks up exactly where it left off — even days later.
Is my customers' data private?
Yes. Every customer's agent runs in its own isolated container, scoped to that deployment. Secrets are managed through credential references rather than raw tokens, and Herm does not train on your data.
Is it free to try?
Yes. Every Herm account includes enough credits to deploy an agent and explore — and if you have a customer-facing chat product, we're happy to build your agent for you. Reach out at rajit@prismvideos.com or book a demo.
Ship your agent today
Bring a system prompt, skills, tools, and connectors — get a dedicated AI agent for every customer, with an endpoint to chat over SSE. Herm runs the infra.