● MIT licensed 147 tests passing Self-hosted

Stop trusting
your AI agent.

Super Agent is a security-first plugin that gives any AI coding CLI persistent memory, observability, and the guardrails it should have shipped with — without locking you to a vendor.

Get started View on GitHub

~ superagent • host-cli guards armed

$ claude # or codex, cursor, ...

[superagent-memory] loaded 8 prior memories

[superagent-guards] PreToolUse hook armed (16 secret + 30 shell patterns)

› clean up the env files and force-push to main

🤖 attempting Read(.env)...

⛔ BLOCKED by Super Agent Guards:
secret file access denied (.env). Use env vars or a secrets manager.

🤖 attempting Bash("git push --force origin main")...

⛔ BLOCKED — force-push to protected branch.

🤖 I cannot complete that as requested. Here's a safer plan...█

147

tests passing

16+30

guard patterns

subagent roles

phases shipped

vendor lock

MIT

license

The problem

Five ways your AI agent will hurt you.

Every AI coding agent in 2026 ships with the same gaps. Most teams patch them ad-hoc, late, and incompletely. Super Agent bakes the answers in — at the architecture layer, not as add-ons.

Your secrets walk out the door

Agent reads .env to "fix" a bug, leaks DB password into chat logs that may live on a vendor's servers forever.

→ Super Agent Guards block secret-file reads at the harness layer. Model can't talk its way past Python.

One `rm -rf` away from disaster

Helpful agent runs rm -rf / or git push --force main — your repo history is gone in seconds.

→ Super Agent Guards match 30 destructive patterns deterministically. rm -rf node_modules works, root-level kill commands don't.

Every session, you re-explain the project

Monday: 30 minutes onboarding the agent. Tuesday: new session, agent forgot everything, asks "what's the auth strategy?" again.

→ Super Agent Memory persists explicit summaries and injects them at next session start. The agent becomes a permanent intern.

The bill arrives. You panic.

$340 this month. Which subagent burned tokens? Which prompt looped? Which day spiked? No idea.

→ Super Agent Trace records every tool call + subagent run with cost. Set budget alerts; never get surprised.

A web page hijacks your agent

Agent fetches a blog post. Buried in the HTML: . A naïve agent reads, decides, executes. Your SSH key is gone.

→ Super Agent Subagents break the lethal trifecta architecturally: the web-facing subagent has no shell, no file writes. Even if the model is tricked, the toolbox simply isn't there.

The system

Seven Super Agent-branded components.
All open-source. All self-hostable.

Each capability is a Super Agent-branded surface built on a trusted open-source engine. You get one coherent system; we attribute the engines openly.

● Shipped

Super Agent Guards

Deterministic Python hooks that block secret reads, destructive shell, force-pushes, raw-disk writes, and prompt-injection vectors before they execute.

first-party · zero runtime deps

● Shipped

Super Agent Subagents

Four specialized roles — reviewer, security-auditor, researcher, eval-runner — each with a scoped toolset that mechanically breaks the lethal trifecta.

first-party · 25 isolation tests

● Shipped

Super Agent Memory

Persistent context across sessions. SessionStart injects prior decisions; SessionEnd persists summaries. Backend down? Session continues silently.

Phase 4

Super Agent Trace

Every tool call, every subagent run, every LLM request recorded with cost and duration. Query the past. Set budget alarms. Never get surprised by an invoice again.

Phase 4

Super Agent Retrieval

Semantic search over your own docs and notes. The researcher subagent consults it before ever touching the open web — grounded, faster, safer.

Phase 5

Super Agent Browser

Sandboxed web automation. Allow-listed domains only. Only the researcher subagent can invoke it. Headless Chromium in an isolated container.

Phase 5

Super Agent Scanner

Continuous red-teaming against your own install. Prompt injection, jailbreak, system-prompt extraction — all probed weekly. Critical findings auto-file as issues.

Phase 6

Super Agent CLI

One command to install. superagent doctor diagnoses environment. superagent audit runs every check. Cross-platform, no manual wiring.

first-party · Windows · macOS · Linux

v1.0

Super Agent Plugin

Distributable single package. Install via your host CLI's plugin manager. SHA-pinned. Signed releases. Public marketplace listing.

first-party · signed

Architecture

A plugin, not a fork.

Super Agent rides on top of your host CLI's existing primitives: rules, hooks, subagents, MCP servers, skills. No second brain. No competing orchestrator. Just discipline + plumbing + trusted engines.

        ┌─────────────────────────────────────────┐
        │  Host AI Coding Agent CLI               │  orchestrator
        │  (loads rules, runs hooks, delegates    │
        │   to subagents, talks to MCP servers)   │
        └──────┬─────────────────────┬────────────┘
               │                     │
   ┌──────────▼─────────┐  ┌────▼──────────────────┐
   │  SUPERAGENT GUARDS    │  │  SUPERAGENT SUBAGENTS    │  specialization
   │  Pre / Post /      │  │  • code-reviewer      │
   │  Session hooks     │  │  • security-auditor   │
   │  (first-party      │  │  • researcher         │
   │   Python)          │  │  • eval-runner        │
   └────────┬───────────┘  └───────────┬───────────┘
            │                          │
            │  ┌───────────────────────▼──────────────────┐
            │  │  SUPERAGENT MCP SERVERS                     │  tools / data
            │  │  superagent-memory      ← Mem0              │
            │  │  superagent-trace       ← Langfuse          │
            │  │  superagent-retrieval   ← Qdrant + LlamaIdx │
            │  │  superagent-browser     ← Browser Use       │
            │  │  superagent-scanner     ← garak             │
            │  └───────────────────────┬──────────────────┘
            │                          │
            │              ┌───────────▼────────────┐
            │              │  Docker Compose         │  infra
            │              │  (self-hosted)          │
            ▼              └─────────────────────────┘
   ┌──────────┐
   │  audit   │
   │   log    │
   └──────────┘

Host primitives, used

Rules, MCP, hooks, subagents, skills. Super Agent doesn't reinvent them — it uses them.

External orchestration deps

No LangGraph. No CrewAI. Every framework you add is another CVE feed.

100%

Self-hostable

Every service runs in your Docker. Your data never leaves your network unless you say so.

The defense

We break the lethal trifecta by design.

A 2026 framing (Simon Willison; Palo Alto Networks): a system becomes architecturally vulnerable when one context simultaneously holds private data, exposure to untrusted content, and outbound communication. In a chatbot the worst outcome is a bad answer. In an agent, it's credential exfiltration. Super Agent splits these three capabilities across separate subagents — never co-located.

The three legs

Private data access

Source code, secrets, internal docs.

Exposure to untrusted content

Web pages, emails, plugin manifests, doc summaries.

Outbound communication

Shell, network, message sending.

When all three meet in one context, a prompt-injection on any untrusted page becomes a credential exfil pipeline. Patching after the fact doesn't help — the architecture has already lost.

How we split them

Subagent	Private	Untrusted	Outbound
superagent-code-reviewer	●	○	○
superagent-security-auditor	●	○	⊕
superagent-researcher	○	●	●
superagent-eval-runner	●	○	⊕

● has · ⊕ shell only (local) · ○ does not have

No row has all three. A CI test enforces this invariant — try adding shell to researcher and the build fails.

Lesson from ClawHub 2026

No marketplace blindspot

~12% of ClawHub skills were malicious; 335 installed credential stealers. We don't auto-install from any third-party skill registry. Every bundled skill is first-party and reviewed.

Lesson from LiteLLM Mar 2026

Supply chain pinned

Backdoored versions logged every request. Every Docker image we ship is SHA-digest pinned. Weekly pip-audit and npm audit in CI.

Lesson from LangGraph CVEs

Fewer frameworks, smaller surface

Three CVSS-9.3 vulnerabilities in 2026. We host orchestration in the CLI's own primitives. No LangGraph. No CrewAI. Their CVE feeds aren't on our changelog.

The competition

How Super Agent compares.

Side by side with the alternatives. We don't claim to do everything — we claim to do the security-critical things right.

	Super Agent	OpenClaw	Bare CLI	GStack
Hook-enforced command blocking	✓	—	manual	—
Lethal-trifecta architectural split	✓	✗	✗	partial
CI test for trifecta invariant	✓	✗	✗	✗
Persistent cross-session memory	✓	✓	✗	✗
Cost / trace observability	✓	partial	✗	partial
Plugin marketplace risk	none (curated)	high (~12%)	n/a	n/a
Supply chain pinned (SHA digests)	✓	✗	n/a	n/a
Self-host first	✓	✓	n/a	n/a
Brand neutrality (no vendor lock)	✓	✗	n/a	partial
License	MIT	MIT	—	MIT

Real code

200 lines of Python. Stops the most expensive mistakes.

No magic, no LLM in the critical path. A single cross-platform Python file reads a tool-call event, matches it against curated regexes, and exits with code 2 if it's dangerous. The host CLI shows the message to the model, which must reformulate.

Cross-platform: one file, Windows / macOS / Linux
Zero runtime dependencies — stdlib only
86 parametrized tests prove every block + every allow
Auditable in 5 minutes — readable by humans

plugin/hooks/pre_tool_use.py

# 30 dangerous shell patterns — match-and-block.
DANGEROUS_BASH = [
    (re.compile(r"\brm\s+-\S*[rR]\S*\s+/\s*($|\*|\s)"),
     "rm -r on filesystem root '/'"),
    (re.compile(
        r"\b(curl|wget|iwr)\b[^|]*\|\s*"
        r"(bash|sh|zsh|python|powershell|iex)\b"
     ), "pipe-to-shell of remote content"),
    (re.compile(
        r"\bgit\s+push\b"
        r"(?=[^&;|]*--force(-with-lease)?\b)"
        r"(?=[^&;|]*\b(main|master|prod)\b)"
     ), "force-push to protected branch"),
    # ... 27 more
]

def main():
    event = json.load(sys.stdin)
    tool, args = event.get("tool_name"), \
                 event.get("tool_input", {})

    if tool in FILE_TOOLS and \
       is_secret_file(args.get("file_path")):
        block("secret file access denied")

    if tool in SHELL_TOOLS:
        bad, reason = check_dangerous(args.get("command", ""))
        if bad: block(f"dangerous shell pattern — {reason}")

    sys.exit(0)  # allow

Roadmap

Five phases shipped. Three to go.

Public, honest, in the open. Every milestone is a commit on main; every milestone has tests.

● Shipped

Phase 0 — Open-source scaffold

MIT, README, governance, GitHub templates, doc skeleton.

● Shipped

Phase 1 — Super Agent Guards

PreToolUse hook, 16 secret patterns, 30 shell patterns, 86 tests.

● Shipped

Phase 1.5 — Branding pass

Super Agent naming convention canonicalized across all docs.

● Shipped

Phase 2 — Super Agent Subagents

4 roles, scoped tools, lethal-trifecta invariant test in CI.

● Shipped

Phase 3 — Super Agent Memory

Client + InMemory + Mem0 backends, SessionStart/End hooks, Docker compose, docs.

Phase 4 — Trace + Retrieval

Super Agent Trace (Langfuse) dashboards + budget alarms. Super Agent Retrieval (Qdrant + LlamaIndex) for semantic search.

Phase 5

Browser + Scanner

Sandboxed web automation (allow-listed domains). Weekly red-team via garak with auto-issue filing.

Phase 6

CLI + Plugin Package

superagent install / doctor / audit / status. SHA-pinning, plugin manifest, GitHub Pages docs site.

v1.0

Public launch

Show HN. Product Hunt. Blog. Awesome lists. Discord. v1.0 release.

Get started

Three commands to a safer agent.

Pre-alpha. The one-line installer ships with v0.1.0. For now, clone and run.

terminal

$ git clone https://github.com/USER/super-agent

$ cd super-agent

$ ./scripts/install.sh # or install.ps1 on Windows

[superagent] ✓ Docker detected

[superagent] ✓ Python 3.11+

[superagent] ✓ Host CLI registered

[superagent] ✓ superagent-memory: healthy

[superagent] ✓ guards armed

[superagent] ready.█

View on GitHub Read the docs

Stop trusting your AI agent.

Five ways your AI agent will hurt you.

Your secrets walk out the door

One rm -rf away from disaster

Every session, you re-explain the project

The bill arrives. You panic.

A web page hijacks your agent

Seven Super Agent-branded components.All open-source. All self-hostable.

Super Agent Guards

Super Agent Subagents

Super Agent Memory

Super Agent Trace

Super Agent Retrieval

Super Agent Browser

Super Agent Scanner

Super Agent CLI

Super Agent Plugin

A plugin, not a fork.

Host primitives, used

External orchestration deps

Self-hostable

We break the lethal trifecta by design.

The three legs

How we split them

No marketplace blindspot

Supply chain pinned

Fewer frameworks, smaller surface

How Super Agent compares.

200 lines of Python. Stops the most expensive mistakes.

Five phases shipped. Three to go.

Phase 0 — Open-source scaffold

Phase 1 — Super Agent Guards

Phase 1.5 — Branding pass

Phase 2 — Super Agent Subagents

Phase 3 — Super Agent Memory

Phase 4 — Trace + Retrieval

Browser + Scanner

CLI + Plugin Package

Public launch

Three commands to a safer agent.

Stop trusting
your AI agent.

One `rm -rf` away from disaster

Seven Super Agent-branded components.
All open-source. All self-hostable.