superagent

Name: superagent-ai/superagent
Author: superagent-ai

Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers.

Subagents6.7k stars965 forks● TypeScriptMITUpdated 3mo ago

Editor's note

Superagent SDK is an open-source safety layer for AI applications, offering four core functions: Guard (runtime detection and blocking of prompt injections and unsafe tool calls), Redact (automatic removal of PII, PHI, and secrets from text, replacing items like email addresses and SSNs with labeled placeholders), Scan (analysis of GitHub repositories for agent-targeted attacks such as repo poisoning), and Test (upcoming red-team scenario runner for production agents). It integrates with Claude specifically through an MCP server compatible with both Claude Code and Claude Desktop, while TypeScript and Python SDKs allow direct embedding into any application. The library works across Anthropic, OpenAI, Google, and other model providers. A notable self-hosting option exists via three open-weight Guard models ranging from 0.6B to 4B parameters, available in GGUF format for CPU inference, allowing organizations to run threat detection on their own infrastructure without sending data to external APIs. Security engineers and compliance teams building Claude-powered products are the primary audience.

ClaudeWave Trust Score

100/100

✓ Verified

Passed

✓Open-source license (MIT)
✓Recently active
✓Healthy fork ratio
✓Clear description
✓Topics declared
✓Mature repo (>1y old)

Last scanned: 6/11/2026

Install as a Claude Code subagent

Method: Clone

Terminal

git clone https://github.com/superagent-ai/superagent && cp superagent/*.md ~/.claude/agents/

1. Clone the repository and copy the agent .md definitions into ~/.claude/agents (or .claude/agents inside a project).

2. Start a new Claude Code session to load the agents.

3. Delegate work to them with the Task/Agent tool or by name.

Use cases

AI / ML Security Dev Tools

About

Subagents overview

<p align="center">
  <img src="logo.png" width="80" alt="Superagent" />
</p>

<h1 align="center">Superagent SDK</h1>

<p align="center">
  <strong>Make your AI apps safe.</strong>
</p>

<p align="center">
  <a href="https://superagent.sh">Website</a> ·
  <a href="https://docs.superagent.sh">Docs</a> ·
  <a href="https://discord.gg/spZ7MnqFT4">Discord</a> ·
  <a href="https://huggingface.co/superagent-ai">HuggingFace</a>
</p>

<p align="center">
  <img src="https://img.shields.io/badge/Y%20Combinator-Backed-orange" alt="Y Combinator" />
  <img src="https://img.shields.io/github/stars/superagent-ai/superagent?style=social" alt="GitHub stars" />
  <img src="https://img.shields.io/badge/license-MIT-blue" alt="MIT License" />
</p>

---

An open-source SDK for AI agent safety. Block prompt injections, redact PII and secrets, scan repositories for threats, and run red team scenarios against your agent.

## Features

### Guard

Detect and block prompt injections, malicious instructions, and unsafe tool calls at runtime.

**TypeScript:**

```typescript
import { createClient } from "safety-agent";

const client = createClient();

const result = await client.guard({
  input: userMessage
});

if (result.classification === "block") {
  console.log("Blocked:", result.violation_types);
}
```

**Python:**

```python
from safety_agent import create_client

client = create_client()

result = await client.guard(input=user_message)

if result.classification == "block":
    print("Blocked:", result.violation_types)
```

### Redact

Remove PII, PHI, and secrets from text automatically.

**TypeScript:**

```typescript
const result = await client.redact({
  input: "My email is john@example.com and SSN is 123-45-6789",
  model: "openai/gpt-4o-mini"
});

console.log(result.redacted);
// "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```

**Python:**

```python
result = await client.redact(
    input="My email is john@example.com and SSN is 123-45-6789",
    model="openai/gpt-4o-mini"
)

print(result.redacted)
# "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```

### Scan

Analyze repositories for AI agent-targeted attacks such as repo poisoning and malicious instructions.

**TypeScript:**

```typescript
const result = await client.scan({
  repo: "https://github.com/user/repo"
});

console.log(result.result);  // Security report
console.log(`Cost: $${result.usage.cost.toFixed(4)}`);
```

**Python:**

```python
result = await client.scan(repo="https://github.com/user/repo")

print(result.result)  # Security report
print(f"Cost: ${result.usage.cost:.4f}")
```

### Test

Run red team scenarios against your production agent. *(Coming soon)*

```typescript
const result = await client.test({
  endpoint: "https://your-agent.com/chat",
  scenarios: ["prompt_injection", "data_exfiltration"]
});

console.log(result.findings);  // Vulnerabilities discovered
```

## Get Started

Sign up at [superagent.sh](https://superagent.sh) to get your API key.

**TypeScript:**

```bash
npm install safety-agent
```

**Python:**

```bash
uv add safety-agent
```

**Set your API key:**

```bash
export SUPERAGENT_API_KEY=your-key
```

## Integration Options

| Option | Description | Link |
|--------|-------------|------|
| **TypeScript SDK** | Embed guard, redact, and scan directly in your app | [sdk/typescript](sdk/typescript/README.md) |
| **Python SDK** | Embed guard, redact, and scan directly in Python apps | [sdk/python](sdk/python/README.md) |
| **CLI** | Command-line tool for testing and automation | [cli](cli/README.md) |
| **MCP Server** | Use with Claude Code and Claude Desktop | [mcp](mcp/README.md) |

## Why Superagent SDK?

- **Works with any model** — OpenAI, Anthropic, Google, Groq, Bedrock, and more
- **Open-weight models** — Run Guard on your infrastructure with 50-100ms latency
- **Low latency** — Optimized for runtime use
- **Open source** — MIT license with full transparency

## Open-Weight Models

Run Guard on your own infrastructure. No API calls, no data leaving your environment.

| Model | Parameters | Use Case |
|-------|------------|----------|
| [superagent-guard-0.6b](https://huggingface.co/superagent-ai/superagent-guard-0.6b) | 0.6B | Fast inference, edge deployment |
| [superagent-guard-1.7b](https://huggingface.co/superagent-ai/superagent-guard-1.7b) | 1.7B | Balanced speed and accuracy |
| [superagent-guard-4b](https://huggingface.co/superagent-ai/superagent-guard-4b) | 4B | Maximum accuracy |

GGUF versions for CPU: [0.6b-gguf](https://huggingface.co/superagent-ai/superagent-guard-0.6b-gguf) · [1.7b-gguf](https://huggingface.co/superagent-ai/superagent-guard-1.7b-gguf) · [4b-gguf](https://huggingface.co/superagent-ai/superagent-guard-4b-gguf)

## Resources

- [Documentation](https://docs.superagent.sh)
- [Discord Community](https://discord.gg/spZ7MnqFT4)
- [HuggingFace Models](https://huggingface.co/superagent-ai)
- [Twitter/X](https://x.com/superagent_ai)

## License

MIT

Topics

aianthropicguardrailsllmopenaiprompt-injectionsecurity

Frequently asked

What people ask about superagent

What is superagent-ai/superagent?

superagent-ai/superagent is subagents for the Claude AI ecosystem. Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers. It has 6.7k GitHub stars and was last updated 3mo ago.

How do I install superagent?

You can install superagent by cloning the repository (https://github.com/superagent-ai/superagent) or following the README instructions on GitHub. ClaudeWave also provides quick install blocks on this page.

Is superagent-ai/superagent safe to use?

Our security agent has analyzed superagent-ai/superagent and assigned a Trust Score of 100/100 (tier: Verified). See the full breakdown of passed checks and flags on this page.

Who maintains superagent-ai/superagent?

superagent-ai/superagent is maintained by superagent-ai. The last recorded GitHub activity is from 3mo ago, with 12 open issues.

Are there alternatives to superagent?

Yes. On ClaudeWave you can browse similar subagents at /categories/agents, sorted by popularity or recent activity.

1-click deploy

Deploy superagent to your cloud

Ship this repo to production in minutes. Each platform spins up its own environment with editable env vars.

Vercel Railway Render

Embeddable badge

Maintain this repo? Add a badge to your README

Drop the badge into your GitHub README to show it's tracked on ClaudeWave. Each badge links back to this page and reflects the live Trust Score.

Markdown (README)

[![Featured on ClaudeWave](https://claudewave.com/api/badge/superagent-ai-superagent)](https://claudewave.com/repo/superagent-ai-superagent)

HTML

<a href="https://claudewave.com/repo/superagent-ai-superagent"><img src="https://claudewave.com/api/badge/superagent-ai-superagent" alt="Featured on ClaudeWave: superagent-ai/superagent" width="320" height="64" /></a>

More Subagents

superagent alternatives

affaan-m

ECC

today

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

234.2k35.7kJavaScript

Subagentsai-agentsanthropicInstall

NousResearch

hermes-agent

today

The agent that grows with you

221.5k42.3kPython

Subagentsaiai-agentInstall

Snailclimb

JavaGuide

yesterday

Java 面试 & 后端通用面试指南，覆盖计算机基础、数据库、分布式、高并发、系统设计与 AI 应用开发

157.3k46.2kJavaScript

SubagentsagentaiInstall

langgenius

dify

today

Build Agentic workflows, RAG pipelines, with rich AI model and tool support on one collaborative workspace. Deploy on cloud, VPC, or self-hosted, so teams move from prototype to production without rebuilding the stack.

150.5k23.7kTypeScript

Subagentsagentagentic-aiInstall

langchain-ai

langchain

today

The agent engineering platform.

142.7k23.8kPython

SubagentsagentsaiInstall

Graphify-Labs

graphify

today

Turn any codebase, with its docs, SQL schemas, configs, and PDFs, into a queryable knowledge graph. A /graphify skill for Claude Code, Cursor, Codex, and Gemini CLI: local deterministic AST parsing, every edge explained, no vector store.

97.2k9.4kPython

Subagentsai-agentsantigravityInstall