Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers.
Subagents · 6.5k stars · 960 forks · TypeScript · MIT · Updated 3d ago
ClaudeWave Trust Score
100/100
Passed
- ✓ Open-source license (MIT)
- ✓ Actively maintained (<30d)
- ✓ Healthy fork ratio
- ✓ Clear description
- ✓ Topics declared
- ✓ Mature repo (>1y old)
Last scanned: 4/14/2026
Install in Claude Desktop
Method detected: Manual
{
  "mcpServers": {
    "superagent": {
      "command": "node",
      "args": ["/path/to/superagent/dist/index.js"],
      "env": {
        "SUPERAGENT_API_KEY": "<superagent_api_key>"
      }
    }
  }
}
1. Copy the snippet above.
2. Paste into ~/Library/Application Support/Claude/claude_desktop_config.json (Mac) or %APPDATA%\Claude\claude_desktop_config.json (Windows).
3. Replace any <placeholder> values with your API keys or paths.
4. Restart Claude Desktop. The MCP server appears automatically.
💡 Clone https://github.com/superagent-ai/superagent and follow its README for install instructions.
Detected environment variables
SUPERAGENT_API_KEY
Use cases
🧠 AI / ML · 🔐 Security · 🛠️ Dev Tools
About
Subagents overview
<p align="center">
<img src="logo.png" width="80" alt="Superagent" />
</p>
<h1 align="center">Superagent SDK</h1>
<p align="center">
<strong>Make your AI apps safe.</strong>
</p>
<p align="center">
<a href="https://superagent.sh">Website</a> ·
<a href="https://docs.superagent.sh">Docs</a> ·
<a href="https://discord.gg/spZ7MnqFT4">Discord</a> ·
<a href="https://huggingface.co/superagent-ai">HuggingFace</a>
</p>
<p align="center">
<img src="https://img.shields.io/badge/Y%20Combinator-Backed-orange" alt="Y Combinator" />
<img src="https://img.shields.io/github/stars/superagent-ai/superagent?style=social" alt="GitHub stars" />
<img src="https://img.shields.io/badge/license-MIT-blue" alt="MIT License" />
</p>
---
An open-source SDK for AI agent safety. Block prompt injections, redact PII and secrets, scan repositories for threats, and run red team scenarios against your agent.
## Features
### Guard
Detect and block prompt injections, malicious instructions, and unsafe tool calls at runtime.
**TypeScript:**
```typescript
import { createClient } from "safety-agent";
const client = createClient();
const result = await client.guard({
  input: userMessage
});
if (result.classification === "block") {
  console.log("Blocked:", result.violation_types);
}
```
**Python:**
```python
from safety_agent import create_client

client = create_client()

# `await` needs an async context, e.g. inside `async def main()` run via asyncio.run(main()).
result = await client.guard(input=user_message)
if result.classification == "block":
    print("Blocked:", result.violation_types)
```
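The guard check above is usually placed in front of whatever handles the user's message. The following is a minimal sketch of that gating pattern, not part of the SDK: `GuardFn`, `BlockedInputError`, `assertAllowed`, and `withGuard` are hypothetical names, and the result shape is assumed to contain only the two fields shown in the examples (`classification` and `violation_types`).

```typescript
// Assumed result shape, limited to the two fields used in the examples above.
interface GuardResult {
  classification: string;
  violation_types?: string[];
}

// Hypothetical signature for anything that classifies an input,
// e.g. (input) => client.guard({ input }).
type GuardFn = (input: string) => Promise<GuardResult>;

// Hypothetical error type so callers can tell a block apart from other failures.
class BlockedInputError extends Error {
  readonly violations: string[];

  constructor(violations: string[]) {
    super(`Input blocked: ${violations.join(", ") || "unspecified"}`);
    this.name = "BlockedInputError";
    this.violations = violations;
  }
}

// Throws when the guard result is "block"; otherwise returns normally.
function assertAllowed(result: GuardResult): void {
  if (result.classification === "block") {
    throw new BlockedInputError(result.violation_types ?? []);
  }
}

// Runs the guard first and only calls the handler when the input is allowed.
async function withGuard<T>(
  guard: GuardFn,
  input: string,
  handler: (input: string) => Promise<T>
): Promise<T> {
  assertAllowed(await guard(input));
  return handler(input);
}
```

Throwing a dedicated error keeps the blocking decision in one place, so a request handler can catch `BlockedInputError` and return a refusal without ever forwarding the input to the model.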
### Redact
Remove PII, PHI, and secrets from text automatically.
**TypeScript:**
```typescript
const result = await client.redact({
  input: "My email is john@example.com and SSN is 123-45-6789",
  model: "openai/gpt-4o-mini"
});
console.log(result.redacted);
// "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```
**Python:**
```python
result = await client.redact(
    input="My email is john@example.com and SSN is 123-45-6789",
    model="openai/gpt-4o-mini"
)
print(result.redacted)
# "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```
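To make the placeholder format concrete, here is a small local stand-in that produces the same output for the sample sentence. It is only an illustration of the input/output contract, for example for stubbing the call in unit tests. It is not how the SDK detects PII (the `model` parameter above suggests detection is model-based), and `redactLocally` and its two regexes are assumptions made for this sketch.

```typescript
// Illustrative patterns only; real PII detection needs far more than two regexes.
const PATTERNS: Array<[RegExp, string]> = [
  [/[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g, "<EMAIL_REDACTED>"],
  [/\b\d{3}-\d{2}-\d{4}\b/g, "<SSN_REDACTED>"],
];

// Replaces every match of each pattern with its placeholder, in order.
function redactLocally(text: string): string {
  return PATTERNS.reduce(
    (acc, [pattern, placeholder]) => acc.replace(pattern, placeholder),
    text
  );
}

console.log(redactLocally("My email is john@example.com and SSN is 123-45-6789"));
// "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```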
### Scan
Analyze repositories for AI agent-targeted attacks such as repo poisoning and malicious instructions.
**TypeScript:**
```typescript
const result = await client.scan({
  repo: "https://github.com/user/repo"
});
console.log(result.result); // Security report
console.log(`Cost: $${result.usage.cost.toFixed(4)}`);
```
**Python:**
```python
result = await client.scan(repo="https://github.com/user/repo")
print(result.result) # Security report
print(f"Cost: ${result.usage.cost:.4f}")
```
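Because each scan reports its own cost, scanning several repositories in a batch calls for a running total. The sketch below assumes only the two fields used in the examples above (`result` and `usage.cost`). `ScanResult`, `totalCost`, and `formatCost` are hypothetical helpers, not SDK exports.

```typescript
// Assumed result shape, limited to the fields used in the scan examples above.
interface ScanResult {
  result: string;
  usage: { cost: number };
}

// Sums the reported cost across a batch of scan results.
function totalCost(results: ScanResult[]): number {
  return results.reduce((sum, r) => sum + r.usage.cost, 0);
}

// Formats a cost the same way as the examples above: dollars, four decimals.
function formatCost(cost: number): string {
  return `$${cost.toFixed(4)}`;
}

// Made-up numbers for illustration only.
const batch: ScanResult[] = [
  { result: "report A", usage: { cost: 0.0123 } },
  { result: "report B", usage: { cost: 0.0045 } },
];
console.log(`Total: ${formatCost(totalCost(batch))}`); // Total: $0.0168
```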
### Test
Run red team scenarios against your production agent. *(Coming soon)*
```typescript
const result = await client.test({
  endpoint: "https://your-agent.com/chat",
  scenarios: ["prompt_injection", "data_exfiltration"]
});
console.log(result.findings); // Vulnerabilities discovered
```
## Get Started
Sign up at [superagent.sh](https://superagent.sh) to get your API key.
**TypeScript:**
```bash
npm install safety-agent
```
**Python:**
```bash
uv add safety-agent
```
**Set your API key:**
```bash
export SUPERAGENT_API_KEY=your-key
```
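The examples above call `createClient()` with no arguments, which suggests the SDK picks the key up from the environment. A startup check like the following surfaces a missing key early instead of at the first request. `requireApiKey` is a hypothetical helper and takes the environment as a parameter so it can be tested without touching the real one.

```typescript
// Returns the API key from the given environment map, or throws if it is
// missing or blank. In a Node app, pass `process.env`.
function requireApiKey(env: Record<string, string | undefined>): string {
  const key = env["SUPERAGENT_API_KEY"]?.trim();
  if (!key) {
    throw new Error("SUPERAGENT_API_KEY is not set");
  }
  return key;
}
```

Calling `requireApiKey(process.env)` once at startup is enough. The returned value does not need to be passed anywhere if the client reads the variable itself.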
## Integration Options
| Option | Description | Link |
|--------|-------------|------|
| **TypeScript SDK** | Embed guard, redact, and scan directly in your app | [sdk/typescript](sdk/typescript/README.md) |
| **Python SDK** | Embed guard, redact, and scan directly in Python apps | [sdk/python](sdk/python/README.md) |
| **CLI** | Command-line tool for testing and automation | [cli](cli/README.md) |
| **MCP Server** | Use with Claude Code and Claude Desktop | [mcp](mcp/README.md) |
## Why Superagent SDK?
- **Works with any model** — OpenAI, Anthropic, Google, Groq, Bedrock, and more
- **Open-weight models** — Run Guard on your infrastructure with 50-100ms latency
- **Low latency** — Optimized for runtime use
- **Open source** — MIT license with full transparency
## Open-Weight Models
Run Guard on your own infrastructure. No API calls, no data leaving your environment.
| Model | Parameters | Use Case |
|-------|------------|----------|
| [superagent-guard-0.6b](https://huggingface.co/superagent-ai/superagent-guard-0.6b) | 0.6B | Fast inference, edge deployment |
| [superagent-guard-1.7b](https://huggingface.co/superagent-ai/superagent-guard-1.7b) | 1.7B | Balanced speed and accuracy |
| [superagent-guard-4b](https://huggingface.co/superagent-ai/superagent-guard-4b) | 4B | Maximum accuracy |
GGUF versions for CPU: [0.6b-gguf](https://huggingface.co/superagent-ai/superagent-guard-0.6b-gguf) · [1.7b-gguf](https://huggingface.co/superagent-ai/superagent-guard-1.7b-gguf) · [4b-gguf](https://huggingface.co/superagent-ai/superagent-guard-4b-gguf)
## Resources
- [Documentation](https://docs.superagent.sh)
- [Discord Community](https://discord.gg/spZ7MnqFT4)
- [HuggingFace Models](https://huggingface.co/superagent-ai)
- [Twitter/X](https://x.com/superagent_ai)
## License
MIT
Topics
ai · anthropic · guardrails · llm · openai · prompt-injection · security