Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers.
Superagent SDK is an open-source safety layer for AI applications, offering four core functions: Guard (runtime detection and blocking of prompt injections and unsafe tool calls), Redact (automatic removal of PII, PHI, and secrets from text, replacing items like email addresses and SSNs with labeled placeholders), Scan (analysis of GitHub repositories for agent-targeted attacks such as repo poisoning), and Test (upcoming red-team scenario runner for production agents). It integrates with Claude specifically through an MCP server compatible with both Claude Code and Claude Desktop, while TypeScript and Python SDKs allow direct embedding into any application. The library works across Anthropic, OpenAI, Google, and other model providers. A notable self-hosting option exists via three open-weight Guard models ranging from 0.6B to 4B parameters, available in GGUF format for CPU inference, allowing organizations to run threat detection on their own infrastructure without sending data to external APIs. Security engineers and compliance teams building Claude-powered products are the primary audience.
- ✓Open-source license (MIT)
- ✓Recently active
- ✓Healthy fork ratio
- ✓Clear description
- ✓Topics declared
- ✓Mature repo (>1y old)
git clone https://github.com/superagent-ai/superagent && cp superagent/*.md ~/.claude/agents/Resumen de Subagents
<p align="center">
<img src="logo.png" width="80" alt="Superagent" />
</p>
<h1 align="center">Superagent SDK</h1>
<p align="center">
<strong>Make your AI apps safe.</strong>
</p>
<p align="center">
<a href="https://superagent.sh">Website</a> ·
<a href="https://docs.superagent.sh">Docs</a> ·
<a href="https://discord.gg/spZ7MnqFT4">Discord</a> ·
<a href="https://huggingface.co/superagent-ai">HuggingFace</a>
</p>
<p align="center">
<img src="https://img.shields.io/badge/Y%20Combinator-Backed-orange" alt="Y Combinator" />
<img src="https://img.shields.io/github/stars/superagent-ai/superagent?style=social" alt="GitHub stars" />
<img src="https://img.shields.io/badge/license-MIT-blue" alt="MIT License" />
</p>
---
An open-source SDK for AI agent safety. Block prompt injections, redact PII and secrets, scan repositories for threats, and run red team scenarios against your agent.
## Features
### Guard
Detect and block prompt injections, malicious instructions, and unsafe tool calls at runtime.
**TypeScript:**
```typescript
import { createClient } from "safety-agent";
const client = createClient();
const result = await client.guard({
input: userMessage
});
if (result.classification === "block") {
console.log("Blocked:", result.violation_types);
}
```
**Python:**
```python
from safety_agent import create_client
client = create_client()
result = await client.guard(input=user_message)
if result.classification == "block":
print("Blocked:", result.violation_types)
```
### Redact
Remove PII, PHI, and secrets from text automatically.
**TypeScript:**
```typescript
const result = await client.redact({
input: "My email is john@example.com and SSN is 123-45-6789",
model: "openai/gpt-4o-mini"
});
console.log(result.redacted);
// "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```
**Python:**
```python
result = await client.redact(
input="My email is john@example.com and SSN is 123-45-6789",
model="openai/gpt-4o-mini"
)
print(result.redacted)
# "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```
### Scan
Analyze repositories for AI agent-targeted attacks such as repo poisoning and malicious instructions.
**TypeScript:**
```typescript
const result = await client.scan({
repo: "https://github.com/user/repo"
});
console.log(result.result); // Security report
console.log(`Cost: $${result.usage.cost.toFixed(4)}`);
```
**Python:**
```python
result = await client.scan(repo="https://github.com/user/repo")
print(result.result) # Security report
print(f"Cost: ${result.usage.cost:.4f}")
```
### Test
Run red team scenarios against your production agent. *(Coming soon)*
```typescript
const result = await client.test({
endpoint: "https://your-agent.com/chat",
scenarios: ["prompt_injection", "data_exfiltration"]
});
console.log(result.findings); // Vulnerabilities discovered
```
## Get Started
Sign up at [superagent.sh](https://superagent.sh) to get your API key.
**TypeScript:**
```bash
npm install safety-agent
```
**Python:**
```bash
uv add safety-agent
```
**Set your API key:**
```bash
export SUPERAGENT_API_KEY=your-key
```
## Integration Options
| Option | Description | Link |
|--------|-------------|------|
| **TypeScript SDK** | Embed guard, redact, and scan directly in your app | [sdk/typescript](sdk/typescript/README.md) |
| **Python SDK** | Embed guard, redact, and scan directly in Python apps | [sdk/python](sdk/python/README.md) |
| **CLI** | Command-line tool for testing and automation | [cli](cli/README.md) |
| **MCP Server** | Use with Claude Code and Claude Desktop | [mcp](mcp/README.md) |
## Why Superagent SDK?
- **Works with any model** — OpenAI, Anthropic, Google, Groq, Bedrock, and more
- **Open-weight models** — Run Guard on your infrastructure with 50-100ms latency
- **Low latency** — Optimized for runtime use
- **Open source** — MIT license with full transparency
## Open-Weight Models
Run Guard on your own infrastructure. No API calls, no data leaving your environment.
| Model | Parameters | Use Case |
|-------|------------|----------|
| [superagent-guard-0.6b](https://huggingface.co/superagent-ai/superagent-guard-0.6b) | 0.6B | Fast inference, edge deployment |
| [superagent-guard-1.7b](https://huggingface.co/superagent-ai/superagent-guard-1.7b) | 1.7B | Balanced speed and accuracy |
| [superagent-guard-4b](https://huggingface.co/superagent-ai/superagent-guard-4b) | 4B | Maximum accuracy |
GGUF versions for CPU: [0.6b-gguf](https://huggingface.co/superagent-ai/superagent-guard-0.6b-gguf) · [1.7b-gguf](https://huggingface.co/superagent-ai/superagent-guard-1.7b-gguf) · [4b-gguf](https://huggingface.co/superagent-ai/superagent-guard-4b-gguf)
## Resources
- [Documentation](https://docs.superagent.sh)
- [Discord Community](https://discord.gg/spZ7MnqFT4)
- [HuggingFace Models](https://huggingface.co/superagent-ai)
- [Twitter/X](https://x.com/superagent_ai)
## License
MIT
Lo que la gente pregunta sobre superagent
¿Qué es superagent-ai/superagent?
+
superagent-ai/superagent es subagents para el ecosistema de Claude AI. Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers. Tiene 6.6k estrellas en GitHub y se actualizó por última vez 2mo ago.
¿Cómo se instala superagent?
+
Puedes instalar superagent clonando el repositorio (https://github.com/superagent-ai/superagent) o siguiendo las instrucciones del README en GitHub. ClaudeWave también te ofrece bloques de instalación rápida en esta misma página.
¿Es seguro usar superagent-ai/superagent?
+
Nuestro agente de seguridad ha analizado superagent-ai/superagent y le ha asignado un Trust Score de 100/100 (tier: Verified). Revisa el desglose completo de comprobaciones superadas y flags en esta página.
¿Quién mantiene superagent-ai/superagent?
+
superagent-ai/superagent es mantenido por superagent-ai. La última actividad registrada en GitHub es de 2mo ago, con 9 issues abiertos.
¿Hay alternativas a superagent?
+
Sí. En ClaudeWave puedes explorar subagents similares en /categories/agents, ordenados por popularidad o actividad reciente.
Despliega superagent en tu cloud
Lleva este repo a producción en minutos. Cada plataforma genera su propio entorno con variables de entorno editables.
¿Mantienes este repo? Añade un badge a tu README
Pega el badge en tu README de GitHub para mostrar que está auditado por ClaudeWave. Cada badge enlaza de vuelta a esta página y muestra el Trust Score actual.
[](https://claudewave.com/repo/superagent-ai-superagent)<a href="https://claudewave.com/repo/superagent-ai-superagent"><img src="https://claudewave.com/api/badge/superagent-ai-superagent" alt="Featured on ClaudeWave: superagent-ai/superagent" width="320" height="64" /></a>Más Subagents
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
The agent that grows with you
Java 面试 & 后端通用面试指南,覆盖计算机基础、数据库、分布式、高并发、系统设计与 AI 应用开发
Production-ready platform for agentic workflow development.
The agent engineering platform.
🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire AI team.