superagent

Name: superagent-ai/superagent
Author: superagent-ai

Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers.

Subagents6.6k estrellas959 forks● TypeScriptMITActualizado 2mo ago

Nota editorial

Superagent SDK is an open-source safety layer for AI applications, offering four core functions: Guard (runtime detection and blocking of prompt injections and unsafe tool calls), Redact (automatic removal of PII, PHI, and secrets from text, replacing items like email addresses and SSNs with labeled placeholders), Scan (analysis of GitHub repositories for agent-targeted attacks such as repo poisoning), and Test (upcoming red-team scenario runner for production agents). It integrates with Claude specifically through an MCP server compatible with both Claude Code and Claude Desktop, while TypeScript and Python SDKs allow direct embedding into any application. The library works across Anthropic, OpenAI, Google, and other model providers. A notable self-hosting option exists via three open-weight Guard models ranging from 0.6B to 4B parameters, available in GGUF format for CPU inference, allowing organizations to run threat detection on their own infrastructure without sending data to external APIs. Security engineers and compliance teams building Claude-powered products are the primary audience.

ClaudeWave Trust Score

100/100

✓ Verified

Passed

✓Open-source license (MIT)
✓Recently active
✓Healthy fork ratio
✓Clear description
✓Topics declared
✓Mature repo (>1y old)

Last scanned: 6/11/2026

Install as a Claude Code subagent

Method: Clone

Terminal

git clone https://github.com/superagent-ai/superagent && cp superagent/*.md ~/.claude/agents/

1. Clone the repository and copy the agent .md definitions into ~/.claude/agents (or .claude/agents inside a project).

2. Start a new Claude Code session to load the agents.

3. Delegate work to them with the Task/Agent tool or by name.

Casos de uso

AI / ML Security Dev Tools

Sobre el repo

Resumen de Subagents

<p align="center">
  <img src="logo.png" width="80" alt="Superagent" />
</p>

<h1 align="center">Superagent SDK</h1>

<p align="center">
  <strong>Make your AI apps safe.</strong>
</p>

<p align="center">
  <a href="https://superagent.sh">Website</a> ·
  <a href="https://docs.superagent.sh">Docs</a> ·
  <a href="https://discord.gg/spZ7MnqFT4">Discord</a> ·
  <a href="https://huggingface.co/superagent-ai">HuggingFace</a>
</p>

<p align="center">
  <img src="https://img.shields.io/badge/Y%20Combinator-Backed-orange" alt="Y Combinator" />
  <img src="https://img.shields.io/github/stars/superagent-ai/superagent?style=social" alt="GitHub stars" />
  <img src="https://img.shields.io/badge/license-MIT-blue" alt="MIT License" />
</p>

---

An open-source SDK for AI agent safety. Block prompt injections, redact PII and secrets, scan repositories for threats, and run red team scenarios against your agent.

## Features

### Guard

Detect and block prompt injections, malicious instructions, and unsafe tool calls at runtime.

**TypeScript:**

```typescript
import { createClient } from "safety-agent";

const client = createClient();

const result = await client.guard({
  input: userMessage
});

if (result.classification === "block") {
  console.log("Blocked:", result.violation_types);
}
```

**Python:**

```python
from safety_agent import create_client

client = create_client()

result = await client.guard(input=user_message)

if result.classification == "block":
    print("Blocked:", result.violation_types)
```

### Redact

Remove PII, PHI, and secrets from text automatically.

**TypeScript:**

```typescript
const result = await client.redact({
  input: "My email is john@example.com and SSN is 123-45-6789",
  model: "openai/gpt-4o-mini"
});

console.log(result.redacted);
// "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```

**Python:**

```python
result = await client.redact(
    input="My email is john@example.com and SSN is 123-45-6789",
    model="openai/gpt-4o-mini"
)

print(result.redacted)
# "My email is <EMAIL_REDACTED> and SSN is <SSN_REDACTED>"
```

### Scan

Analyze repositories for AI agent-targeted attacks such as repo poisoning and malicious instructions.

**TypeScript:**

```typescript
const result = await client.scan({
  repo: "https://github.com/user/repo"
});

console.log(result.result);  // Security report
console.log(`Cost: $${result.usage.cost.toFixed(4)}`);
```

**Python:**

```python
result = await client.scan(repo="https://github.com/user/repo")

print(result.result)  # Security report
print(f"Cost: ${result.usage.cost:.4f}")
```

### Test

Run red team scenarios against your production agent. *(Coming soon)*

```typescript
const result = await client.test({
  endpoint: "https://your-agent.com/chat",
  scenarios: ["prompt_injection", "data_exfiltration"]
});

console.log(result.findings);  // Vulnerabilities discovered
```

## Get Started

Sign up at [superagent.sh](https://superagent.sh) to get your API key.

**TypeScript:**

```bash
npm install safety-agent
```

**Python:**

```bash
uv add safety-agent
```

**Set your API key:**

```bash
export SUPERAGENT_API_KEY=your-key
```

## Integration Options

| Option | Description | Link |
|--------|-------------|------|
| **TypeScript SDK** | Embed guard, redact, and scan directly in your app | [sdk/typescript](sdk/typescript/README.md) |
| **Python SDK** | Embed guard, redact, and scan directly in Python apps | [sdk/python](sdk/python/README.md) |
| **CLI** | Command-line tool for testing and automation | [cli](cli/README.md) |
| **MCP Server** | Use with Claude Code and Claude Desktop | [mcp](mcp/README.md) |

## Why Superagent SDK?

- **Works with any model** — OpenAI, Anthropic, Google, Groq, Bedrock, and more
- **Open-weight models** — Run Guard on your infrastructure with 50-100ms latency
- **Low latency** — Optimized for runtime use
- **Open source** — MIT license with full transparency

## Open-Weight Models

Run Guard on your own infrastructure. No API calls, no data leaving your environment.

| Model | Parameters | Use Case |
|-------|------------|----------|
| [superagent-guard-0.6b](https://huggingface.co/superagent-ai/superagent-guard-0.6b) | 0.6B | Fast inference, edge deployment |
| [superagent-guard-1.7b](https://huggingface.co/superagent-ai/superagent-guard-1.7b) | 1.7B | Balanced speed and accuracy |
| [superagent-guard-4b](https://huggingface.co/superagent-ai/superagent-guard-4b) | 4B | Maximum accuracy |

GGUF versions for CPU: [0.6b-gguf](https://huggingface.co/superagent-ai/superagent-guard-0.6b-gguf) · [1.7b-gguf](https://huggingface.co/superagent-ai/superagent-guard-1.7b-gguf) · [4b-gguf](https://huggingface.co/superagent-ai/superagent-guard-4b-gguf)

## Resources

- [Documentation](https://docs.superagent.sh)
- [Discord Community](https://discord.gg/spZ7MnqFT4)
- [HuggingFace Models](https://huggingface.co/superagent-ai)
- [Twitter/X](https://x.com/superagent_ai)

## License

MIT

Topics

aianthropicguardrailsllmopenaiprompt-injectionsecurity

Preguntas frecuentes

Lo que la gente pregunta sobre superagent

¿Qué es superagent-ai/superagent?

superagent-ai/superagent es subagents para el ecosistema de Claude AI. Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers. Tiene 6.6k estrellas en GitHub y se actualizó por última vez 2mo ago.

¿Cómo se instala superagent?

Puedes instalar superagent clonando el repositorio (https://github.com/superagent-ai/superagent) o siguiendo las instrucciones del README en GitHub. ClaudeWave también te ofrece bloques de instalación rápida en esta misma página.

¿Es seguro usar superagent-ai/superagent?

Nuestro agente de seguridad ha analizado superagent-ai/superagent y le ha asignado un Trust Score de 100/100 (tier: Verified). Revisa el desglose completo de comprobaciones superadas y flags en esta página.

¿Quién mantiene superagent-ai/superagent?

superagent-ai/superagent es mantenido por superagent-ai. La última actividad registrada en GitHub es de 2mo ago, con 9 issues abiertos.

¿Hay alternativas a superagent?

Sí. En ClaudeWave puedes explorar subagents similares en /categories/agents, ordenados por popularidad o actividad reciente.

Deploy en 1 click

Despliega superagent en tu cloud

Lleva este repo a producción en minutos. Cada plataforma genera su propio entorno con variables de entorno editables.

Vercel Railway Render

Badge embebible

¿Mantienes este repo? Añade un badge a tu README

Pega el badge en tu README de GitHub para mostrar que está auditado por ClaudeWave. Cada badge enlaza de vuelta a esta página y muestra el Trust Score actual.

Markdown (README)

[![Featured on ClaudeWave](https://claudewave.com/api/badge/superagent-ai-superagent)](https://claudewave.com/repo/superagent-ai-superagent)

HTML

<a href="https://claudewave.com/repo/superagent-ai-superagent"><img src="https://claudewave.com/api/badge/superagent-ai-superagent" alt="Featured on ClaudeWave: superagent-ai/superagent" width="320" height="64" /></a>

Relacionados

Más Subagents

Alternativas a superagent

affaan-m

ECC

yesterday

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

214.4k32.9kJavaScript

Subagentsai-agentsanthropicInstall

NousResearch

hermes-agent

today

The agent that grows with you

192.1k33.5kPython

Subagentsaiai-agentInstall

Snailclimb

JavaGuide

today

Java 面试 & 后端通用面试指南，覆盖计算机基础、数据库、分布式、高并发、系统设计与 AI 应用开发

156.3k46.1kJavaScript

SubagentsagentaiInstall

langgenius

dify

today

Production-ready platform for agentic workflow development.

145k22.8kTypeScript

Subagentsagentagentic-aiInstall

langchain-ai

langchain

today

The agent engineering platform.

139.2k23.1kPython

SubagentsagentsaiInstall

lobehub

today

🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire AI team.

78.6k15.4kTypeScript

Subagentsagentagent-collaborationInstall