MCP servers expose tools with no information about what they actually do at runtime. mcpsafetywarden sits between your agent and any MCP server, profiling tool behavior, blocking destructive calls, and running active security audits before you trust them in a workflow.
- ✓Actively maintained (<30d)
- ✓Clear description
- ✓Topics declared
- !No standard license detected
claude mcp add mcpsafetywarden -- python -m mcpsafetywarden{
"mcpServers": {
"mcpsafetywarden": {
"command": "python",
"args": ["-m", "mcpsafetywarden"]
}
}
}MCP Servers overview
<!-- mcp-name: io.github.gautamvarmadatla/mcpsafetywarden --> <p align="center"> <img src="assets/logo.png" alt="MCP Safety Warden" width="1080"/> </p> MCP safety warden is a proxy server that wraps any MCP server and adds behavioral profiling, security scanning, risk gating, and safe execution to its tools. [](https://pypi.org/project/mcpsafetywarden/) [](https://pypi.org/project/mcpsafetywarden/) [](https://pypi.org/project/mcpsafetywarden/) [](LICENSE) [](https://github.com/gautamvarmadatla/mcpsafetywarden/actions/workflows/ci.yml) [](https://github.com/gautamvarmadatla/mcpsafetywarden/actions/workflows/codeql.yml) [](https://registry.modelcontextprotocol.io/v0.1/servers/io.github.gautamvarmadatla%2Fmcpsafetywarden) [](https://hub.docker.com/r/gautamdatla1999/mcpsafetywarden) [](https://scorecard.dev/viewer/?uri=github.com/gautamvarmadatla/mcpsafetywarden) [](https://www.bestpractices.dev/projects/12730) [](https://github.com/gautamvarmadatla/mcpsafetywarden) ## Contents - [Overview](#overview) - [Prerequisites](#prerequisites) - [Installation](#installation) - [Configuration](#configuration) - [MCP Integration](#mcp-integration) - [CLI Reference](#cli-reference) - [Auxiliary Integrations](#auxiliary-security-tool-integrations) - [Development](#development) - [Testing](#testing) - [Further reading](#further-reading) > [!IMPORTANT] > MCP security is an active research area. Recent surveys catalog a lot of protocol-specific threat categories spanning tool poisoning, prompt injection, rug-pull attacks, supply chain compromise, credential exfiltration, and composition attacks across the full server lifecycle. See [Securing the MCP (OpenReview)](https://openreview.net/pdf?id=Aqn9Wdr2wN), [Landscape & Threats (arXiv)](https://arxiv.org/abs/2503.23278), [When MCP Servers Attack (arXiv)](https://arxiv.org/abs/2509.24272), and [MCP-38 Taxonomy (arXiv)](https://arxiv.org/abs/2603.18063). ## Overview Use as a proxy to add safety gating to any MCP server, or point it at a server you don't own and run a full security audit without making a single tool call. <p align="center"> <img src="assets/two_operating_modes.jpg" alt="Two operating modes" width="800"/> <br/> <em>Fig 1. Two operating modes: proxy and audit</em> </p> **Behavioral profiling**: Effect class, retry safety, destructiveness. LLM-assisted (Anthropic, OpenAI, Gemini, Ollama) with rule-based fallback. Observed stats (latency p50/p95, failure rate, output size) updated after every proxied call. **Security scanning**: mcpsafety+ five-stage pipeline (Recon, Planner, Hacker, Auditor, Supervisor). Cisco AI Defense (AST/YARA). Snyk (metadata analysis). Kali and Burp Suite integrations enrich the pipeline with real network data and HTTP-layer probes. Source code scanning from GitHub with entropy, AST, taint flow, and rug-pull detection. <p align="center"> <img width="1983" height="793" alt="3e457b83-6980-49fc-b812-350ab94f5633" src="https://github.com/user-attachments/assets/bb0c479b-32a9-464c-9b80-b7f295924266" /> <br/> <em>Fig 2. mcpsafety+ five-stage pipeline, triggered when you run a full security audit on any MCP server</em> </p> **Safe execution**: Argument scanning (20+ attack categories, LLM second-pass). Two-layer output injection scanning. Risk gating with alternatives and per-tool policies. Drift detection on every call and standalone check. <p align="center"> <img width="1024" height="572" alt="fb949e62-5a3e-4a5c-be14-3523ef92295e" src="https://github.com/user-attachments/assets/5928e182-0f95-46aa-aa6e-88b4247ad137" /> <br/> <em>Fig 3. Safe execution pipeline: the five checks every proxied tool call passes through</em> </p> **CLI**: 24 subcommands, interactive risk menu, `--json` flag on every command, `--yes` for CI. **What it detects** - **Prompt injection**: tool outputs trying to hijack the agent: role hijacking, jailbreaks, fake system prompts, instruction overrides. Detects 11 obfuscation techniques including Unicode lookalikes, zero-width characters, and base64-encoded payloads. - **Malicious tool metadata**: descriptions containing injection strings, hardcoded secrets, suspicious download URLs, tool impersonation (shadowing), direct financial execution, system service modification, and untrusted external dependencies. Backed by 19 Snyk checks. - **Argument injection**: 20+ attack categories checked on every tool call before the call is forwarded: SSRF to cloud metadata endpoints (AWS, GCP, Azure, Alibaba), path traversal, credential file access (.aws, .ssh, .kube, .env), command injection, SQL/NoSQL/LDAP/XPath injection, XXE, template injection (SSTI), CRLF, null byte, deserialization payloads (Java, Python pickle, PHP, .NET), Windows UNC/ADS attacks, and base64-obfuscated variants of all of the above. - **Source code risks**: fetches the server's GitHub source and runs 6 analysis layers: entropy scanning for hardcoded secrets, AST taint flow tracking (parameter to dangerous sink), description-vs-implementation mismatch, Bandit and Semgrep SAST, and LLM cross-function reasoning. Supports Python and TypeScript/JavaScript. - **Rug-pull and drift**: stores a SHA-256 hash of the server's source on first scan and alerts if it changes. Catches description swaps, schema changes, and tool removal live on every call via a per-call drift guard. - **Behavior anomalies**: classifies every tool by effect class, destructiveness, and 7 risk tags: credential exposure, arbitrary execution, data exfiltration, filesystem access, lateral movement, privilege escalation, and prompt injection surface. - **Composition attacks**: analyzes tool sets for chaining risks: IDOR chains, read-write pairs, auth flow exploitation, write-then-execute sequences, and data accumulation + exfiltration paths across multiple tools. - **Network and host risks**: when Kali Linux MCP is registered: open ports, running services, OS fingerprint via nmap. When Burp Suite MCP is registered: HTTP-layer active probing and blind SSRF via out-of-band callbacks. - **Credential exposure in outputs**: redacts secrets from tool responses before storage. Injection-flagged responses are quarantined and never returned to the calling agent - stored under a run ID for forensic review. - **CVE research and Arxiv findings**: the mcpsafety+ Auditor stage cross-references discovered capabilities against known vulnerabilities and recent security research. ## Prerequisites - Python 3.10 or later - At least one wrapped MCP server to proxy (stdio, SSE, or streamable_http) - **Recommended: an LLM API key** (Anthropic, OpenAI, or Gemini) Without a key the wrapper operates in rule-based-only mode: lower confidence tool classification, regex-only injection scanning, no alternatives in the risk gate, no mcpsafety+ pipeline. For a fully local setup, run [Ollama](https://ollama.com), set `OLLAMA_MODEL`, and pass `--provider ollama` explicitly (Ollama is not auto-detected). > [!NOTE] > **stdio servers that require local setup** (`stdio` servers that need local configuration before starting - missing config files, credentials, data directories, or OS-specific dependencies) cannot be inspected by the wrapper - tool discovery will fail and 0 tools will be stored. You can still run a full source-code security scan without spawning the server by passing `--github-url` to `scan` / `onboard`, or the `github_url` parameter to `security_scan_server`. The mcpsafety+ pipeline will fetch and analyze the source directly from GitHub. `sse` and `streamable_http` servers are not affected. ## Installation ```bash pip install mcpsafetywarden ``` With all optional extras: ```bash pip install "mcpsafetywarden[all]" ``` Or specific extras: ```bash pip install "mcpsafetywarden[anthropic,snyk]" ``` From source: ```bash git clone https://github.com/gautamvarmadatla/mcpsafetywarden cd mcpsafetywarden pip install . ``` The SQLite database is created automatically on first run in the platform user data directory (`~/.local/share/mcpsafetywarden/` on Linux, `~/Library/Application Support/mcpsafetywarden/` on macOS, `%APPDATA%\mcpsafetywarden\` on Windows). Override with `MCP_DB_PATH`. **Credential protection (automatic, no action required)** Secret values passed to `register_server` or `onboard_server` (Bearer tokens, API keys in `headers` or `env`) are automatically detected and replaced with opaque `cref_` identifiers before anything touches the model context. The real credential is stored encrypted in the database and resolved silently at connection time. The model, conversation history, and logs only ever see `cref_<id>`. **Optional: at-rest encryption for stored credentials** ```bash pip install cryptography python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())" ``` Set the printed key as `MCP_DB_ENCRYPTION_KEY` before starting the server. This encrypts both server credentials and `cref_` values at rest. ## Configuration All configuration is via envir
What people ask about mcpsafetywarden
What is gautamvarmadatla/mcpsafetywarden?
+
gautamvarmadatla/mcpsafetywarden is mcp servers for the Claude AI ecosystem. MCP servers expose tools with no information about what they actually do at runtime. mcpsafetywarden sits between your agent and any MCP server, profiling tool behavior, blocking destructive calls, and running active security audits before you trust them in a workflow. It has 6 GitHub stars and was last updated yesterday.
How do I install mcpsafetywarden?
+
You can install mcpsafetywarden by cloning the repository (https://github.com/gautamvarmadatla/mcpsafetywarden) or following the README instructions on GitHub. ClaudeWave also provides quick install blocks on this page.
Is gautamvarmadatla/mcpsafetywarden safe to use?
+
Our security agent has analyzed gautamvarmadatla/mcpsafetywarden and assigned a Trust Score of 62/100 (tier: OK). See the full breakdown of passed checks and flags on this page.
Who maintains gautamvarmadatla/mcpsafetywarden?
+
gautamvarmadatla/mcpsafetywarden is maintained by gautamvarmadatla. The last recorded GitHub activity is from yesterday, with 0 open issues.
Are there alternatives to mcpsafetywarden?
+
Yes. On ClaudeWave you can browse similar mcp servers at /categories/mcp, sorted by popularity or recent activity.
Deploy mcpsafetywarden to your cloud
Ship this repo to production in minutes. Each platform spins up its own environment with editable env vars.
Maintain this repo? Add a badge to your README
Drop the badge into your GitHub README to show it's tracked on ClaudeWave. Each badge links back to this page and reflects the live Trust Score.
[](https://claudewave.com/repo/gautamvarmadatla-mcpsafetywarden)<a href="https://claudewave.com/repo/gautamvarmadatla-mcpsafetywarden"><img src="https://claudewave.com/api/badge/gautamvarmadatla-mcpsafetywarden" alt="Featured on ClaudeWave: gautamvarmadatla/mcpsafetywarden" width="320" height="64" /></a>More MCP Servers
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
An open-source AI agent that brings the power of Gemini directly into your terminal.
The fastest path to AI-powered full stack observability, even for lean teams.
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。AI 智能筛选新闻 + AI 翻译 + AI 分析简报直推手机,也支持接入 MCP 架构,赋能 AI 自然语言对话分析、情感洞察与趋势预测等。支持 Docker ,数据本地/云端自持。集成微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 等渠道智能推送。