Skip to main content
ClaudeWave

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

MCP Servers4.3k stars706 forksGoApache-2.0Updated today
Editor's note

vLLM Semantic Router is a signal-driven routing layer, written in Go with Rust components via Hugging Face Candle, that directs incoming LLM workloads to the most appropriate model across cloud, data center, and edge deployments. It exposes an MCP server interface, allowing Claude and other MCP-compatible clients to interact with its routing infrastructure. The system performs real-time classification using BERT-based models to detect jailbreaks, PII leakage, and hallucinations before or after inference, and applies category-aware semantic caching to reduce redundant token generation. A LoRA-extensible architecture lets teams fine-tune routing classifiers for domain-specific workloads without retraining from scratch. Kubernetes-native deployment is supported alongside a Prompt Guard feature that intercepts unsafe inputs. The project targets platform engineers, MLOps teams, and enterprises running heterogeneous model fleets who need cost control, privacy boundary enforcement, and safety guardrails across a mixture-of-models setup rather than a single fixed endpoint.

ClaudeWave Trust Score
100/100
Verified
Passed
  • Open-source license (Apache-2.0)
  • Actively maintained (<30d)
  • Healthy fork ratio
  • Clear description
  • Topics declared
  • Documented (README)
Last scanned: 6/11/2026
Install in Claude Code / Claude Desktop
Method: Manual · semantic-router
Claude Code CLI
git clone https://github.com/vllm-project/semantic-router
claude_desktop_config.json (Claude Desktop)
{
  "mcpServers": {
    "semantic-router": {
      "command": "semantic-router"
    }
  }
}
1. Run the command above in your terminal (Claude Code), or paste the JSON config into claude_desktop_config.json (Claude Desktop).
2. Replace any <placeholder> values with your API keys or paths.
3. Restart Claude. The MCP server and its tools appear automatically.
💡 Install the binary first: go install github.com/vllm-project/semantic-router@latest (make sure it ends up on your PATH).
Use cases

MCP Servers overview

README preview not available. Visit the repo on GitHub for full documentation.
ai-gatewaybert-classificationfine-tuninggolanghuggingface-candlehuggingface-transformerskubernetesllmllmroutermcpmixture-of-modelsopenclawpii-detectionprompt-engineeringprompt-guardrustsemantic-routervllm

What people ask about semantic-router

What is vllm-project/semantic-router?

+

vllm-project/semantic-router is mcp servers for the Claude AI ecosystem. System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge It has 4.3k GitHub stars and was last updated today.

How do I install semantic-router?

+

You can install semantic-router by cloning the repository (https://github.com/vllm-project/semantic-router) or following the README instructions on GitHub. ClaudeWave also provides quick install blocks on this page.

Is vllm-project/semantic-router safe to use?

+

Our security agent has analyzed vllm-project/semantic-router and assigned a Trust Score of 100/100 (tier: Verified). See the full breakdown of passed checks and flags on this page.

Who maintains vllm-project/semantic-router?

+

vllm-project/semantic-router is maintained by vllm-project. The last recorded GitHub activity is from today, with 205 open issues.

Are there alternatives to semantic-router?

+

Yes. On ClaudeWave you can browse similar mcp servers at /categories/mcp, sorted by popularity or recent activity.

Deploy semantic-router to your cloud

Ship this repo to production in minutes. Each platform spins up its own environment with editable env vars.

Maintain this repo? Add a badge to your README

Drop the badge into your GitHub README to show it's tracked on ClaudeWave. Each badge links back to this page and reflects the live Trust Score.

Featured on ClaudeWave: vllm-project/semantic-router
[![Featured on ClaudeWave](https://claudewave.com/api/badge/vllm-project-semantic-router)](https://claudewave.com/repo/vllm-project-semantic-router)
<a href="https://claudewave.com/repo/vllm-project-semantic-router"><img src="https://claudewave.com/api/badge/vllm-project-semantic-router" alt="Featured on ClaudeWave: vllm-project/semantic-router" width="320" height="64" /></a>

More MCP Servers