ClaudeWave

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

MCP Servers · 4.3k stars · 706 forks · Go · Apache-2.0 · Updated today
Editorial note

vLLM Semantic Router is a signal-driven routing layer, written in Go with Rust components via Hugging Face Candle, that directs incoming LLM workloads to the most appropriate model across cloud, data center, and edge deployments. It exposes an MCP server interface, allowing Claude and other MCP-compatible clients to interact with its routing infrastructure. The system performs real-time classification using BERT-based models to detect jailbreaks, PII leakage, and hallucinations before or after inference, and applies category-aware semantic caching to reduce redundant token generation. A LoRA-extensible architecture lets teams fine-tune routing classifiers for domain-specific workloads without retraining from scratch. Kubernetes-native deployment is supported alongside a Prompt Guard feature that intercepts unsafe inputs. The project targets platform engineers, MLOps teams, and enterprises running heterogeneous model fleets who need cost control, privacy boundary enforcement, and safety guardrails across a mixture-of-models setup rather than a single fixed endpoint.

ClaudeWave Trust Score
100/100
Verified
Passed
  • Open-source license (Apache-2.0)
  • Actively maintained (<30d)
  • Healthy fork ratio
  • Clear description
  • Topics declared
  • Documented (README)
Last scanned: 6/11/2026
Install in Claude Code / Claude Desktop
Method: Manual · semantic-router
Claude Code CLI
git clone https://github.com/vllm-project/semantic-router
claude_desktop_config.json (Claude Desktop)
{
  "mcpServers": {
    "semantic-router": {
      "command": "semantic-router"
    }
  }
}
1. Run the command above in your terminal (Claude Code), or paste the JSON config into claude_desktop_config.json (Claude Desktop).
2. Replace any <placeholder> values with your API keys or paths.
3. Restart Claude. The MCP server and its tools appear automatically.
💡 Install the binary first: go install github.com/vllm-project/semantic-router@latest (make sure it ends up on your PATH).
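If claude_desktop_config.json already lists other MCP servers, pasting the JSON above over the file would drop them. A minimal sketch of merging the entry instead, assuming the caller supplies the config path (its location varies by operating system and is not stated on this page); the helper name `add_mcp_server` is hypothetical:

```python
import json
from pathlib import Path


def add_mcp_server(config_path: Path, name: str, command: str) -> dict:
    """Add or update one entry under "mcpServers", keeping existing entries."""
    raw = config_path.read_text() if config_path.exists() else ""
    # Treat a missing or empty file as an empty config.
    config = json.loads(raw) if raw.strip() else {}
    config.setdefault("mcpServers", {})[name] = {"command": command}
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

For example, `add_mcp_server(Path("claude_desktop_config.json"), "semantic-router", "semantic-router")` writes the same entry shown in the JSON block while leaving any other servers in place.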
Use cases

MCP Servers overview

README not available. Visit the repo on GitHub for the full documentation.
Topics: ai-gateway · bert-classification · fine-tuning · golang · huggingface-candle · huggingface-transformers · kubernetes · llm · llmrouter · mcp · mixture-of-models · openclaw · pii-detection · prompt-engineering · prompt-guard · rust · semantic-router · vllm

What people ask about semantic-router

What is vllm-project/semantic-router?

vllm-project/semantic-router is an MCP server for the Claude AI ecosystem: a System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge. It has 4.3k stars on GitHub and was last updated today.

How do I install semantic-router?

You can install semantic-router by cloning the repository (https://github.com/vllm-project/semantic-router) or by following the instructions in the README on GitHub. ClaudeWave also provides quick-install blocks on this page.

Is vllm-project/semantic-router safe to use?

Our security agent has analyzed vllm-project/semantic-router and assigned it a Trust Score of 100/100 (tier: Verified). See the full breakdown of passed checks and flags on this page.

Who maintains vllm-project/semantic-router?

vllm-project/semantic-router is maintained by vllm-project. The latest activity recorded on GitHub is from today, with 205 open issues.

Are there alternatives to semantic-router?

Yes. On ClaudeWave you can browse similar MCP servers at /categories/mcp, sorted by popularity or recent activity.

Deploy semantic-router to your cloud

Take this repo to production in minutes. Each platform generates its own environment with editable environment variables.

Do you maintain this repo? Add a badge to your README

Paste the badge into your GitHub README to show that the repo is audited by ClaudeWave. Each badge links back to this page and shows the current Trust Score.

Featured on ClaudeWave: vllm-project/semantic-router
[![Featured on ClaudeWave](https://claudewave.com/api/badge/vllm-project-semantic-router)](https://claudewave.com/repo/vllm-project-semantic-router)
<a href="https://claudewave.com/repo/vllm-project-semantic-router"><img src="https://claudewave.com/api/badge/vllm-project-semantic-router" alt="Featured on ClaudeWave: vllm-project/semantic-router" width="320" height="64" /></a>
