ClaudeWave
waybarrios

vllm-mlx

View on GitHub: https://github.com/waybarrios/vllm-mlx

OpenAI- and Anthropic-compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend delivering 400+ tok/s. Works with Claude Code.
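Because the server exposes an OpenAI-compatible API, any standard OpenAI client can talk to it once it is running locally. The sketch below is illustrative only: the base URL, port, and model identifier are assumptions, not values confirmed by this listing; check the repo README for the actual defaults.

# Minimal sketch: querying a locally running vllm-mlx server through its
# OpenAI-compatible endpoint. Host, port, and model name are assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # assumed local endpoint
    api_key="not-needed",                 # local servers typically ignore the key
)

response = client.chat.completions.create(
    model="mlx-community/Llama-3.1-8B-Instruct-4bit",  # hypothetical model id
    messages=[{"role": "user", "content": "Summarize what MLX is in one sentence."}],
)
print(response.choices[0].message.content)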

Tools · 829 stars · 188 forks · Python · Apache-2.0 · Updated today
ClaudeWave Trust Score: 97/100 (Verified)
Checks passed:
  • Open-source license (Apache-2.0)
  • Actively maintained (updated within the last 30 days)
  • Healthy fork ratio
  • Clear description
  • Topics declared
Last scanned: 4/14/2026
Install in Claude Desktop
Method detected: Manual
{
  "mcpServers": {
    "vllm-mlx": {
      "command": "node",
      "args": ["/path/to/vllm-mlx/dist/index.js"]
    }
  }
}
1. Copy the snippet above.
2. Paste into ~/Library/Application Support/Claude/claude_desktop_config.json (Mac) or %APPDATA%\Claude\claude_desktop_config.json (Windows).
3. Replace any <placeholder> values with your API keys or paths.
4. Restart Claude Desktop. The MCP server appears automatically.
💡 Clone https://github.com/waybarrios/vllm-mlx and follow its README for install instructions.
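Once the server is up, the Anthropic-compatible side can be exercised with the anthropic Python SDK pointed at the local address. As above, the base URL, port, and model name are assumptions for illustration; the README documents the real launch command and defaults.

# Minimal sketch of the Anthropic-compatible endpoint; the endpoint and model id
# are assumptions, not values confirmed by this listing.
import anthropic

client = anthropic.Anthropic(
    base_url="http://localhost:8000",  # assumed local endpoint
    api_key="not-needed",              # local servers typically ignore the key
)

message = client.messages.create(
    model="mlx-community/Qwen2.5-VL-7B-Instruct-4bit",  # hypothetical model id
    max_tokens=256,
    messages=[{"role": "user", "content": "Reply with OK if you can hear me."}],
)
print(message.content[0].text)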
Use cases
🎬 Media · 🧠 AI / ML · 🎨 Creative

Tools overview

README preview not available. Visit the repo on GitHub for full documentation.
anthropic · apple-silicon · audio-processing · claude-code · computer-vision · image-understanding · inference · llm · machine-learning · macos · mllm · mlx · multimodal-ai · speech-to-text · stt · text-to-speech · tts · video-understanding · vision-language-model · vllm
