omni-cache
The omni-cache Claude Code skill manages LLM response caching through REST API endpoints that retrieve cache statistics, clear cached entries, and monitor cache performance across multiple layers. Use this skill when you need to optimize API performance by inspecting what responses are cached, free up storage by removing old entries, or troubleshoot caching behavior in applications built on the OmniRoute framework.
git clone --depth 1 https://github.com/diegosouzapw/OmniRoute /tmp/omni-cache && cp -r /tmp/omni-cache/skills/omni-cache ~/.claude/skills/omni-cacheSKILL.md
<!-- generated by src/lib/agentSkills/generator.ts; manual edits will be overwritten --> ## Overview Manage the LLM response cache. View cache statistics, clear entries, configure TTL policies, and control semantic-similarity caching thresholds. ## Authentication All requests require a valid Bearer token or session cookie. Obtain a token via `POST /api/auth/login` or configure `REQUIRE_API_KEY=false` for local development. ## Endpoints ### GET /api/cache Get cache statistics ```bash curl https://localhost:20128/api/cache \ -H "Authorization: Bearer $OMNIROUTE_TOKEN" ``` ### DELETE /api/cache Clear all caches ```bash curl -X DELETE https://localhost:20128/api/cache \ -H "Authorization: Bearer $OMNIROUTE_TOKEN" ``` ### GET /api/cache/stats Get detailed cache statistics Returns detailed statistics for all cache layers. ```bash curl https://localhost:20128/api/cache/stats \ -H "Authorization: Bearer $OMNIROUTE_TOKEN" ``` ### DELETE /api/cache/stats Clear cache statistics ```bash curl -X DELETE https://localhost:20128/api/cache/stats \ -H "Authorization: Bearer $OMNIROUTE_TOKEN" ``` ## Payloads See the full OpenAPI specification at `GET /api/openapi/spec` or `docs/reference/openapi.yaml` for detailed request/response schemas.
Interact with the OmniRoute A2A server from the CLI. Send tasks, inspect skill execution history, and test the JSON-RPC 2.0 agent-to-agent protocol interactively.
Backup and restore OmniRoute data from the CLI. Trigger incremental snapshots, sync to cloud storage, manage backup schedules, and restore from archive files.
Submit and monitor batch inference jobs from the CLI. Upload and manage files for batch processing, retrieve results, and integrate batch pipelines with CI/CD workflows.
Send chat completions, stream responses, and start an interactive REPL session from the CLI. Supports all OmniRoute providers, combo routing, and system prompt configuration.
Configure and test prompt compression from the CLI. Manage RTK filters, Caveman rules, stacked compression modes, and preview compression output with real prompts.
Manage context engineering configurations, RTK filter sets, and conversation sessions from the CLI. Apply context-relay settings and inspect active context pipelines.
View cost breakdowns, token usage, and call logs from the CLI. Filter by provider, model, or date range. Export usage reports and inspect per-connection spending.
Create and run evaluation suites, watch live benchmark progress, view scorecards, compare model performance, and integrate eval runs with CI workflows from the CLI.