cascadeflow

Name: lemony-ai/cascadeflow
Author: lemony-ai

Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside the agent loop.

Subagents3.6k stars771 forks● PythonMITUpdated 26d ago

Editor's note

CascadeFlow is an in-process runtime intelligence layer for AI agents that handles model selection, cost control, latency optimization, and policy enforcement inside the agent execution loop rather than at the HTTP boundary. It integrates with Claude through the Anthropic API and works alongside frameworks including LangChain, CrewAI, PydanticAI, Google ADK, n8n, Vercel AI SDK, and OpenAI Agents SDK, available as both a Python package and a TypeScript npm module. The core mechanism is model cascading: routing each agent step or tool call to the most appropriate and cost-effective model based on task complexity, token budgets, quality thresholds, and business KPIs accumulated across the run. A standout benchmark result from the README shows 93% cost reduction on GSM8K math tasks while retaining 96% of GPT-5 quality. The library adds under 5ms overhead per call and supports per-tool-call budget gating and runtime stop, continue, or escalate decisions. It targets developers and teams running multi-step Claude-based agents who need transparent cost accounting and adaptive model routing without an external proxy.

ClaudeWave Trust Score

100/100

✓ Verified

Passed

✓Open-source license (MIT)
✓Actively maintained (<30d)
✓Healthy fork ratio
✓Clear description
✓Topics declared
✓Documented (README)

Last scanned: 6/11/2026

Install as a Claude Code subagent

Method: Clone

Terminal

git clone https://github.com/lemony-ai/cascadeflow && cp cascadeflow/*.md ~/.claude/agents/

1. Clone the repository and copy the agent .md definitions into ~/.claude/agents (or .claude/agents inside a project).

2. Start a new Claude Code session to load the agents.

3. Delegate work to them with the Task/Agent tool or by name.

Installable items

1 items in this repository

cascadeflowSkill

Use when building, extending, or debugging AI agents with cascadeflow (agent runtime intelligence layer) — installing `cascadeflow` (Python) or `@cascadeflow/core`/`@cascadeflow/langchain` (TypeScript); using `CascadeAgent`, `ModelConfig`, harness APIs (`cascadeflow.init`, `cascadeflow.run`, `@agent` from `cascadeflow.harness`, `simulate`), `withCascade`/`CascadeFlow`; picking drafter+verifier pairs; per-step budget/compliance/KPI enforcement; quality validation; complexity pre-routing; tool execution and multi-turn agent loops; presets; decision traces; or wiring cascadeflow into LangChain, OpenAI Agents, CrewAI, PydanticAI, Google ADK, n8n, or Vercel AI SDK. Also when a user mentions "cascade", "drafter/verifier", "runtime intelligence", "in-process harness", "cost-optimized agent", "agent loop with cost control", is in the lemony-ai/cascadeflow repo, or found a bug in cascadeflow/integrations needing an upstream fix/PR.

Install

Use cases

AI / ML Dev Tools Automation

About

Subagents overview

README preview not available. Visit the repo on GitHub for full documentation.

Topics

agentaianthropicapibudgetsclaudecost-optimizationcost-transparencygoogle-adkgpthuggingfacellmmodel-cascadingn8nollamaopenaipythontogether-aitypescriptvllm

Frequently asked

What people ask about cascadeflow

What is lemony-ai/cascadeflow?

lemony-ai/cascadeflow is subagents for the Claude AI ecosystem. Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside the agent loop. It has 3.6k GitHub stars and was last updated 26d ago.

How do I install cascadeflow?

You can install cascadeflow by cloning the repository (https://github.com/lemony-ai/cascadeflow) or following the README instructions on GitHub. ClaudeWave also provides quick install blocks on this page.

Is lemony-ai/cascadeflow safe to use?

Our security agent has analyzed lemony-ai/cascadeflow and assigned a Trust Score of 100/100 (tier: Verified). See the full breakdown of passed checks and flags on this page.

Who maintains lemony-ai/cascadeflow?

lemony-ai/cascadeflow is maintained by lemony-ai. The last recorded GitHub activity is from 26d ago, with 8 open issues.

Are there alternatives to cascadeflow?

Yes. On ClaudeWave you can browse similar subagents at /categories/agents, sorted by popularity or recent activity.

1-click deploy

Deploy cascadeflow to your cloud

Ship this repo to production in minutes. Each platform spins up its own environment with editable env vars.

Vercel Railway Render

Embeddable badge

Maintain this repo? Add a badge to your README

Drop the badge into your GitHub README to show it's tracked on ClaudeWave. Each badge links back to this page and reflects the live Trust Score.

Markdown (README)

[![Featured on ClaudeWave](https://claudewave.com/api/badge/lemony-ai-cascadeflow)](https://claudewave.com/repo/lemony-ai-cascadeflow)

HTML

<a href="https://claudewave.com/repo/lemony-ai-cascadeflow"><img src="https://claudewave.com/api/badge/lemony-ai-cascadeflow" alt="Featured on ClaudeWave: lemony-ai/cascadeflow" width="320" height="64" /></a>

More Subagents

affaan-m

ECC

today

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

234.2k35.7kJavaScript

Subagentsai-agentsanthropicInstall

NousResearch

hermes-agent

today

The agent that grows with you

221.5k42.3kPython

Subagentsaiai-agentInstall

Snailclimb

JavaGuide

yesterday

Java 面试 & 后端通用面试指南，覆盖计算机基础、数据库、分布式、高并发、系统设计与 AI 应用开发

157.3k46.2kJavaScript

SubagentsagentaiInstall

langgenius

dify

today

Build Agentic workflows, RAG pipelines, with rich AI model and tool support on one collaborative workspace. Deploy on cloud, VPC, or self-hosted, so teams move from prototype to production without rebuilding the stack.

150.5k23.7kTypeScript

Subagentsagentagentic-aiInstall

langchain-ai

langchain

today

The agent engineering platform.

142.7k23.8kPython

SubagentsagentsaiInstall

Graphify-Labs

graphify

today

Turn any codebase, with its docs, SQL schemas, configs, and PDFs, into a queryable knowledge graph. A /graphify skill for Claude Code, Cursor, Codex, and Gemini CLI: local deterministic AST parsing, every edge explained, no vector store.

97.2k9.4kPython

Subagentsai-agentsantigravityInstall