lemony-ai/cascadeflow

Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside the agent loop.

74
/ 100
Verified

It acts as an in-process agent harness, enabling per-step model decisions, budget gating for tool calls, and runtime actions like `stop` or `switch_model` with sub-5ms overhead. The library intelligently selects optimal models through speculative execution and integrates with popular frameworks like LangChain, CrewAI, OpenAI Agents, and Vercel AI SDK.

294 stars and 221 monthly downloads. Available on PyPI.

Maintenance 13 / 25
Adoption 15 / 25
Maturity 22 / 25
Community 24 / 25

How are scores calculated?

Stars

294

Forks

96

Language

Python

License

MIT

Last pushed

Mar 12, 2026

Monthly downloads

221

Commits (30d)

0

Dependencies

3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/lemony-ai/cascadeflow"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.