algorithmicsuperintelligence/optillm

Optimizing inference proxy for LLMs

Score: 62 / 100 (Established)

Implements 20+ inference-time optimization techniques—including MARS, CePO, chain-of-thought reflection, and Monte Carlo tree search—that layer multiple reasoning strategies to achieve 2-10x accuracy gains on math and coding tasks. Acts as an OpenAI API-compatible proxy that intercepts requests and automatically applies selected techniques based on model prefix (e.g., `moa-gpt-4o-mini`), requiring no model retraining or client-side changes. Supports 100+ models across OpenAI, Anthropic, Google, and other providers via LiteLLM, with multi-variant Docker images for full, proxy-only, or offline deployment scenarios.
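As a sketch of what this looks like from the client side, the snippet below points the standard OpenAI Python SDK at a locally running optillm proxy and selects the mixture-of-agents technique via the `moa-` model prefix. The base URL, port, and API key handling here are assumptions about a typical local setup; adjust them to match your deployment.

```python
# Minimal sketch: calling optillm as an OpenAI-compatible proxy.
# Assumes the proxy is reachable at http://localhost:8000/v1 (adjust as needed)
# and that the API key is forwarded to the configured upstream provider.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # point the SDK at the optillm proxy
    api_key="sk-...",                     # placeholder; passed through to the provider
)

# The "moa-" prefix asks the proxy to apply mixture-of-agents on top of
# gpt-4o-mini; no other client-side changes are required.
response = client.chat.completions.create(
    model="moa-gpt-4o-mini",
    messages=[{"role": "user", "content": "Solve: what is 23 * 17?"}],
)
print(response.choices[0].message.content)
```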

3,377 stars. Actively maintained with 6 commits in the last 30 days.

No package published; no dependents.

Maintenance: 17 / 25
Adoption: 10 / 25
Maturity: 16 / 25
Community: 19 / 25


Stars: 3,377
Forks: 265
Language: Python
License: Apache-2.0
Last pushed: Jan 28, 2026
Commits (30d): 6

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/algorithmicsuperintelligence/optillm"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.