adrianliechti/wingman
Inference Hub for AI at Scale
Supports multi-provider LLM integration (OpenAI, Anthropic, Gemini, Bedrock, local Ollama) with pluggable document processing pipelines (extractors, segmenters, retrievers) for RAG workflows. Offers modular architecture with built-in tools, Model Context Protocol (MCP) support for external tool servers, and load balancing/rate limiting across providers. Exposes OpenAI-compatible APIs with full OpenTelemetry observability and YAML-based configuration for chains, agents, and complex AI workflows.
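Since the description says Wingman exposes OpenAI-compatible APIs, a standard chat-completions request should work against a running instance. This is a hedged sketch: the host/port (`localhost:8080`) and model name (`my-model`) are assumptions, not values documented here — substitute whatever your deployment configures.

```shell
# Sketch: call an assumed local Wingman instance via the standard
# OpenAI-compatible chat completions endpoint.
# "localhost:8080" and "my-model" are placeholders, not documented defaults.
curl "http://localhost:8080/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "my-model",
    "messages": [
      {"role": "user", "content": "Hello"}
    ]
  }'
```

Because the API is OpenAI-compatible, existing OpenAI client SDKs should also work by pointing their base URL at the Wingman server.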
Stars
73
Forks
12
Language
Go
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/adrianliechti/wingman"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
langbot-app/LangBot
Production-grade platform for building agentic IM bots (production-grade multi-platform intelligent bot development platform), providing agents, knowledge-base orchestration, a plugin system /...
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
cactus-compute/cactus
Low-latency AI engine for mobile devices & wearables
rudrankriyam/Foundation-Models-Framework-Example
Example apps for Foundation Models Framework in iOS 26 and macOS 26
sigoden/aichat
All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with...