llama-farm/llamafarm

Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes

/ 100

Established

Combines a multi-model inference layer (Ollama, vLLM, OpenAI-compatible) with specialized ML runtimes for embeddings, OCR, and anomaly detection across 12+ algorithms. Built on FastAPI with a Celery-based RAG worker and a visual Designer UI, enabling local-first development with complete privacy and hardware acceleration for Apple Silicon, NVIDIA, and AMD GPUs.

823 stars. Actively maintained with 11 commits in the last 30 days.

No Package No Dependents

Maintenance 20 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 15 / 25

How are scores calculated?

Stars

823

Forks

Language

Python

License

Apache-2.0

Related tools

langbot-app/LangBot

Production-grade platform for building agentic IM bots - 生产级多平台智能机器人开发平台. 提供 Agent、知识库编排、插件系统 /...

open-webui/open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

cactus-compute/cactus

Low-latency AI engine for mobile devices & wearables

rudrankriyam/Foundation-Models-Framework-Example

Example apps for Foundation Models Framework in iOS 26 and macOS 26

sigoden/aichat

All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with...

Explore RAG Tools

All categories Trending RAG directory Insights