llama-farm/llamafarm
Deploy any AI model, agent, database, RAG, and pipeline locally or remotely in minutes
Combines a multi-model inference layer (Ollama, vLLM, OpenAI-compatible) with specialized ML runtimes for embeddings, OCR, and anomaly detection across 12+ algorithms. Built on FastAPI with a Celery-based RAG worker and a visual Designer UI, enabling local-first development with complete privacy and hardware acceleration for Apple Silicon, NVIDIA, and AMD GPUs.
823 stars. Actively maintained with 11 commits in the last 30 days.
Stars: 823
Forks: 48
Language: Python
License: Apache-2.0
Category
Last pushed: Mar 12, 2026
Commits (30d): 11
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/llama-farm/llamafarm"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
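The same keyless request can be made from Python using only the standard library. This is a minimal sketch: the helper names `quality_url` and `fetch_quality` are hypothetical, and the `category/owner/repo` path layout is an assumption inferred from the single example URL above, not from documented API behavior. The response format is not described here, so the body is returned as plain text rather than parsed.

```python
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (assumed layout: /quality/<category>/<owner>/<repo>)."""
    # Percent-encode each path segment so unusual characters cannot break the URL.
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Send a keyless GET request and return the raw response body as text."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        # Fall back to UTF-8 if the server does not declare a charset.
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset)
```

Calling `fetch_quality("rag", "llama-farm", "llamafarm")` requests the same URL as the curl command. Because the keyless tier is limited to 100 requests/day, caching the result locally is probably sensible for repeated lookups.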
Related tools
langbot-app/LangBot
Production-grade, multi-platform platform for building agentic IM bots. Provides agents, knowledge-base orchestration, a plugin system /...
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
cactus-compute/cactus
Low-latency AI engine for mobile devices & wearables
rudrankriyam/Foundation-Models-Framework-Example
Example apps for Foundation Models Framework in iOS 26 and macOS 26
sigoden/aichat
All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with...