future-agi/futureagi-sdk
Production-grade AI evaluation, prompt management & observability SDK. Automated evaluations with sub-100ms guardrails. No human-in-the-loop required. Python + TypeScript.
The SDK provides dataset management for programmatically creating and updating datasets and for running evaluations on AI training data. It also offers a prompt workbench with version control, A/B testing, and deployment labeling, alongside a knowledge base for managing documents in RAG applications. It integrates with the external Future AGI platform for dashboard visibility and authenticates with API keys.
Stars: 37
Forks: —
Language: Python
License: BSD-3-Clause
Category:
Last pushed: Mar 13, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/future-agi/futureagi-sdk"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
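For scripted access, the curl call above can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the function names `build_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which the listing does not state. No API key is sent, matching the key-less tier described above.

```python
import json
import urllib.request
from typing import Any

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def build_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> Any:
    """GET the endpoint and decode the body.

    Assumption: the response body is JSON. The listing does not
    document the response schema, so the result is returned as-is.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("future-agi", "futureagi-sdk")` issues the same request as the curl command. Each call counts against the daily request limit, so caching the result locally is a reasonable choice for repeated lookups.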
Higher-rated alternatives
StonyBrookNLP/appworld
🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and...
qualifire-dev/rogue
AI Agent Evaluator & Red Team Platform
future-agi/ai-evaluation
Evaluation Framework for all your AI related Workflows
microsoft/WindowsAgentArena
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of...
agentscope-ai/OpenJudge
OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards