pentoai/ml-ralph
Autonomous ML agent for running experiments using Claude or Codex.
Automates the full ML experiment lifecycle—planning, execution, analysis, and structured learning accumulation—through a four-phase cognitive cycle (understand, strategize, execute, reflect) that prioritizes verification and hypothesis testing over raw execution. Integrates with Claude Code CLI and runs within tmux, providing a terminal UI that operates autonomously on ML projects once initialized with a product requirements document. Implements a "paranoid scientist" framework allocating 70% effort to data verification and assumption validation, enabling strategic retreats to understanding phases when experiments plateau.
Stars
30
Forks
3
Language
TypeScript
License
—
Category
Last pushed
Feb 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/pentoai/ml-ralph"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jbexta/AgentPilot
A versatile workflow automation platform to create, organize, and execute AI workflows, from a...
vibeforge1111/vibeship-spark-intelligence
a self-evolving intelligent companion
tghamm/Anthropic.SDK
An unofficial C#/.NET SDK for accessing the Anthropic Claude API. This package is not affiliated...
ownpilot/OwnPilot
Privacy-first personal AI assistant platform with autonomous agents, tool orchestration, and...
vedant007-v/codex_dspy
🤖 Simplify multi-turn conversations with CodexAgent, a DSPy module for OpenAI Codex, offering...