pentoai/ml-ralph

Autonomous ML agent for running experiments using Claude or Codex.

29 / 100

Experimental

Automates the full ML experiment lifecycle, from planning and execution to analysis and structured learning accumulation, through a four-phase cognitive cycle (understand, strategize, execute, reflect) that prioritizes verification and hypothesis testing over raw execution. Integrates with the Claude Code CLI and runs inside tmux, providing a terminal UI that works autonomously on an ML project once it has been initialized with a product requirements document. Implements a "paranoid scientist" framework that allocates 70% of effort to data verification and assumption validation, and that allows strategic retreats to the understand phase when experiments plateau.
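As a rough illustration of the four-phase cycle and the plateau-triggered retreat described above, the loop can be pictured as a small state machine. This is a minimal sketch, not ml-ralph's actual code: the names `Phase`, `CycleState`, `initialState`, and `nextState` and the thresholds `MIN_IMPROVEMENT` and `PLATEAU_ROUNDS` are all hypothetical, and the plateau rule (two reflections in a row without a meaningful metric gain) is an assumption made for the example.

```typescript
// Hypothetical names throughout; this is not ml-ralph's actual source.
type Phase = "understand" | "strategize" | "execute" | "reflect";

interface CycleState {
  phase: Phase;
  // Best metric seen so far (higher is better), or null before any experiment.
  bestMetric: number | null;
  // Consecutive reflections without a meaningful improvement.
  stalledRounds: number;
}

// Assumed thresholds, chosen only for illustration.
const MIN_IMPROVEMENT = 0.001;
const PLATEAU_ROUNDS = 2;

function initialState(): CycleState {
  return { phase: "understand", bestMetric: null, stalledRounds: 0 };
}

// Advance one phase. `latestMetric` is only consulted when leaving "reflect".
function nextState(state: CycleState, latestMetric?: number): CycleState {
  switch (state.phase) {
    case "understand":
      return { ...state, phase: "strategize" };
    case "strategize":
      return { ...state, phase: "execute" };
    case "execute":
      return { ...state, phase: "reflect" };
    case "reflect": {
      if (latestMetric === undefined) {
        throw new Error("reflect needs the metric from the last experiment");
      }
      const improved =
        state.bestMetric === null ||
        latestMetric - state.bestMetric > MIN_IMPROVEMENT;
      const bestMetric = improved ? latestMetric : state.bestMetric;
      const stalledRounds = improved ? 0 : state.stalledRounds + 1;
      if (stalledRounds >= PLATEAU_ROUNDS) {
        // Strategic retreat: go back to understanding and reset the counter.
        return { phase: "understand", bestMetric, stalledRounds: 0 };
      }
      // Otherwise keep iterating on strategy with what was learned.
      return { phase: "strategize", bestMetric, stalledRounds };
    }
  }
}
```

In this sketch, a reflection that shows progress loops back to strategize, while repeated stalls send the agent back to understand, where data and assumptions would be rechecked before another experiment runs.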

No License, No Package, No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 3 / 25

Community 9 / 25


Language

TypeScript

License

—

Higher-rated alternatives

jbexta/AgentPilot

A versatile workflow automation platform to create, organize, and execute AI workflows, from a...

vibeforge1111/vibeship-spark-intelligence

a self-evolving intelligent companion

tghamm/Anthropic.SDK

An unofficial C#/.NET SDK for accessing the Anthropic Claude API. This package is not affiliated...

ownpilot/OwnPilot

Privacy-first personal AI assistant platform with autonomous agents, tool orchestration, and...

vedant007-v/codex_dspy

🤖 Simplify multi-turn conversations with CodexAgent, a DSPy module for OpenAI Codex, offering...
