FareedKhan-dev/training-ai-agents

Training architecture for self-improving AI agents.

/ 100

Emerging

Implements a multi-agent training pipeline using LangGraph and distributed RL algorithms (SFT, PPO, contextual bandits) with real-time observability via LangSmith and Weights & Biases. Agents collaborate through shared hierarchical state, exchange knowledge in parallel, and self-improve through dynamic reward systems that adapt based on performance and task alignment. The architecture progresses through supervised fine-tuning, reinforcement learning phases, and includes tracing hooks and logging adapters for capturing every interaction and learning step.

No Package No Dependents

Maintenance 6 / 25

Adoption 8 / 25

Maturity 13 / 25

Community 20 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Featured in

Your Docs Are Written for Humans. Your Users Are Agents.

Compare

training-ai-agents and ai-agents-for-beginners

Higher-rated alternatives

microsoft/ai-agents-for-beginners

12 Lessons to Get Started Building AI Agents

ForceInjection/AI-fundermentals

AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识

NirDiamant/agents-towards-production

This repository delivers end-to-end, code-first tutorials covering every layer of...

cnoe-io/ai-platform-engineering

CAIPE: Community AI Platform Engineering Multi-Agent Systems

FlyAIBox/Agent_In_Action

Agentic AI 智能体开发实战

Explore AI Agents

All categories Trending AI Agent directory Insights