FareedKhan-dev/training-ai-agents
Training architecture for self-improving AI agents.
Implements a multi-agent training pipeline using LangGraph and distributed RL algorithms (SFT, PPO, contextual bandits) with real-time observability via LangSmith and Weights & Biases. Agents collaborate through shared hierarchical state, exchange knowledge in parallel, and self-improve through dynamic reward systems that adapt based on performance and task alignment. The architecture progresses through supervised fine-tuning, reinforcement learning phases, and includes tracing hooks and logging adapters for capturing every interaction and learning step.
Stars
54
Forks
24
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Nov 04, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/FareedKhan-dev/training-ai-agents"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
microsoft/ai-agents-for-beginners
12 Lessons to Get Started Building AI Agents
ForceInjection/AI-fundermentals
AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识
NirDiamant/agents-towards-production
This repository delivers end-to-end, code-first tutorials covering every layer of...
cnoe-io/ai-platform-engineering
CAIPE: Community AI Platform Engineering Multi-Agent Systems
FlyAIBox/Agent_In_Action
Agentic AI 智能体开发实战