AgileRL/AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.

Score: 63 / 100 (Established)

Implements evolutionary hyperparameter optimization that automatically tunes agent networks during training through population-based mutation and selection, eliminating the need for separate HPO runs. Supports diverse RL paradigms: on-policy (PPO), off-policy (TD3, DQN), offline RL, multi-agent (MADDPG, MATD3), and contextual bandits, with distributed training across multiple workers and PettingZoo compatibility for multi-agent environments. Also includes LLM fine-tuning via reinforcement fine-tuning (RFT), with optional dependencies on transformers, DeepSpeed, and PEFT.
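To illustrate the general technique behind the library's headline feature, here is a toy sketch of population-based hyperparameter mutation and selection. This is not AgileRL's actual API; the `fitness`, `mutate`, and `evolve` names and the two hyperparameters are illustrative stand-ins only.

```python
import random

def fitness(hp: dict) -> float:
    # Stand-in for an agent's evaluation score; a real run would train the
    # agent with these hyperparameters and measure its return.
    return -abs(hp["lr"] - 3e-4) - abs(hp["gamma"] - 0.99)

def mutate(hp: dict, rng: random.Random) -> dict:
    # Perturb a copy of an elite agent's hyperparameters.
    return {
        "lr": hp["lr"] * rng.choice([0.5, 1.0, 2.0]),
        "gamma": min(0.999, max(0.9, hp["gamma"] + rng.uniform(-0.01, 0.01))),
    }

def evolve(pop_size: int = 8, generations: int = 20, seed: int = 0) -> dict:
    rng = random.Random(seed)
    # Random initial population of hyperparameter configurations.
    pop = [{"lr": 10 ** rng.uniform(-5, -2), "gamma": rng.uniform(0.9, 0.999)}
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        elite = pop[: pop_size // 2]  # selection: keep the best half
        # Mutation: refill the population with perturbed copies of the elite.
        pop = elite + [mutate(rng.choice(elite), rng) for _ in elite]
    return max(pop, key=fitness)

best = evolve()
```

Because tuning happens inside the training loop rather than as a separate sweep, the population converges on good hyperparameters while the agents are still learning, which is where the claimed training-time savings come from.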

896 stars. Actively maintained with 10 commits in the last 30 days.

No package published · No dependents
Maintenance 20 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 17 / 25

How are scores calculated?
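The four category scores displayed above sum exactly to the overall score. Whether the site computes it precisely this way is not documented here, but the arithmetic checks out:

```python
# Category scores as displayed on this page (each out of 25).
scores = {"Maintenance": 20, "Adoption": 10, "Maturity": 16, "Community": 17}

# Four categories of 25 points each give an overall score out of 100.
overall = sum(scores.values())
print(overall)  # 63, matching the displayed 63 / 100
```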

Stars

896

Forks

66

Language

Python

License

Apache-2.0

Last pushed

Mar 09, 2026

Commits (30d)

10

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/AgileRL/AgileRL"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
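The same endpoint can be called from Python with only the standard library. Only the URL pattern is taken from this page; `quality_url` is a hypothetical helper, and since the response schema is not documented here, the JSON is printed as-is.

```python
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality/agents"

def quality_url(owner: str, repo: str) -> str:
    # Build the quality-score endpoint URL for a given repository.
    return f"{API_BASE}/{owner}/{repo}"

if __name__ == "__main__":
    url = quality_url("AgileRL", "AgileRL")
    # Free tier: 100 requests/day without a key.
    with urllib.request.urlopen(url) as resp:
        print(json.dumps(json.load(resp), indent=2))
```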