CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
4,738 stars. No commits in the last 6 months.
Stars
4,738
Forks
482
Language
Python
License
MIT
Category
Last pushed
Jan 08, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/CarperAI/trlx"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...
Denys88/rl_games
RL implementations
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
flatland-association/flatland-rl
The Flatland Framework is a multi-purpose environment to tackle problems around resilient...
takuseno/d3rlpy
An offline deep reinforcement learning library