HGVAbyte/rlhf-data-agent-full
🔍 Generate synthetic preference-ranked datasets for RLHF and DPO training, enhancing AI model fine-tuning and ensuring data integrity with blockchain support.
Stars: —
Forks: —
Language: Python
License: MIT
Category: —
Last pushed: Mar 21, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/HGVAbyte/rlhf-data-agent-full"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
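The curl call above can also be made from Python. The following is a minimal sketch using only the standard library and the keyless tier. It assumes the endpoint returns JSON, which this page does not state. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the path layout (category, owner, repository) is inferred from the single URL shown above. How a key is supplied is not documented here, so the sketch does not attempt it.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the request URL from its three path segments.

    The segment order (category/owner/repo) is inferred from the one
    example URL on this page; it is an assumption, not a documented contract.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Send a keyless GET request and parse the body.

    Assumes the response body is JSON; json.loads raises ValueError
    (json.JSONDecodeError) if it is not.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("ml-frameworks", "HGVAbyte", "rlhf-data-agent-full")
# print(json.dumps(data, indent=2))
```

With the keyless limit of 100 requests/day, caching the parsed result locally is a sensible default for anything that runs repeatedly.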
Higher-rated alternatives
google-deepmind/dm_control: Google DeepMind's software stack for physics-based simulation and Reinforcement Learning...
Denys88/rl_games: RL implementations
DLR-RM/stable-baselines3: PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
flatland-association/flatland-rl: The Flatland Framework is a multi-purpose environment to tackle problems around resilient...
takuseno/d3rlpy: An offline deep reinforcement learning library