ashworks1706/rlhf-from-scratch
A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.
Archived108 stars.
Stars
108
Forks
11
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Nov 07, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ashworks1706/rlhf-from-scratch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
uxlfoundation/scikit-learn-intelex
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
ddbourgin/numpy-ml
Machine learning, in numpy
INRIA/scikit-learn-mooc
Machine learning in Python with scikit-learn MOOC
nubank/fklearn
fklearn: Functional Machine Learning
workofart/ml-by-hand
A deep learning library built from scratch with complex neural networks examples built on top...