ashworks1706/rlhf-from-scratch

A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and its applications in Large Language Models, from scratch.

Archived

36 / 100 (Emerging)

Archived, No Package, No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 15 / 25
Community 12 / 25

Stars: 108
Forks: 11
Language: Jupyter Notebook
License: Apache-2.0
Last pushed: Nov 07, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ashworks1706/rlhf-from-scratch"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
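The same request can be made from a script. The following is a minimal Python sketch, not an official client: the base URL and path layout are copied from the curl command above, while the helper names `quality_url` and `fetch_quality` are hypothetical and introduced only for illustration. The response format is not documented here, so the sketch returns the raw response body as text rather than assuming a schema.

```python
import urllib.request
from urllib.parse import quote

# Base URL taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names stay valid in a URL.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Return the raw response body as text; no response schema is assumed."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset)


# Example call (performs a network request and counts against the daily limit):
# print(fetch_quality("ml-frameworks", "ashworks1706", "rlhf-from-scratch"))
```

Keeping URL construction separate from the network call makes the path logic easy to check offline, which helps when the keyless limit is only 100 requests per day.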