casinca/GRPO-classic-RL

Open-source implementation/adaptation of DeepSeek GRPO applied to Reinforcement Learning control problems. Example on LunarLander-V3.

15
/ 100
Experimental
No Package No Dependents
Maintenance 6 / 25
Adoption 0 / 25
Maturity 9 / 25
Community 0 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Category

lunar-lander-rl

Last pushed

Dec 06, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/casinca/GRPO-classic-RL"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.