voidful/TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning from Human Feedback) on any generation model in Hugging Face's transformers (bloomz-176B/bloom/gpt/bart/T5/MetaICL)
564 stars and 129 monthly downloads. No commits in the last 6 months. Available on PyPI.
Stars: 564
Forks: 61
Language: Python
License: MIT
Category:
Last pushed: May 09, 2024
Monthly downloads: 129
Commits (30d): 0
Dependencies: 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/voidful/TextRL"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
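The same request can be made from a script. The following is a minimal sketch using only the Python standard library. The helper names `build_url` and `fetch_quality` are hypothetical. The URL pattern (category, then owner, then repository) is inferred from the single curl example above, and the sketch assumes the endpoint returns a JSON body, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL (hypothetical helper, pattern is an assumption)."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send an unauthenticated GET request and decode the body as JSON.

    Assumes a JSON response; the response schema is not documented here,
    so the decoded object is returned unchanged.
    """
    request = urllib.request.Request(
        build_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("llm-tools", "voidful", "TextRL")` would issue the same request as the curl command. Because unauthenticated access is limited to 100 requests/day, caching the result locally is advisable for repeated use.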
Related tools
openai/openai-cookbook
Examples and guides for using the OpenAI API
rgbkrk/dangermode
Execute IPython & Jupyter from the comforts of chat.openai.com
CogStack/OpenGPT
A framework for creating grounded instruction based datasets and training conversational domain...
antononcube/Python-JupyterChatbook
Python package of a Jupyter extension that facilitates the interaction with LLMs.
Declipsonator/GPTZzzs
Large language model detection evasion through grammar and vocabulary modification.