luizhenriqueds/reddit-br-toxicity-dataset
This repository makes available a new dataset for toxicity detection in Brazilian Portuguese from the work accepted by the 16th International Conference on Computational Processing of Portuguese (PROPOR 2024). The data collected is from the most popular Brazilian subreddits in 2022.
No commits in the last 6 months.
Stars
3
Forks
—
Language
—
License
MIT
Category
Last pushed
Dec 27, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/luizhenriqueds/reddit-br-toxicity-dataset"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
zake7749/DeepToxic
top 1% solution to toxic comment classification challenge on Kaggle.
aralroca/react-text-toxicity
Detect text toxicity in a simple way, using React. Based in a Keras model, loaded with Tensorflow.js.
charliegerard/safe-space
Github action that checks the toxicity level of comments and PR reviews to help make repos safe spaces.
jaydeepjethwa/DeTox
A web-app to identify toxic comments in a youtube channel and delete them.
DenisIndenbom/AntiToxicBot
AntiToxicBot is a bot that detects toxics in a chat using Data Science and Machine Learning...