PavelOstyakov/toxic
Toxic Comment Classification Challenge
Implements multi-label toxic comment classification in Keras with 300-dimensional fastText embeddings, predicting all six toxicity categories at once. The pipeline combines NLTK preprocessing with scikit-learn utilities to train deep learning models that score competitively on the Kaggle Jigsaw competition leaderboard.
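To make "multi-label" concrete: each comment gets one independent probability per category, so a comment can be flagged for several categories at once or for none. The sketch below is not the repository's code (which uses Keras); it uses only the Python standard library to show the per-label sigmoid and binary cross-entropy that such a model typically ends in. The label names are the six columns of the Kaggle Jigsaw dataset; the function names, the logits, and the 0.5 threshold are illustrative assumptions.

```python
import math

# The six label columns of the Kaggle Jigsaw Toxic Comment Classification Challenge.
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]


def sigmoid(x):
    """Map a raw score (logit) to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def predict_labels(logits, threshold=0.5):
    """Turn one logit per label into independent yes/no decisions.

    Unlike softmax, the probabilities are not forced to sum to 1,
    so any subset of labels can be active. The threshold is an
    illustrative assumption, not a value taken from the repository.
    """
    if len(logits) != len(LABELS):
        raise ValueError("expected one logit per label")
    return {label: sigmoid(z) >= threshold for label, z in zip(LABELS, logits)}


def binary_cross_entropy(probs, targets, eps=1e-7):
    """Mean binary cross-entropy over the labels of a single comment."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1.0 - p))
    return total / len(probs)


# Hypothetical logits for one comment, as a model's final layer might emit.
example = predict_labels([2.0, -3.0, 1.0, -4.0, 0.5, -1.0])
```

Here `example` marks `toxic`, `obscene`, and `insult` as active and the other three as inactive, which a single-label (softmax) classifier could not express.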
266 stars. No commits in the last 6 months.
Stars: 266
Forks: 73
Language: Python
License: MIT
Category:
Last pushed: Jan 22, 2018
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/PavelOstyakov/toxic"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
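The same request can be made from Python with the standard library. This is a minimal sketch under stated assumptions: the URL pattern (category, then owner, then repository) is inferred from the single example above, the helper names are hypothetical, and the response is returned as raw text decoded as UTF-8 because the page does not describe its format.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example; the segment layout after it
# (category / owner / repo) is inferred from that one example.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the endpoint and return the body as text.

    The body is decoded as UTF-8 (an assumption) and not parsed,
    since the response schema is not documented on this page.
    """
    with urlopen(build_quality_url(category, owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Usage (performs a network request, counts against the daily limit):
# print(fetch_quality("nlp", "PavelOstyakov", "toxic"))
```

With the keyless limit of 100 requests/day, it is worth caching the returned text locally rather than refetching it on every run.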
Higher-rated alternatives
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built...
kensk8er/chicksexer
A Python package for gender classification.
Infinitode/ValX
ValX is an open-source Python package for text cleaning tasks, including profanity detection and...
minerva-ml/open-solution-toxic-comments
Open solution to the Toxic Comment Classification Challenge
IBM/MAX-Toxic-Comment-Classifier
Detect 6 types of toxicity in user comments.