Toxic Comment Detection NLP Tools
Tools for detecting, classifying, and analyzing toxic, offensive, or harmful language in text (comments, messages, chat). Includes severity scoring and content moderation. Does NOT include general sentiment analysis, hate speech as a separate category, or profanity filtering without toxicity context.
There are 154 toxic comment detection tools tracked. 1 score above 70 (verified tier). The highest-rated is unitaryai/detoxify at 76/100 with 1,202 stars and 94,691 monthly downloads. 1 of the top 10 are actively maintained.
Get all 154 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=toxic-comment-detection&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic... |
|
Verified |
| 2 |
kensk8er/chicksexer
A Python package for gender classification. |
|
Established |
| 3 |
Infinitode/ValX
ValX is an open-source Python package for text cleaning tasks, including... |
|
Established |
| 4 |
PavelOstyakov/toxic
Toxic Comment Classification Challenge |
|
Emerging |
| 5 |
minerva-ml/open-solution-toxic-comments
Open solution to the Toxic Comment Classification Challenge |
|
Emerging |
| 6 |
IBM/MAX-Toxic-Comment-Classifier
Detect 6 types of toxicity in user comments. |
|
Emerging |
| 7 |
kocohub/korean-hate-speech
Korean HateSpeech Dataset |
|
Emerging |
| 8 |
gokulkarthik/hateclipper
Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit... |
|
Emerging |
| 9 |
JoshPola96/dissertation-racism-detection-bert-cnn-bilstm
Bias-minimized multimodal deep learning system for racism detection in... |
|
Emerging |
| 10 |
joaoaleite/ToLD-Br
Toxic Language Detection in Social Media for Brazilian Portuguese: New... |
|
Emerging |
| 11 |
rezacsedu/Bengali-Hate-Speech-Dataset
Dataset for identifying potential hates (e.g., political, religious,... |
|
Emerging |
| 12 |
WSU-SEAL/ToxiCR
A supervised learning based tool to identify toxic code review comments |
|
Emerging |
| 13 |
leeyejin1231/KOTOX
A Korean Toxic Text for Obfuscation and Detoxification |
|
Emerging |
| 14 |
ahmedhammad97/Offensive-Language-Detection
NLP model that uses Machine Learning to detect offensive tweets, and... |
|
Emerging |
| 15 |
hate-alert/Countering_Hate_Speech_ICWSM2019
Repository for the paper "Thou shalt not hate: Countering Online Hate... |
|
Emerging |
| 16 |
rezacsedu/Classification_Benchmarks_Benglai_NLP
Classification Benchmarks for Under-resourced Bengali Language based on... |
|
Emerging |
| 17 |
hate-alert/Tutorial-Resources
Resources and tools for the Tutorial - "Hate speech detection, mitigation... |
|
Emerging |
| 18 |
datascisteven/Automated-Hate-Tweet-Detection
Developing a classification model to detect hate tweets ready for deployment... |
|
Emerging |
| 19 |
leeyejin1231/RV-HATE
Reinforced Multi-Module Voting for Implicit Hate Speech Detection |
|
Emerging |
| 20 |
leeyejin1231/AmpleHate
AmpleHate: Amplify target-context relationships for implicit hate speech... |
|
Emerging |
| 21 |
Prakhar-FF13/Toxic-Comments-Classification
Predict the toxicity rating of comment made by the user. |
|
Emerging |
| 22 |
NakulLakhotia/Hate-Speech-Detection-in-Social-Media-using-Python
Python code to detect hate speech and classify twitter texts using NLP... |
|
Experimental |
| 23 |
s-nlp/rudetoxifier
Code and data of "Methods for Detoxification of Texts for the Russian Language" paper |
|
Experimental |
| 24 |
datascisteven/Twitter-Sentiment-Analysis
Using NLP techniques and Sentiment Analysis of Hate Tweets for Automated... |
|
Experimental |
| 25 |
gesiscss/sexism_custom_classifier
Custom classifiers to detect sexist language. |
|
Experimental |
| 26 |
JherezTaylor/hatespeech_codewords
A contextual approach for detecting hate speech code words |
|
Experimental |
| 27 |
hate-alert/HateALERT-EVALITA
Code for replicating results of team 'hateminers' at EVALITA-2018 for AMI task |
|
Experimental |
| 28 |
EliasCai/bert-toxicity-classification
bert on Jigsaw Unintended Bias in Toxicity Classification |
|
Experimental |
| 29 |
FelipeLuz/dotnet-bad-word-detector-and-filter
.NET library that uses machine learning to detect bad words (profanity)... |
|
Experimental |
| 30 |
WSU-SEAL/ToxiSpanSE
An Explainable Toxicity detector for code review comments. Published in ESEM'2023 |
|
Experimental |
| 31 |
ignacioct/Temis
Temis is an Automatic Misogyny Identification tool. Using Deep Learning... |
|
Experimental |
| 32 |
faizaladhitama/Hierarchical-Multi-Label-Classification-API
Hierarchical Multi Label Hate Speech and Abusive Language Classification |
|
Experimental |
| 33 |
im-syn/SafeContentText
A Python toolkit and web API for detecting “bad” content (profanity, hate... |
|
Experimental |
| 34 |
prashanthm07/Twitter_Sentiment_Analysis-Using-ML-and-NLP
The objective of this project is to detect hate speech in tweets. For the... |
|
Experimental |
| 35 |
jmjmalik22/Hate-Speech-Detection
Official implementation of the paper "Deep Learning for Hate Speech... |
|
Experimental |
| 36 |
sidneykung/twitter_hate_speech_detection
Capstone project to automate Twitter hate speech detection with... |
|
Experimental |
| 37 |
MarcinCiura/przetak
Przetak: fewer weeds on the Web |
|
Experimental |
| 38 |
jrzvnn/lfcm-racism
Enhances racist post detection by combining comment features with visual and... |
|
Experimental |
| 39 |
alessandrocuda/ReSt
This project aims to investigate different models for solving the stereotype... |
|
Experimental |
| 40 |
RexGod/NLP-Project
Analyze user comments through Natural Language Processing (NLP) techniques... |
|
Experimental |
| 41 |
priyanshkedia04/Tweets-Classification-among-Hate-Speech-Offensive-Neither
Twitter Dataset was used to classify the tweets as Offensive, Hate Text, and... |
|
Experimental |
| 42 |
SamuelTelesSilva/toxic_comment_detection
Detecção de comentários ou textos preconceituosos com processamento de... |
|
Experimental |
| 43 |
hbujakow/hateful_memes_detection
Joint work on utilizing a combination of NLP and CV methods in implementing... |
|
Experimental |
| 44 |
fer-aguirre/pmdm
Political Misogynistic Discourse Monitor team from the 2021 JournalismAI... |
|
Experimental |
| 45 |
esceptico/toxic
End-to-end toxic Russian comment classification |
|
Experimental |
| 46 |
dvdbisong/automl-toxicity-classification
Google Cloud AutoML Natural Language for Toxicity Classification |
|
Experimental |
| 47 |
IRLab-UDC/metahate
MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection @ ICWSM '24 |
|
Experimental |
| 48 |
batuhanguler/Offensive-language-detection
Classification of Offensive tweets, part of OffensEval 2019 Competition. |
|
Experimental |
| 49 |
dimitreOliveira/Jigsaw-Multilingual-Toxic-Comment-Classification
:3rd_place_medal: (Bronze medal - 100th place - Top 7%) Repository for the... |
|
Experimental |
| 50 |
manashpratim/Tweet-Classification
Detect hate speech in tweets |
|
Experimental |
| 51 |
TaqsBlaze/Hush
Hush: A lightweight, context-aware text toxicity classifier. Leveraging NLP... |
|
Experimental |
| 52 |
mrsac7/Data-Mining-Project
Hate Speech Detection | Data Mining (CSE-362) Project | IIT (BHU) Varanasi |... |
|
Experimental |
| 53 |
manishshettym/Offensive-Text-Detection
Detect offensive (Racist, Sexist etc.) text from social media posts (tweets)... |
|
Experimental |
| 54 |
Life-Experimentalist/ToxicGuard_AI
ToxiGuard AI is a browser extension that detects and censors toxic language... |
|
Experimental |
| 55 |
satyaSK/Hate-Speech-Detection
Hate speech detection from code-mixed Hindi-English tweets using deep... |
|
Experimental |
| 56 |
imrahulr/Toxic-Comment-Classification-Kaggle
Deep Learning for Toxic Comment Classification |
|
Experimental |
| 57 |
LCS2-IIITD/Hate_Norm
[KDD 2022] Proactively Reducing the Hate Intensity of Online Posts via Hate... |
|
Experimental |
| 58 |
databricks-industry-solutions/toxicity-detection-in-gaming
Build a lakehouse for all your gamer data and use natural language... |
|
Experimental |
| 59 |
baroquerock/detoxify
Chrome extension which detects, classifies and highlights toxic comments... |
|
Experimental |
| 60 |
ciwang/deep_hatespeech
Classifying hate speech with deep learning (honors thesis 2017-18) |
|
Experimental |
| 61 |
a-abuzayed/Hate-Speech-Detection_OSACT4-Workshop
Quick and Simple Approach for Detecting Hate Speech in Arabic Tweets. |
|
Experimental |
| 62 |
minhd-vu/toxicity-filter
Natural language processing API to detect toxic chat. |
|
Experimental |
| 63 |
TiagoMAntunes/KAREN
KAREN: Unifying Hatespeech Detection and Benchmarking |
|
Experimental |
| 64 |
andiosika/NLP-to-identify-toxic-or-abusive-language-for-online-conversation-using-Keras-Deep-Learning-Models
Natural Language Processing: A multi-headed model capable of detecting... |
|
Experimental |
| 65 |
kroudir/Twitter-Sentiment-Analysis-with-python
The objective of this project is to detect hate speech in tweets. For the... |
|
Experimental |
| 66 |
onur-cubukcu/cop28-hate-speech
Hate speech detection pipeline for #COP28 tweets using BERT — TU Dortmund... |
|
Experimental |
| 67 |
Mujjjtaba/Hate-Speech-Detection-On-Social-Media
Hate Speech Detection system using NLP and supervised machine learning to... |
|
Experimental |
| 68 |
thefirebanks/Ensemble-Learning-for-Tweet-Classification-of-Hate-Speech-and-Offensive-Language
Contains code for a voting classifier that is part of an ensemble learning... |
|
Experimental |
| 69 |
pinarhaskiris/hate-speech-recognizer-ml
An hate-speech-recognizer implemented using three different machine learning... |
|
Experimental |
| 70 |
yilingchung/counternarrative-resources
Resources on hate speech moderation through counter narratives, counter speech |
|
Experimental |
| 71 |
andrewwhitman/HatefulMemesDetection
Classification of memes as hateful or not based on the textual content of the meme. |
|
Experimental |
| 72 |
aflah02/Combating_Hate_Scalably
Vote! Don’t Retrain: Combating Hate in a Scalable Way. This work is an... |
|
Experimental |
| 73 |
RafayKhattak/ToxiScan
ToxiScan is a text analysis tool that leverages the power of Natural... |
|
Experimental |
| 74 |
Carmezim/ToxicCommentsClassification
Bidirectional LSTM in TensorFlow for Kaggle Text Classification Competition |
|
Experimental |
| 75 |
TanYouxi/Chinese-Toxic-Comment-Detection
This repository presents a comprehensive comparison of traditional machine... |
|
Experimental |
| 76 |
Sreyan88/SemEval-2021-Toxic-Spans-Detection
This repository contains code for the paper: Cisco at SemEval-2021 Task 5:... |
|
Experimental |
| 77 |
SiddeshSambasivam/Detox
Detox is a web service that offers multilingual profanity filtering which... |
|
Experimental |
| 78 |
MeetMulik/E-Takraar
Platform designed to create a secure and inclusive online environment by... |
|
Experimental |
| 79 |
4ndrewparr/jigsaw-toxicity-classification
Training and inference code from the ML models used for the Jigsaw Kaggle... |
|
Experimental |
| 80 |
s-nlp/parallel_detoxification_dataset
Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for... |
|
Experimental |
| 81 |
pri1311/TweeToxicity
TweeToxicity is a program that analyzes user profiles or hashtags based on... |
|
Experimental |
| 82 |
JustLearningThings/Null--NLP
We trained a model that will predict if an article or comment is offensive,... |
|
Experimental |
| 83 |
2forts/hateval-trainer
Hybrid quantum–classical neural network for Spanish hate speech detection... |
|
Experimental |
| 84 |
abdouaziz/gbv
Gender Based Violence |
|
Experimental |
| 85 |
vidhur2k/Multilngual-Hate-Speech
Highly Generalizable Models for Multilingual Hate Speech Detection (CSE8803... |
|
Experimental |
| 86 |
danielmuthama/Hate-Speech-and-Offensive-Language-Detection
Hate Speech and Offensive Language Detection |
|
Experimental |
| 87 |
neemiasbsilva/kedro_orchestrate_dl_transformer_arch
A pipeline using Kedro to orchestrate the deployment of a deep learning... |
|
Experimental |
| 88 |
Runoi/lightweight-toxic-classifier
🚀 Production-ready API for detecting toxic comments in Russian text, powered... |
|
Experimental |
| 89 |
ashen8810/Sinhala-hate-speech
A Model Using LSTM |
|
Experimental |
| 90 |
BatsResearch/cross-lingual-detox
Code for "Preference Tuning For Toxicity Mitigation Generalizes Across... |
|
Experimental |
| 91 |
lwhsd/hatespeech-demo
Indonesian hate speech classification |
|
Experimental |
| 92 |
cassiasilvaR/TwitterHateBR
Repositório que contém dados coletados do Twitter durante as eleições... |
|
Experimental |
| 93 |
vitthal-bhandari/Homophobia-Transphobia-Detection
Code for the shared task on homophobia/transphobia detection at LT-EDI... |
|
Experimental |
| 94 |
Kushaagra-exe/ToxicCommentsMultiLabelClassification
Toxic Multi Label Classification for Text using NLP and Logistic Regression |
|
Experimental |
| 95 |
harilexm/hatespeech-detection
ML-based hate speech detection system for Roman Urdu — built from scratch... |
|
Experimental |
| 96 |
Adnanabbas0398/toxic-comment-classifier
ML pipeline that classifies YouTube/online comments as toxic or clean using... |
|
Experimental |
| 97 |
Sanu700/ToxicBuddy
Chat toxicity analyzer — detects toxic messages, generates per-user stats,... |
|
Experimental |
| 98 |
abirmondal/multi-label-hate-speech-classification
In this project we have tried to do multi-label hate-speech classification... |
|
Experimental |
| 99 |
shubhamgupta407/Patrol-notes-violation-on-device-inference
Machine learning system for on-device inference that analyzes patrol notes... |
|
Experimental |
| 100 |
GigasTaufan/Indonesian-Hate-Speech-Classification
Program akan melakukan klasifikasi terhadap tweet berbahasa Indonesia yang... |
|
Experimental |
| 101 |
twirapp/toxicity-detector
Simple Multi-Language HTTP Server Text Toxicity Detector |
|
Experimental |
| 102 |
mbahadirk/Offensive-Text-Classification
Turkish Toxic Comment Classification |
|
Experimental |
| 103 |
mllamazares/auto-abuse-asvs
🥊 Automagically select the top ASVS security controls for a given Abuse Case... |
|
Experimental |
| 104 |
sabinleee/anti-cursing
python package that detects and switches negative or any kind of cursing... |
|
Experimental |
| 105 |
tanyelai/lingda
UBMK 2022 Conference Paper: Linguistic-based Data Augmentation Approach for... |
|
Experimental |
| 106 |
naitian/social-memeing
Code and data for "Social Meme-ing: Measuring Linguistic Variation in Memes" |
|
Experimental |
| 107 |
ogtal/A-ttack
Dette repository indeholder kode og modelvægtene til A&ttack algortimen. |
|
Experimental |
| 108 |
AindriyaBarua/anti-hate-reddit-Hinglish-bot-with-PRAW-and-BERT
Reddit bot that detects Hindi-English code-mixed hate speech in comments in... |
|
Experimental |
| 109 |
ogtal/Ha-te
Dette repository indeholder kode og modelvægtene til Ha&te algortimen. |
|
Experimental |
| 110 |
jnnpk/hate_instigating_speech_dataset
Uncovering theRoot of Hate Speech: A Dataset for Identifying Hate... |
|
Experimental |
| 111 |
hoangcaobao/projectube-sentiment-analysis
TensorFlow and PyTorch deep learning model to detect Vietnamese toxic... |
|
Experimental |
| 112 |
sigvt/sensai-dataset
❤️🩹 Sensai: Toxic Live Chat Dataset |
|
Experimental |
| 113 |
awinml/jigsaw-toxic-comment-clf
Built a multilingual text classification model to predict the probability... |
|
Experimental |
| 114 |
hoangcaobao/Vietnamese-Toxic-Comment-Classifier
PyTorch deep learning model to detect toxic of Vietnamese sentences using Bert |
|
Experimental |
| 115 |
sid-shnkar/Social-Media-Toxic-Comments-Classification
A Machine learning project on NLP to detect different types of toxicity like... |
|
Experimental |
| 116 |
MilaNLProc/crosslingual-analysis-homotransphobia
This repository contains data and code used in the paper "A Crosslingual... |
|
Experimental |
| 117 |
JaweriaAsif745/Hate_Offensive_Speech_Detection
Hate & Offensive Speech Detection using NLP and Random Forest | Flask Web... |
|
Experimental |
| 118 |
margaritageleta/multilingual-toxicity-detector
NLP deep learning model for multilingual toxicity detection in text 📚 |
|
Experimental |
| 119 |
EdoWhite/toxic-comments-classification
End-to-end comments classification task focusing on Semantic Representation... |
|
Experimental |
| 120 |
dinraj910/HateGuard-AI-Real-time-Hate-speech-Detection
HateGuard AI is an end-to-end hate speech detection system categories —... |
|
Experimental |
| 121 |
qbxlvnf11/aggression-identification-in-text
Identifying trolling, aggression, cyber-bullying and hate speech etc. Three... |
|
Experimental |
| 122 |
BrianMburu/Distiled-BERT-model-training-pytorch
A noote book to showcases training the Distilled Bert model on Toxic... |
|
Experimental |
| 123 |
Rayyan9477/Hate-Speech-Detection
This project is an end-to-end machine learning pipeline designed to evaluate... |
|
Experimental |
| 124 |
BrianMburu/Toxic-Comment-Explanatory-Data-Analysis
This notebook analyzes a dataset of toxic comments using NLP techniques such... |
|
Experimental |
| 125 |
Dutta-SD/AggDetectApp
A web application that detects aggression and misogyny in text using BERT... |
|
Experimental |
| 126 |
Dutta-SD/AggDetect
Code for Aggression and Misogyny Detection using BERT augmentation... |
|
Experimental |
| 127 |
MilaNLProc/hate-ita
Hate Speech classification in Italian using XLM (fine-tuning). Published at... |
|
Experimental |
| 128 |
alexvishnevskiy/jigsaw
Jigsaw competition solution |
|
Experimental |
| 129 |
timthedev07/toxic-comment-classification
Detecting toxic comments with machine learning. |
|
Experimental |
| 130 |
vipul-shinde/toxic-comment-classification
Toxic Comment Classifier Web App deployed using Flask and AWS. |
|
Experimental |
| 131 |
azizbarank/Toxic-Comment-Detector
This project applies classification models with the aim of automating the... |
|
Experimental |
| 132 |
Jayveersinh-Raj/cross-lingual-zero-shot-transfer
A cross lingual toxicity detection model that works for over 100 languages.... |
|
Experimental |
| 133 |
gungorefecetin/toxic-comment-severity-analysis
Fine-tuned DistilBERT for multi-label toxic comment classification with... |
|
Experimental |
| 134 |
chonzadaniel/Racist-Detection-App
This is a production-ready, end-to-end system developed to detect and... |
|
Experimental |
| 135 |
bremelket/toxic-comment-classification-for-content-moderation
Automated toxic comment detection system for online community moderation... |
|
Experimental |
| 136 |
daniel-023/Offensive-Speech-Detection
Comparing text vectorisation techniques with machine learning for offensive... |
|
Experimental |
| 137 |
pashudzu/ToxicCommentClassificationAI
🧪 Classifies toxic comments (EN + RU) using TF-IDF and logistic regression. |
|
Experimental |
| 138 |
yucellyillmaz/smelligent-inline-code-comment-smell-detection
Experimental code for the Smelligent study on detecting inline code comment smells. |
|
Experimental |
| 139 |
orozg5/hate-speech-analysis
A text classification project to detect Hate Speech and Offensive Language... |
|
Experimental |
| 140 |
Revanth-Reddy-Pingala/Abusive_Comment_Detector_BERT
Fine tuned BERT, mBERT and XLMRoBERTa for Abusive Comments Detection in... |
|
Experimental |
| 141 |
El-Bahnasawi/AI-Powered-Textual-Hate-Content-Moderation
Real-time hate-speech filter: regex → LoRA-tuned BERTweet (530K tweets, 84.5... |
|
Experimental |
| 142 |
gangula-karthik/Hateful-Meme-Classifier
A multimodal AI that spots hateful memes by looking at both 🖼️ images and 📝... |
|
Experimental |
| 143 |
fc63/gender-classification
Fine-tuned DeBERTa-v3-large model for gender prediction from English text... |
|
Experimental |
| 144 |
Nyx1311/Toxicity-Detector-using-BiLSTM
🚀 What we built: An AI-powered Women’s Safety & Well-Being Detector — a web... |
|
Experimental |
| 145 |
D-Chandrahas/Toxicity-Classifier
RoBERTa based hatespeech & toxicity classifier |
|
Experimental |
| 146 |
Ashutosh-Singh-Thakur/Toxic-Comment-Classification
Toxic Comment Classification - An NLP project using TensorFlow and LSTM for... |
|
Experimental |
| 147 |
SURESHBEEKHANI/Hate-Speech-Classifier-LSTM
A Deep learning project that detects hate speech in text using LSTM-based... |
|
Experimental |
| 148 |
e-vdb/toxic-comment-classification
Multi-label classification on toxic-comment dataset |
|
Experimental |
| 149 |
chakshu-dhannawat/Toxic-Comment-Classification
NLP based classification pipeline for detecting toxic comments |
|
Experimental |
| 150 |
skyradez/Audio-based-Indonesia-Toxic-Language-Classification-using-RNN-Speech-Recognition-and-NLP
Audio-based Indonesia Toxic Language Classification using Recurrent Neural... |
|
Experimental |
| 151 |
dilettagoglia/BiBiNET
💬A Gated Recurrent Neural Network for Supervised Text Classification:... |
|
Experimental |
| 152 |
amsqr/SEMEVAL-2023-EDOS-solution
Solution using adversarial training for the explainable detection of sexism... |
|
Experimental |
| 153 |
PoojanDoshi11/Speech_Detection
🗣️ Speech Type Detection is a Flask app to classifies text into categories... |
|
Experimental |
| 154 |
AklimaRimi/NegativeCommentDetector
Catch Negative Comments and Classify |
|
Experimental |