Toxic Comment Detection NLP Tools

Tools for detecting, classifying, and analyzing toxic, offensive, or harmful language in text (comments, messages, chat). Includes severity scoring and content moderation. Does NOT include general sentiment analysis, hate speech as a separate category, or profanity filtering without toxicity context.

154 toxic comment detection tools are tracked. One scores above 70 (Verified tier). The highest-rated is unitaryai/detoxify at 76/100, with 1,202 stars and 94,691 monthly downloads. One of the top 10 is actively maintained.
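The tier labels in the listing appear to follow score bands. The sketch below maps a score to a tier using cutoffs inferred from the rows shown on this page (76 is Verified, 50 to 51 is Established, 30 to 42 is Emerging, 10 to 29 is Experimental). The function name `tier_for_score` and the exact boundaries at 70, 50, and 30 are assumptions; the page only states that Verified means a score above 70.

```python
def tier_for_score(score: int) -> str:
    """Map a 0-100 quality score to a tier label.

    Cutoffs are ASSUMED from the rows listed on this page; only the
    "above 70 = Verified" rule is stated by the source.
    """
    if score > 70:        # stated by the source: "above 70 (verified tier)"
        return "Verified"
    if score >= 50:       # assumed: lowest Established score seen is 50
        return "Established"
    if score >= 30:       # assumed: lowest Emerging score seen is 30
        return "Emerging"
    return "Experimental"  # everything below the assumed Emerging cutoff
```

For example, `tier_for_score(76)` returns `"Verified"` and `tier_for_score(29)` returns `"Experimental"`, matching the rows for unitaryai/detoxify and s-nlp/rudetoxifier.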

Get all 154 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=toxic-comment-detection&limit=20"
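The same request can be made from Python with only the standard library. This is a minimal sketch: the endpoint and the three query parameters (`domain`, `subcategory`, `limit`) come from the curl command above, while the helper names `build_quality_url` and `fetch_quality` are hypothetical. The shape of the JSON response is not documented on this page, so the sketch returns the parsed payload without assuming any fields.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="nlp", subcategory="toxic-comment-detection", limit=20):
    """Build the request URL from the query parameters shown in the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=10):
    """Fetch the URL and return the parsed JSON payload (structure not assumed)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality(build_quality_url())` issues the same request as the curl command. Note that the example uses `limit=20`; whether the API accepts a larger limit or offers pagination is not stated here.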

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 unitaryai/detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic...

76
Verified
2 kensk8er/chicksexer

A Python package for gender classification.

51
Established
3 Infinitode/ValX

ValX is an open-source Python package for text cleaning tasks, including...

50
Established
4 PavelOstyakov/toxic

Toxic Comment Classification Challenge

42
Emerging
5 minerva-ml/open-solution-toxic-comments

Open solution to the Toxic Comment Classification Challenge

41
Emerging
6 IBM/MAX-Toxic-Comment-Classifier

Detect 6 types of toxicity in user comments.

39
Emerging
7 kocohub/korean-hate-speech

Korean HateSpeech Dataset

35
Emerging
8 gokulkarthik/hateclipper

Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit...

35
Emerging
9 JoshPola96/dissertation-racism-detection-bert-cnn-bilstm

Bias-minimized multimodal deep learning system for racism detection in...

35
Emerging
10 joaoaleite/ToLD-Br

Toxic Language Detection in Social Media for Brazilian Portuguese: New...

34
Emerging
11 rezacsedu/Bengali-Hate-Speech-Dataset

Dataset for identifying potential hates (e.g., political, religious,...

34
Emerging
12 WSU-SEAL/ToxiCR

A supervised learning based tool to identify toxic code review comments

34
Emerging
13 leeyejin1231/KOTOX

A Korean Toxic Text for Obfuscation and Detoxification

34
Emerging
14 ahmedhammad97/Offensive-Language-Detection

NLP model that uses Machine Learning to detect offensive tweets, and...

34
Emerging
15 hate-alert/Countering_Hate_Speech_ICWSM2019

Repository for the paper "Thou shalt not hate: Countering Online Hate...

33
Emerging
16 rezacsedu/Classification_Benchmarks_Benglai_NLP

Classification Benchmarks for Under-resourced Bengali Language based on...

33
Emerging
17 hate-alert/Tutorial-Resources

Resources and tools for the Tutorial - "Hate speech detection, mitigation...

31
Emerging
18 datascisteven/Automated-Hate-Tweet-Detection

Developing a classification model to detect hate tweets ready for deployment...

31
Emerging
19 leeyejin1231/RV-HATE

Reinforced Multi-Module Voting for Implicit Hate Speech Detection

30
Emerging
20 leeyejin1231/AmpleHate

AmpleHate: Amplify target-context relationships for implicit hate speech...

30
Emerging
21 Prakhar-FF13/Toxic-Comments-Classification

Predict the toxicity rating of a comment made by the user.

30
Emerging
22 NakulLakhotia/Hate-Speech-Detection-in-Social-Media-using-Python

Python code to detect hate speech and classify twitter texts using NLP...

29
Experimental
23 s-nlp/rudetoxifier

Code and data of "Methods for Detoxification of Texts for the Russian Language" paper

29
Experimental
24 datascisteven/Twitter-Sentiment-Analysis

Using NLP techniques and Sentiment Analysis of Hate Tweets for Automated...

28
Experimental
25 gesiscss/sexism_custom_classifier

Custom classifiers to detect sexist language.

28
Experimental
26 JherezTaylor/hatespeech_codewords

A contextual approach for detecting hate speech code words

28
Experimental
27 hate-alert/HateALERT-EVALITA

Code for replicating results of team 'hateminers' at EVALITA-2018 for AMI task

28
Experimental
28 EliasCai/bert-toxicity-classification

BERT on Jigsaw Unintended Bias in Toxicity Classification

27
Experimental
29 FelipeLuz/dotnet-bad-word-detector-and-filter

.NET library that uses machine learning to detect bad words (profanity)...

27
Experimental
30 WSU-SEAL/ToxiSpanSE

An Explainable Toxicity detector for code review comments. Published in ESEM'2023

26
Experimental
31 ignacioct/Temis

Temis is an Automatic Misogyny Identification tool. Using Deep Learning...

26
Experimental
32 faizaladhitama/Hierarchical-Multi-Label-Classification-API

Hierarchical Multi Label Hate Speech and Abusive Language Classification

26
Experimental
33 im-syn/SafeContentText

A Python toolkit and web API for detecting “bad” content (profanity, hate...

26
Experimental
34 prashanthm07/Twitter_Sentiment_Analysis-Using-ML-and-NLP

The objective of this project is to detect hate speech in tweets. For the...

26
Experimental
35 jmjmalik22/Hate-Speech-Detection

Official implementation of the paper "Deep Learning for Hate Speech...

26
Experimental
36 sidneykung/twitter_hate_speech_detection

Capstone project to automate Twitter hate speech detection with...

26
Experimental
37 MarcinCiura/przetak

Przetak: fewer weeds on the Web

25
Experimental
38 jrzvnn/lfcm-racism

Enhances racist post detection by combining comment features with visual and...

25
Experimental
39 alessandrocuda/ReSt

This project aims to investigate different models for solving the stereotype...

24
Experimental
40 RexGod/NLP-Project

Analyze user comments through Natural Language Processing (NLP) techniques...

24
Experimental
41 priyanshkedia04/Tweets-Classification-among-Hate-Speech-Offensive-Neither

Twitter Dataset was used to classify the tweets as Offensive, Hate Text, and...

24
Experimental
42 SamuelTelesSilva/toxic_comment_detection

Detection of prejudiced comments or texts with processing of...

24
Experimental
43 hbujakow/hateful_memes_detection

Joint work on utilizing a combination of NLP and CV methods in implementing...

24
Experimental
44 fer-aguirre/pmdm

Political Misogynistic Discourse Monitor team from the 2021 JournalismAI...

23
Experimental
45 esceptico/toxic

End-to-end toxic Russian comment classification

23
Experimental
46 dvdbisong/automl-toxicity-classification

Google Cloud AutoML Natural Language for Toxicity Classification

23
Experimental
47 IRLab-UDC/metahate

MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection @ ICWSM '24

22
Experimental
48 batuhanguler/Offensive-language-detection

Classification of Offensive tweets, part of OffensEval 2019 Competition.

22
Experimental
49 dimitreOliveira/Jigsaw-Multilingual-Toxic-Comment-Classification

🥉 (Bronze medal - 100th place - Top 7%) Repository for the...

22
Experimental
50 manashpratim/Tweet-Classification

Detect hate speech in tweets

22
Experimental
51 TaqsBlaze/Hush

Hush: A lightweight, context-aware text toxicity classifier. Leveraging NLP...

22
Experimental
52 mrsac7/Data-Mining-Project

Hate Speech Detection | Data Mining (CSE-362) Project | IIT (BHU) Varanasi |...

22
Experimental
53 manishshettym/Offensive-Text-Detection

Detect offensive (Racist, Sexist etc.) text from social media posts (tweets)...

22
Experimental
54 Life-Experimentalist/ToxicGuard_AI

ToxiGuard AI is a browser extension that detects and censors toxic language...

21
Experimental
55 satyaSK/Hate-Speech-Detection

Hate speech detection from code-mixed Hindi-English tweets using deep...

21
Experimental
56 imrahulr/Toxic-Comment-Classification-Kaggle

Deep Learning for Toxic Comment Classification

21
Experimental
57 LCS2-IIITD/Hate_Norm

[KDD 2022] Proactively Reducing the Hate Intensity of Online Posts via Hate...

21
Experimental
58 databricks-industry-solutions/toxicity-detection-in-gaming

Build a lakehouse for all your gamer data and use natural language...

21
Experimental
59 baroquerock/detoxify

Chrome extension which detects, classifies and highlights toxic comments...

20
Experimental
60 ciwang/deep_hatespeech

Classifying hate speech with deep learning (honors thesis 2017-18)

20
Experimental
61 a-abuzayed/Hate-Speech-Detection_OSACT4-Workshop

Quick and Simple Approach for Detecting Hate Speech in Arabic Tweets.

20
Experimental
62 minhd-vu/toxicity-filter

Natural language processing API to detect toxic chat.

20
Experimental
63 TiagoMAntunes/KAREN

KAREN: Unifying Hatespeech Detection and Benchmarking

19
Experimental
64 andiosika/NLP-to-identify-toxic-or-abusive-language-for-online-conversation-using-Keras-Deep-Learning-Models

Natural Language Processing: A multi-headed model capable of detecting...

19
Experimental
65 kroudir/Twitter-Sentiment-Analysis-with-python

The objective of this project is to detect hate speech in tweets. For the...

19
Experimental
66 onur-cubukcu/cop28-hate-speech

Hate speech detection pipeline for #COP28 tweets using BERT — TU Dortmund...

19
Experimental
67 Mujjjtaba/Hate-Speech-Detection-On-Social-Media

Hate Speech Detection system using NLP and supervised machine learning to...

18
Experimental
68 thefirebanks/Ensemble-Learning-for-Tweet-Classification-of-Hate-Speech-and-Offensive-Language

Contains code for a voting classifier that is part of an ensemble learning...

18
Experimental
69 pinarhaskiris/hate-speech-recognizer-ml

A hate-speech-recognizer implemented using three different machine learning...

18
Experimental
70 yilingchung/counternarrative-resources

Resources on hate speech moderation through counter narratives, counter speech

18
Experimental
71 andrewwhitman/HatefulMemesDetection

Classification of memes as hateful or not based on the textual content of the meme.

17
Experimental
72 aflah02/Combating_Hate_Scalably

Vote! Don’t Retrain: Combating Hate in a Scalable Way. This work is an...

17
Experimental
73 RafayKhattak/ToxiScan

ToxiScan is a text analysis tool that leverages the power of Natural...

17
Experimental
74 Carmezim/ToxicCommentsClassification

Bidirectional LSTM in TensorFlow for Kaggle Text Classification Competition

17
Experimental
75 TanYouxi/Chinese-Toxic-Comment-Detection

This repository presents a comprehensive comparison of traditional machine...

17
Experimental
76 Sreyan88/SemEval-2021-Toxic-Spans-Detection

This repository contains code for the paper: Cisco at SemEval-2021 Task 5:...

17
Experimental
77 SiddeshSambasivam/Detox

Detox is a web service that offers multilingual profanity filtering which...

16
Experimental
78 MeetMulik/E-Takraar

Platform designed to create a secure and inclusive online environment by...

16
Experimental
79 4ndrewparr/jigsaw-toxicity-classification

Training and inference code from the ML models used for the Jigsaw Kaggle...

16
Experimental
80 s-nlp/parallel_detoxification_dataset

Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for...

16
Experimental
81 pri1311/TweeToxicity

TweeToxicity is a program that analyzes user profiles or hashtags based on...

16
Experimental
82 JustLearningThings/Null--NLP

We trained a model that will predict if an article or comment is offensive,...

16
Experimental
83 2forts/hateval-trainer

Hybrid quantum–classical neural network for Spanish hate speech detection...

16
Experimental
84 abdouaziz/gbv

Gender Based Violence

16
Experimental
85 vidhur2k/Multilngual-Hate-Speech

Highly Generalizable Models for Multilingual Hate Speech Detection (CSE8803...

16
Experimental
86 danielmuthama/Hate-Speech-and-Offensive-Language-Detection

Hate Speech and Offensive Language Detection

15
Experimental
87 neemiasbsilva/kedro_orchestrate_dl_transformer_arch

A pipeline using Kedro to orchestrate the deployment of a deep learning...

15
Experimental
88 Runoi/lightweight-toxic-classifier

🚀 Production-ready API for detecting toxic comments in Russian text, powered...

15
Experimental
89 ashen8810/Sinhala-hate-speech

A Model Using LSTM

15
Experimental
90 BatsResearch/cross-lingual-detox

Code for "Preference Tuning For Toxicity Mitigation Generalizes Across...

15
Experimental
91 lwhsd/hatespeech-demo

Indonesian hate speech classification

14
Experimental
92 cassiasilvaR/TwitterHateBR

Repository containing data collected from Twitter during the elections...

14
Experimental
93 vitthal-bhandari/Homophobia-Transphobia-Detection

Code for the shared task on homophobia/transphobia detection at LT-EDI...

14
Experimental
94 Kushaagra-exe/ToxicCommentsMultiLabelClassification

Toxic Multi Label Classification for Text using NLP and Logistic Regression

14
Experimental
95 harilexm/hatespeech-detection

ML-based hate speech detection system for Roman Urdu — built from scratch...

14
Experimental
96 Adnanabbas0398/toxic-comment-classifier

ML pipeline that classifies YouTube/online comments as toxic or clean using...

14
Experimental
97 Sanu700/ToxicBuddy

Chat toxicity analyzer — detects toxic messages, generates per-user stats,...

14
Experimental
98 abirmondal/multi-label-hate-speech-classification

In this project we have tried to do multi-label hate-speech classification...

14
Experimental
99 shubhamgupta407/Patrol-notes-violation-on-device-inference

Machine learning system for on-device inference that analyzes patrol notes...

14
Experimental
100 GigasTaufan/Indonesian-Hate-Speech-Classification

The program will classify Indonesian-language tweets that...

14
Experimental
101 twirapp/toxicity-detector

Simple Multi-Language HTTP Server Text Toxicity Detector

14
Experimental
102 mbahadirk/Offensive-Text-Classification

Turkish Toxic Comment Classification

14
Experimental
103 mllamazares/auto-abuse-asvs

🥊 Automagically select the top ASVS security controls for a given Abuse Case...

13
Experimental
104 sabinleee/anti-cursing

python package that detects and switches negative or any kind of cursing...

13
Experimental
105 tanyelai/lingda

UBMK 2022 Conference Paper: Linguistic-based Data Augmentation Approach for...

13
Experimental
106 naitian/social-memeing

Code and data for "Social Meme-ing: Measuring Linguistic Variation in Memes"

13
Experimental
107 ogtal/A-ttack

This repository contains code and the model weights for the A&ttack algorithm.

13
Experimental
108 AindriyaBarua/anti-hate-reddit-Hinglish-bot-with-PRAW-and-BERT

Reddit bot that detects Hindi-English code-mixed hate speech in comments in...

13
Experimental
109 ogtal/Ha-te

This repository contains code and the model weights for the Ha&te algorithm.

13
Experimental
110 jnnpk/hate_instigating_speech_dataset

Uncovering the Root of Hate Speech: A Dataset for Identifying Hate...

13
Experimental
111 hoangcaobao/projectube-sentiment-analysis

TensorFlow and PyTorch deep learning model to detect Vietnamese toxic...

13
Experimental
112 sigvt/sensai-dataset

❤️‍🩹 Sensai: Toxic Live Chat Dataset

13
Experimental
113 awinml/jigsaw-toxic-comment-clf

Built a multilingual text classification model to predict the probability...

13
Experimental
114 hoangcaobao/Vietnamese-Toxic-Comment-Classifier

PyTorch deep learning model to detect toxicity in Vietnamese sentences using BERT

12
Experimental
115 sid-shnkar/Social-Media-Toxic-Comments-Classification

A Machine learning project on NLP to detect different types of toxicity like...

12
Experimental
116 MilaNLProc/crosslingual-analysis-homotransphobia

This repository contains data and code used in the paper "A Crosslingual...

12
Experimental
117 JaweriaAsif745/Hate_Offensive_Speech_Detection

Hate & Offensive Speech Detection using NLP and Random Forest | Flask Web...

12
Experimental
118 margaritageleta/multilingual-toxicity-detector

NLP deep learning model for multilingual toxicity detection in text 📚

12
Experimental
119 EdoWhite/toxic-comments-classification

End-to-end comments classification task focusing on Semantic Representation...

12
Experimental
120 dinraj910/HateGuard-AI-Real-time-Hate-speech-Detection

HateGuard AI is an end-to-end hate speech detection system categories —...

12
Experimental
121 qbxlvnf11/aggression-identification-in-text

Identifying trolling, aggression, cyber-bullying and hate speech etc. Three...

12
Experimental
122 BrianMburu/Distiled-BERT-model-training-pytorch

A notebook that showcases training the Distilled BERT model on Toxic...

12
Experimental
123 Rayyan9477/Hate-Speech-Detection

This project is an end-to-end machine learning pipeline designed to evaluate...

12
Experimental
124 BrianMburu/Toxic-Comment-Explanatory-Data-Analysis

This notebook analyzes a dataset of toxic comments using NLP techniques such...

12
Experimental
125 Dutta-SD/AggDetectApp

A web application that detects aggression and misogyny in text using BERT...

12
Experimental
126 Dutta-SD/AggDetect

Code for Aggression and Misogyny Detection using BERT augmentation...

12
Experimental
127 MilaNLProc/hate-ita

Hate Speech classification in Italian using XLM (fine-tuning). Published at...

11
Experimental
128 alexvishnevskiy/jigsaw

Jigsaw competition solution

11
Experimental
129 timthedev07/toxic-comment-classification

Detecting toxic comments with machine learning.

11
Experimental
130 vipul-shinde/toxic-comment-classification

Toxic Comment Classifier Web App deployed using Flask and AWS.

11
Experimental
131 azizbarank/Toxic-Comment-Detector

This project applies classification models with the aim of automating the...

11
Experimental
132 Jayveersinh-Raj/cross-lingual-zero-shot-transfer

A cross lingual toxicity detection model that works for over 100 languages....

11
Experimental
133 gungorefecetin/toxic-comment-severity-analysis

Fine-tuned DistilBERT for multi-label toxic comment classification with...

11
Experimental
134 chonzadaniel/Racist-Detection-App

This is a production-ready, end-to-end system developed to detect and...

11
Experimental
135 bremelket/toxic-comment-classification-for-content-moderation

Automated toxic comment detection system for online community moderation...

11
Experimental
136 daniel-023/Offensive-Speech-Detection

Comparing text vectorisation techniques with machine learning for offensive...

11
Experimental
137 pashudzu/ToxicCommentClassificationAI

🧪 Classifies toxic comments (EN + RU) using TF-IDF and logistic regression.

11
Experimental
138 yucellyillmaz/smelligent-inline-code-comment-smell-detection

Experimental code for the Smelligent study on detecting inline code comment smells.

11
Experimental
139 orozg5/hate-speech-analysis

A text classification project to detect Hate Speech and Offensive Language...

11
Experimental
140 Revanth-Reddy-Pingala/Abusive_Comment_Detector_BERT

Fine tuned BERT, mBERT and XLMRoBERTa for Abusive Comments Detection in...

11
Experimental
141 El-Bahnasawi/AI-Powered-Textual-Hate-Content-Moderation

Real-time hate-speech filter: regex → LoRA-tuned BERTweet (530K tweets, 84.5...

11
Experimental
142 gangula-karthik/Hateful-Meme-Classifier

A multimodal AI that spots hateful memes by looking at both 🖼️ images and 📝...

11
Experimental
143 fc63/gender-classification

Fine-tuned DeBERTa-v3-large model for gender prediction from English text...

11
Experimental
144 Nyx1311/Toxicity-Detector-using-BiLSTM

🚀 What we built: An AI-powered Women’s Safety & Well-Being Detector — a web...

11
Experimental
145 D-Chandrahas/Toxicity-Classifier

RoBERTa based hatespeech & toxicity classifier

11
Experimental
146 Ashutosh-Singh-Thakur/Toxic-Comment-Classification

Toxic Comment Classification - An NLP project using TensorFlow and LSTM for...

10
Experimental
147 SURESHBEEKHANI/Hate-Speech-Classifier-LSTM

A Deep learning project that detects hate speech in text using LSTM-based...

10
Experimental
148 e-vdb/toxic-comment-classification

Multi-label classification on toxic-comment dataset

10
Experimental
149 chakshu-dhannawat/Toxic-Comment-Classification

NLP based classification pipeline for detecting toxic comments

10
Experimental
150 skyradez/Audio-based-Indonesia-Toxic-Language-Classification-using-RNN-Speech-Recognition-and-NLP

Audio-based Indonesia Toxic Language Classification using Recurrent Neural...

10
Experimental
151 dilettagoglia/BiBiNET

💬 A Gated Recurrent Neural Network for Supervised Text Classification:...

10
Experimental
152 amsqr/SEMEVAL-2023-EDOS-solution

Solution using adversarial training for the explainable detection of sexism...

10
Experimental
153 PoojanDoshi11/Speech_Detection

🗣️ Speech Type Detection is a Flask app that classifies text into categories...

10
Experimental
154 AklimaRimi/NegativeCommentDetector

Catch Negative Comments and Classify

10
Experimental