General NLP Toolkits
Comprehensive NLP libraries and toolkits that bundle multiple fundamental NLP tasks (tokenization, stemming, POS tagging, sentiment analysis, NER) into reusable modules. Does NOT include single-task focused tools, domain-specific NLP applications, or language-specific implementations.
78 general NLP toolkits are tracked. Three score above 70 (verified tier). The highest-rated is sloria/TextBlob at 91/100, with 9,515 stars and 1,988,445 monthly downloads. Two of the top 10 are actively maintained.
Get all 78 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=general-nlp-toolkits&limit=20"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
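The same request can be sketched in Python using only the standard library. This is a minimal sketch, not the service's official client: the helper names `build_url` and `fetch_projects` are hypothetical, and the response schema is not described on this page, so the decoded JSON is returned untouched. The curl command above passes `limit=20`; whether a larger value such as 78 is accepted is an assumption that would need checking against the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="general-nlp-toolkits", limit=20):
    """Build the dataset URL with the same query parameters as the curl command."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=10):
    """Fetch and decode the JSON payload (hypothetical helper).

    The structure of the response is not documented here, so the decoded
    object is returned as is rather than being unpacked into fields.
    """
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)


# Example usage (performs a network request, so it is left commented out):
# data = fetch_projects()
# print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes the query parameters easy to inspect and stays within the unauthenticated 100 requests/day limit while experimenting.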
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | sloria/TextBlob: Simple, Pythonic, text processing--Sentiment analysis, part-of-speech... | 91 | Verified |
| 2 | chrismattmann/tika-python: Tika-Python is a Python binding to the Apache Tika™ REST services allowing... | | Verified |
| 3 | cltk/cltk: The Classical Language Toolkit | | Verified |
| 4 | allenai/scispacy: A full spaCy pipeline and models for scientific/biomedical documents. | | Established |
| 5 | wi2trier/cbrkit: Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in... | | Established |
| 6 | delph-in/pydelphin: Python libraries for DELPH-IN | | Established |
| 7 | SamEdwardes/spacytextblob: A TextBlob sentiment analysis pipeline component for spaCy. | | Established |
| 8 | grid-parity-exchange/Egret: Tools for building power systems optimization problems | | Established |
| 9 | ysenarath/sinling: A collection of NLP tools for Sinhalese (සිංහල). | | Established |
| 10 | CGCL-codes/naturalcc: NaturalCC: An Open-Source Toolkit for Code Intelligence | | Established |
| 11 | NIHOPA/NLPre: Python library for Natural Language Preprocessing (NLPre) | | Emerging |
| 12 | rajcscw/nlp-gym: NLPGym - A toolkit to develop RL agents to solve NLP tasks. | | Emerging |
| 13 | rlayers/pawpaw: Text Processing & Segmentation Framework | | Emerging |
| 14 | sharejing/Takin: A Python toolkit for file processing, text cleaning and data splitting.... | | Emerging |
| 15 | StarlangSoftware/NlpToolkit: Natural Language Processing Library | | Emerging |
| 16 | DukeEnglish/knlp: A very handy toolkit that can be installed and used directly | | Emerging |
| 17 | dhopp1/nlp_pipeline: Collection of NLP tools for processing and analyzing text data. | | Emerging |
| 18 | lvzii/nlpertools: Functions commonly used by NLP/data practitioners that simplify read/write operations and make code more readable. | | Emerging |
| 19 | lyeoni/prenlp: Preprocessing Library for Natural Language Processing | | Emerging |
| 20 | neocl/speach: 🐍🍑 Python 3 library for managing, annotating, and converting natural... | | Emerging |
| 21 | soskek/bookcorpus: Crawl BookCorpus | | Emerging |
| 22 | quadrismegistus/prosodic: Prosodic: a metrical-phonological parser, written in Python. For English and... | | Emerging |
| 23 | stevewyl/nlp_toolkit: Basic NLP Toolkits include text classification, sequence labeling etc. | | Emerging |
| 24 | sliderSun/pynlp: NLP related tasks, including text classification, sequence annotation, text... | | Emerging |
| 25 | alexandrainst/danlp: DaNLP is a repository for Natural Language Processing resources for the... | | Emerging |
| 26 | lattice-8094/propp: PROPP: A Python library for narrative analysis | | Emerging |
| 27 | DerXter/State-of-NLP-Research-in-Senegal: First comprehensive survey of NLP work carried out in Senegalese languages... | | Emerging |
| 28 | wyounas/homer: Homer, a text analyser in Python, can help make your text more clear, simple... | | Emerging |
| 29 | Vidito/vidinlp: VidiNLP is a simple, modern, and fast NLP library built on top of spaCy. It... | | Emerging |
| 30 | dlite-tools/NLPiper: NLPiper is a package that agglomerates different NLP tools and applies their... | | Emerging |
| 31 | ckiplab/ckipnlp: CKIP CoreNLP Toolkits | | Emerging |
| 32 | microsoft/verseagility: Ramp up your custom natural language processing (NLP) task, allowing you to... | | Emerging |
| 33 | d99kris/spacy-cpp: C++ wrapper library for the NLP library spaCy | | Emerging |
| 34 | paudan/opennlp_python: Python NLTK module for interfacing with the Apache OpenNLP | | Emerging |
| 35 | asyml/forte-wrappers: Forte wrapper of third-party toolkits. | | Emerging |
| 36 | worldbank/wb-nlp-tools: Natural language processing tools developed by the World Bank's DECAT unit.... | | Emerging |
| 37 | lord-alfred/dnlp: 📚 A collection of useful things from Natural Language Processing: language detection... | | Emerging |
| 38 | NLP-Suite/NLP-Suite: CLICK ON WIKI LINK BELOW OR ON Wiki TAB AT TOP BANNER FOR DOWNLOAD... | | Emerging |
| 39 | wjbmattingly/bagpipes-spacy: Bagpipes spaCy is a collection of custom spaCy pipeline components designed... | | Emerging |
| 40 | pharo-ai/Polyglot: A library for Natural Language Processing | | Experimental |
| 41 | entelecheia/ekorpkit: ekorpkit: NLP Library for Social Science Research | | Experimental |
| 42 | dcondrey/BridgeNLP: A universal NLP model-to-pipeline integration framework. | | Experimental |
| 43 | F1uctus/ttc: ✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP]. | | Experimental |
| 44 | AdrianBZG/Polyglotter: [Nature Scientific Reports] Translating synthetic natural language to... | | Experimental |
| 45 | tinq-ai/tinq-python: Tinq.ai python library | | Experimental |
| 46 | hitchhicker/tweet_nlp_toolkit: Tweet NLP toolkit | | Experimental |
| 47 | davidjurgens/sporc: Python library for working with the Structured Podcast Open Research Corpus (SPORC) | | Experimental |
| 48 | hulsedev/hulse-py: The Python client for the Hulse platform | | Experimental |
| 49 | olcaytaner/NlpToolkit: Natural Language Processing Library | | Experimental |
| 50 | markgw/pimlico: The Pimlico Processing Toolkit | | Experimental |
| 51 | goldwing4/pyml: 🐍 Write Python code using a YAML-like syntax for cleaner and more readable... | | Experimental |
| 52 | capjamesg/pysurprisal: Calculate surprisal for words in text. | | Experimental |
| 53 | TakeLab/podium: Podium: a framework agnostic Python NLP library for data loading and preprocessing | | Experimental |
| 54 | wencan/distilNLP: distilNLP is a natural language processing toolkit that is out-of-the-box,... | | Experimental |
| 55 | moraleida/wordpress-nlp-tools: A toolkit for using self-hosted Natural Language Processing with... | | Experimental |
| 56 | aditeyabaral/nlpc: Natural Language Toolkit built using the C Programming Language | | Experimental |
| 57 | peterdalle/svensktext: Swedish language resources: female and male names, localities, counties, municipalities, countries,... | | Experimental |
| 58 | liuziwei7/nlp-toolkit: NLP Toolkit for Social Science (NTSS) | | Experimental |
| 59 | KhiemNguyen2104/CO3085-Chunking-in-NLP: A repository for my NLP project that focuses on chunking techniques in NLP for... | | Experimental |
| 60 | Valuebai/pyqt5_nlp_tools: A PyQt5 app integrating small NLP tools (text merging, word segmentation, POS tagging); can also serve as an introductory PyQt5 example | | Experimental |
| 61 | erickorsi/polyglot-windows-installation: Installation and setup of Polyglot package for Python 3 on Windows. | | Experimental |
| 62 | ashishpoudel995/NLP-for-Nepali-Language: The project is all about Natural Language Processing for the Nepali... | | Experimental |
| 63 | eistakovskii/NLP_Toolbox: This repository contains various tools intended for handling Natural... | | Experimental |
| 64 | ojrlopez27/nl-service-composition: NLSC Unrestricted Natural Language-based Service Composition Middleware that... | | Experimental |
| 65 | Thableaddles57/nlp-toolkit: A Python-based natural language processing toolkit with common NLP tasks... | | Experimental |
| 66 | galax19ksh/Manipuri-NLP: A comprehensive overview of research regarding Natural Language Processing... | | Experimental |
| 67 | andrea-ballatore/place-vocabulary: This repository contains a vocabulary of nouns used to describe places, such... | | Experimental |
| 68 | MayADevBe/Swedish-FrequencyList-8Sidor: Data Mining - Swedish Frequency List from 8 Sidor | | Experimental |
| 69 | fbn776/Word-groups: An effort to organize more than 370k words in the English language. | | Experimental |
| 70 | Dechrissen/hstk: Toolkit for creating and interfacing with a database of news headlines | | Experimental |
| 71 | partigabor/scikit-talk: A toolkit to process real-world conversational speech data (moved) | | Experimental |
| 72 | Alex-bzh/L2TAL: Resources for L2 TAL at Inalco | | Experimental |
| 73 | harrycslau/finnish-vocab-learning-nlp: A small NLP project that enables Finnish learners to learn new vocabulary. | | Experimental |
| 74 | siara-cc/Text_frequency_research: Developing language frequency lists from Reddit/Twitter posts and Wikipedia articles | | Experimental |
| 75 | Dellonath/nlp-cognitive-engine: Development of a NLP Cognitive Engine using the KERAS Deep Learning... | | Experimental |
| 76 | JJWRoeloffs/DynamicFluency: DynamicFluency - Monitor and understand the dynamicity of linguistic aspects... | | Experimental |
| 77 | PujanMotiwala/the_nlp_toolkit: Your comprehensive resource for mastering NLP techniques. Includes text... | | Experimental |
| 78 | TohidN/ElevateNLP: ElevateNLP is a web based NLP Platform for creation, organization,... | | Experimental |