General NLP Toolkits NLP Tools

Comprehensive NLP libraries and toolkits that bundle multiple fundamental NLP tasks (tokenization, stemming, POS tagging, sentiment analysis, NER) into reusable modules. Does NOT include single-task focused tools, domain-specific NLP applications, or language-specific implementations.

There are 78 general nlp toolkits tools tracked. 3 score above 70 (verified tier). The highest-rated is sloria/TextBlob at 91/100 with 9,515 stars and 1,988,445 monthly downloads. 2 of the top 10 are actively maintained.

Get all 78 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=general-nlp-toolkits&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 sloria/TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech...

91
Verified
2 chrismattmann/tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing...

87
Verified
3 cltk/cltk

The Classical Language Toolkit

72
Verified
4 allenai/scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

67
Established
5 wi2trier/cbrkit

Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in...

59
Established
6 delph-in/pydelphin

Python libraries for DELPH-IN

58
Established
7 SamEdwardes/spacytextblob

A TextBlob sentiment analysis pipeline component for spaCy.

57
Established
8 grid-parity-exchange/Egret

Tools for building power systems optimization problems

52
Established
9 ysenarath/sinling

A collection of NLP tools for Sinhalese (සිංහල).

51
Established
10 CGCL-codes/naturalcc

NaturalCC: An Open-Source Toolkit for Code Intelligence

50
Established
11 NIHOPA/NLPre

Python library for Natural Language Preprocessing (NLPre)

44
Emerging
12 rajcscw/nlp-gym

NLPGym - A toolkit to develop RL agents to solve NLP tasks.

44
Emerging
13 rlayers/pawpaw

Text Processing & Segmentation Framework

44
Emerging
14 sharejing/Takin

A Python toolkit for file processing, text cleaning and data splitting....

44
Emerging
15 StarlangSoftware/NlpToolkit

Natural Language Processing Library

43
Emerging
16 DukeEnglish/knlp

非常好用的工具包,可以直接安装并使用

43
Emerging
17 dhopp1/nlp_pipeline

Collection of NLP tools for processing and analyzing text data.

41
Emerging
18 lvzii/nlpertools

NLP/数据工作人员常用函数,简化读写操作,代码更加可读。Functions commonly used by NLP/data...

40
Emerging
19 lyeoni/prenlp

Preprocessing Library for Natural Language Processing

40
Emerging
20 neocl/speach

🐍🍑 Python 3 library for managing, annotating, and converting natural...

40
Emerging
21 soskek/bookcorpus

Crawl BookCorpus

40
Emerging
22 quadrismegistus/prosodic

Prosodic: a metrical-phonological parser, written in Python. For English and...

39
Emerging
23 stevewyl/nlp_toolkit

Basic NLP Toolkits include text classification, sequence labeling etc.

39
Emerging
24 sliderSun/pynlp

NLP related tasks, including text classification, sequence annotation, text...

38
Emerging
25 alexandrainst/danlp

DaNLP is a repository for Natural Language Processing resources for the...

38
Emerging
26 lattice-8094/propp

PROPP: A Python library for narrative analysis

36
Emerging
27 DerXter/State-of-NLP-Research-in-Senegal

First comprehensive survey of NLP work carried out in Senegalese languages...

36
Emerging
28 wyounas/homer

Homer, a text analyser in Python, can help make your text more clear, simple...

35
Emerging
29 Vidito/vidinlp

VidiNLP is a simple, modern, and fast NLP library built on top of spaCy. It...

34
Emerging
30 dlite-tools/NLPiper

NLPiper is a package that agglomerates different NLP tools and applies their...

33
Emerging
31 ckiplab/ckipnlp

CKIP CoreNLP Toolkits

33
Emerging
32 microsoft/verseagility

Ramp up your custom natural language processing (NLP) task, allowing you to...

33
Emerging
33 d99kris/spacy-cpp

C++ wrapper library for the NLP library spaCy

33
Emerging
34 paudan/opennlp_python

Python NLTK module for interfacing with the Apache OpenNLP

32
Emerging
35 asyml/forte-wrappers

Forte wrapper of third-party toolkits.

31
Emerging
36 worldbank/wb-nlp-tools

Natural language processing tools developed by the World Bank's DECAT unit....

30
Emerging
37 lord-alfred/dnlp

📚 Сборник полезных штук из Natural Language Processing: Определение языка...

30
Emerging
38 NLP-Suite/NLP-Suite

CLICK ON WIKI LINK BELOW OR ON Wiki TAB AT TOP BANNER FOR DOWNLOAD...

30
Emerging
39 wjbmattingly/bagpipes-spacy

Bagpipes spaCy is a collection of custom spaCy pipeline components designed...

30
Emerging
40 pharo-ai/Polyglot

A library for Natural Language Processing

27
Experimental
41 entelecheia/ekorpkit

ekorpkit: NLP Library for Social Science Research

26
Experimental
42 dcondrey/BridgeNLP

A universal NLP model-to-pipeline integration framework.

26
Experimental
43 F1uctus/ttc

✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP].

26
Experimental
44 AdrianBZG/Polyglotter

[Nature Scientific Reports] Translating synthetic natural language to...

25
Experimental
45 tinq-ai/tinq-python

Tinq.ai python library

24
Experimental
46 hitchhicker/tweet_nlp_toolkit

Tweet NLP toolkit

23
Experimental
47 davidjurgens/sporc

Python library for working with the Structured Podcast Open Research Corpus (SPORC)

23
Experimental
48 hulsedev/hulse-py

The Python client for the Hulse platform

23
Experimental
49 olcaytaner/NlpToolkit

Natural Language Processing Library

23
Experimental
50 markgw/pimlico

The Pimlico Processing Toolkit

23
Experimental
51 goldwing4/pyml

🐍 Write Python code using a YAML-like syntax for cleaner and more readable...

22
Experimental
52 capjamesg/pysurprisal

Calculate surprisal for words in text.

22
Experimental
53 TakeLab/podium

Podium: a framework agnostic Python NLP library for data loading and preprocessing

22
Experimental
54 wencan/distilNLP

distilNLP is a natural language processing toolkits that are out-of-the-box,...

21
Experimental
55 moraleida/wordpress-nlp-tools

A toolkit for using self-hosted Natural Language Processing with...

21
Experimental
56 aditeyabaral/nlpc

Natural Language Toolkit built using the C Programming Language

21
Experimental
57 peterdalle/svensktext

Svenska språkresurser: kvinno- och mansnamn, orter, län, kommuner, länder,...

20
Experimental
58 liuziwei7/nlp-toolkit

NLP Toolkit for Social Science (NTSS)

19
Experimental
59 KhiemNguyen2104/CO3085-Chunking-in-NLP

A repository for my NLP project that focus on chuking techniques in NLP for...

19
Experimental
60 Valuebai/pyqt5_nlp_tools

PyQt5 集成Nlp 小工具(文本合并、分词、词性标注),也可作为Pyqt5的入门例子

18
Experimental
61 erickorsi/polyglot-windows-installation

Installation and setup of Polyglot package for Python 3 on Windows.

16
Experimental
62 ashishpoudel995/NLP-for-Nepali-Language

The project is all about Natural Language Processing for the Nepali...

16
Experimental
63 eistakovskii/NLP_Toolbox

This repository contains various tools intended for handling Natural...

16
Experimental
64 ojrlopez27/nl-service-composition

NLSC Unrestricted Natural Language-based Service Composition Middleware that...

14
Experimental
65 Thableaddles57/nlp-toolkit

A Python-based natural language processing toolkit with common NLP tasks...

14
Experimental
66 galax19ksh/Manipuri-NLP

A comprehensive overview of research regarding Natural Language Processing...

13
Experimental
67 andrea-ballatore/place-vocabulary

This repository contains a vocabulary of nouns used to describe places, such...

13
Experimental
68 MayADevBe/Swedish-FrequencyList-8Sidor

Data Mining - Swedisch Freqency List from Sidor8

13
Experimental
69 fbn776/Word-groups

An effort to organize more than 370k words in the English language.

13
Experimental
70 Dechrissen/hstk

Toolkit for creating and interfacing with a database of news headlines

12
Experimental
71 partigabor/scikit-talk

A toolkit to process real-world conversational speech data (moved)

12
Experimental
72 Alex-bzh/L2TAL

Resources for L2 TAL at Inalco

11
Experimental
73 harrycslau/finnish-vocab-learning-nlp

A small NLP project enable Finnish learners to learn new vocabulary.

11
Experimental
74 siara-cc/Text_frequency_research

Developing language frequency lists from Reddit/Twitter posts and Wikipedia articles

11
Experimental
75 Dellonath/nlp-cognitive-engine

Development of a NLP Cognitive Engine using the KERAS Deep Learning...

11
Experimental
76 JJWRoeloffs/DynamicFluency

DynamicFluency - Monitor and understand the dynamicity of linguistic aspects...

10
Experimental
77 PujanMotiwala/the_nlp_toolkit

Your comprehensive resource for mastering NLP techniques. Includes text...

10
Experimental
78 TohidN/ElevateNLP

ElevateNLP is a web based NLP Platform for creation, organization,...

10
Experimental

Comparisons in this category