All NLP Tools

11,854 tools ranked by quality score · Page 2 of 119

Showing 101–200 of 11,854
# Tool Score Tier
101 hyperquest-hq/hyperbase

A foundational library for Semantic Hypergraphs

68
Established
102 zaemyung/sentsplit

A flexible sentence segmentation library using CRF model and regex rules

67
Established
103 bab2min/Kiwi

Kiwi (Intelligent Korean Morphological Analyzer)

67
Established
104 huggingface/neuralcoref

✨Fast Coreference Resolution in spaCy with Neural Networks

67
Established
105 neuspell/neuspell

NeuSpell: A Neural Spelling Correction Toolkit

67
Established
106 HLasse/TextDescriptives

A Python library for calculating a large variety of metrics from text

67
Established
107 goru001/inltk

Natural Language Toolkit for Indic Languages aims to provide out of the box...

67
Established
108 Georgetown-IR-Lab/QuickUMLS

System for Medical Concept Extraction and Linking

66
Established
109 jidasheng/bi-lstm-crf

A PyTorch implementation of the BI-LSTM-CRF model.

66
Established
110 Droidtown/ArticutAPI

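API of Articut Chinese word segmentation (with semantic part-of-speech tagging): word segmentation, also known as tokenization, is the foundation of Chinese information processing. Articut...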

66
Established
111 R1j1t/contextualSpellCheck

✔️Contextual word checker for better suggestions (not actively maintained)

66
Established
112 pemistahl/lingua-rs

The most accurate natural language detection library for Rust, suitable for...

66
Established
113 staticdev/human-readable

Lib to make data intended for machines, readable to humans.

66
Established
114 stanfordnlp/python-stanford-corenlp

Python interface to CoreNLP using a bidirectional server-client interface.

66
Established
115 KennethEnevoldsen/asent

Asent is a python library for performing efficient and transparent sentiment...

66
Established
116 jboynyc/textnets

Text analysis with networks.

66
Established
117 UCREL/pymusas

Python Multilingual Ucrel Semantic Analysis System

66
Established
118 ClipsAI/clipsai

Clips AI is an open-source Python library that automatically converts long...

66
Established
119 natasha/razdel

Rule-based token, sentence segmentation for Russian language

66
Established
120 thunlp/OpenHowNet

Core Data of HowNet and OpenHowNet Python API

65
Established
121 baidu/lac

Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, word importance

65
Established
122 blmoistawinde/HarvestText

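Text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.) using unsupervised or weakly supervised methods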

65
Established
123 polyrabbit/WeCron

✔️ Scheduled reminders on WeChat - Cron on WeChat

65
Established
124 nlp-uoregon/trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual...

65
Established
125 jenojp/negspacy

spaCy pipeline object for negating concepts in text

65
Established
126 jacksonllee/pylangacq

Language Acquisition Research Tools

65
Established
127 fastnlp/fastNLP

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

65
Established
128 natasha/corus

Links to Russian corpora + Python functions for loading and parsing

65
Established
129 sileod/tasknet

Easy modernBERT fine-tuning and multi-task learning

65
Established
130 cbaziotis/ekphrasis

Ekphrasis is a text processing tool, geared towards text from social...

65
Established
131 andrewtavis/kwx

BERT, LDA, and TFIDF based keyword extraction in Python

65
Established
132 codertimo/BERT-pytorch

Google AI 2018 BERT pytorch implementation

65
Established
133 charles9n/bert-sklearn

a sklearn wrapper for Google's BERT model

65
Established
134 polm/cutlet

Japanese to romaji converter in Python

65
Established
135 go-ego/gse

Go efficient multilingual NLP and text segmentation; support English,...

65
Established
136 smilelight/lightNLP

A deep learning framework for natural language processing based on PyTorch and torchtext.

65
Established
137 rodrigopivi/Chatito

🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition...

65
Established
138 mcs07/ChemDataExtractor

Automatically extract chemical information from scientific documents

65
Established
139 delph-in/pydelphin

Python libraries for DELPH-IN

65
Established
140 maximtrp/bitermplus

Biterm Topic Model (BTM): modeling topics in short texts

65
Established
141 ufal/factgenie

Lightweight self-hosted span annotation tool

65
Established
142 explosion/spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

65
Established
143 bakwc/JamSpell

Modern spell checking library - accurate, fast, multi-language

64
Established
144 natasha/yargy

Rule-based facts extraction for Russian language

64
Established
145 SamEdwardes/spacytextblob

A TextBlob sentiment analysis pipeline component for spaCy.

64
Established
146 gutfeeling/word_forms

Accurately generate all possible forms of an English word, e.g. "election" -->...

64
Established
147 JDongian/python-jamo

Hangul syllable decomposition and synthesis using jamo.

64
Established
148 tanaos/artifex

Small Language Model Inference, Fine-Tuning and Observability. No GPU, no...

64
Established
149 bobxwu/TopMost

A Topic Modeling System Toolkit (ACL 2024 Demo)

64
Established
150 vunb/vntk

Vietnamese NLP Toolkit for Node

64
Established
151 alirezatheh/perke

A keyphrase extractor for Persian

64
Established
152 smilelight/lightKG

A deep learning framework for knowledge graphs based on PyTorch and torchtext.

64
Established
153 ownthink/Jiagu

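Jiagu deep learning NLP toolkit: knowledge graph relation extraction, Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, new word discovery, keyword extraction, text summarization, text clustering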

64
Established
154 BrikerMan/Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top...

64
Established
155 thunlp/OpenAttack

An Open-Source Package for Textual Adversarial Attack.

63
Established
156 Ayanami0730/deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

63
Established
157 alvinwan/timefhuman

Extract datetimes and durations from natural language text as Python...

63
Established
158 bjascob/LemmInflect

A python module for English lemmatization and inflection.

63
Established
159 polm/unidic-py

Unidic packaged for installation via pip.

63
Established
160 FraBle/python-sutime

Python wrapper for Stanford CoreNLP's SUTime

63
Established
161 yongzhuo/Keras-TextClassification

Chinese long-text classification, short-sentence classification, multi-label classification, two-sentence similarity (Chinese Text Classification of Keras NLP,...

63
Established
162 princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings...

63
Established
163 natasha/natasha

Solves basic Russian NLP tasks, API for lower level Natasha projects

63
Established
164 centre-for-humanities-computing/DaCy

DaCy: The State of the Art Danish NLP pipeline using SpaCy

63
Established
165 SeanLee97/xmnlp

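xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, radical lookup, sentence representation, and text similarity computation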

63
Established
166 wooorm/franc

Natural language detection

63
Established
167 cannlytics/cannlytics

🔥 Cannlytics = cannabis + analytics. Data pipelines, user interfaces, and...

63
Established
168 baidu/Senta

Baidu's open-source Sentiment Analysis System.

63
Established
169 LSYS/LexicalRichness

😸 💬 A module to compute textual lexical richness...

63
Established
170 GaoQ1/rasa_nlu_gq

Turn natural language into structured data (supports Chinese; includes N custom models for different scenarios and tasks)

63
Established
171 ines/spacy-js

🎀 JavaScript API for spaCy with Python REST API

63
Established
172 920232796/bert_seq2seq

PyTorch implementation of Bert...

63
Established
173 asyml/texar

Toolkit for Machine Learning, Natural Language Processing, and Text...

63
Established
174 MLNLP-World/SimBiber

A tool from the MLNLP community for shortening references. A tool for simplifying bibtex with official info

63
Established
175 urduhack/urduhack

An NLP library for the Urdu language. It comes with a lot of battery...

63
Established
176 huspacy/huspacy

HuSpaCy: industrial-strength Hungarian natural language processing

63
Established
177 ku-nlp/rhoknp

Yet another Python binding for Juman++/KNP/KWJA

63
Established
178 yongzhuo/nlp_xiaojiang

Natural language processing (NLP), Xiaojiang robot (retrieval-based chitchat chatbot), BERT sentence embeddings and similarity (Sentence...

63
Established
179 fidelity/textwiser

[AAAI 2021] TextWiser: Text Featurization Library

63
Established
180 greyblake/whatlang-rs

Natural language detection library for Rust. Try demo online: https://whatlang.org/

62
Established
181 philenius/ngx-annotate-text

This Angular component library is perfect for tasks like visualizing named...

62
Established
182 FerreroJeremy/ln2sql

A tool to query a database in natural language

62
Established
183 Lilykos/pyphonetics

A Python 3 phonetics library.

62
Established
184 nert-nlp/streusle

STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword...

62
Established
185 PyThaiNLP/attacut

A Fast and Accurate Neural Thai Word Segmenter

62
Established
186 raghakot/keras-text

Text Classification Library in Keras

62
Established
187 hltcoe/turkle

Django-based clone of Amazon's Mechanical Turk service running in your local...

62
Established
188 kakaobrain/word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

62
Established
189 425776024/nlpcda

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

62
Established
190 eyurtsev/kor

LLM(😽)

62
Established
191 yuchenlin/rebiber

A simple tool to update bib entries with their official information (e.g.,...

62
Established
192 FraBle/python-duckling

Python wrapper for wit.ai's Duckling Clojure library

62
Established
193 NateScarlet/holiday-cn

📅🇨🇳 Chinese statutory holiday data, automatically scraped daily from State Council announcements

62
Established
194 taishi-i/awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs,...

62
Established
195 pysentimiento/pysentimiento

A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks

62
Established
196 bretttolbert/verbecc

Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French,...

62
Established
197 Rostlab/nalaf

NLP framework in python for entity recognition and relationship extraction

62
Established
198 proycon/pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language...

62
Established
199 shineware/KOMORAN

Korean Morphological Analyzer by shineware

62
Established
200 snipsco/snips-nlu

Snips Python library to extract meaning from text

62
Established