All NLP Tools
11,854 tools ranked by quality score · Page 2 of 119
| # | Tool | Score | Tier |
|---|---|---|---|
| 101 |
hyperquest-hq/hyperbase
A foundational library for Semantic Hypergraphs |
|
Established |
| 102 |
zaemyung/sentsplit
A flexible sentence segmentation library using CRF model and regex rules |
|
Established |
| 103 |
bab2min/Kiwi
Kiwi(지능형 한국어 형태소 분석기) |
|
Established |
| 104 |
huggingface/neuralcoref
✨Fast Coreference Resolution in spaCy with Neural Networks |
|
Established |
| 105 |
neuspell/neuspell
NeuSpell: A Neural Spelling Correction Toolkit |
|
Established |
| 106 |
HLasse/TextDescriptives
A Python library for calculating a large variety of metrics from text |
|
Established |
| 107 |
goru001/inltk
Natural Language Toolkit for Indic Languages aims to provide out of the box... |
|
Established |
| 108 |
Georgetown-IR-Lab/QuickUMLS
System for Medical Concept Extraction and Linking |
|
Established |
| 109 |
jidasheng/bi-lstm-crf
A PyTorch implementation of the BI-LSTM-CRF model. |
|
Established |
| 110 |
Droidtown/ArticutAPI
API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut... |
|
Established |
| 111 |
R1j1t/contextualSpellCheck
✔️Contextual word checker for better suggestions (not actively maintained) |
|
Established |
| 112 |
pemistahl/lingua-rs
The most accurate natural language detection library for Rust, suitable for... |
|
Established |
| 113 |
staticdev/human-readable
Lib to make data intended for machines, readable to humans. |
|
Established |
| 114 |
stanfordnlp/python-stanford-corenlp
Python interface to CoreNLP using a bidirectional server-client interface. |
|
Established |
| 115 |
KennethEnevoldsen/asent
Asent is a python library for performing efficient and transparent sentiment... |
|
Established |
| 116 |
jboynyc/textnets
Text analysis with networks. |
|
Established |
| 117 |
UCREL/pymusas
Python Multilingual Ucrel Semantic Analysis System |
|
Established |
| 118 |
ClipsAI/clipsai
Clips AI is an open-source Python library that automatically converts long... |
|
Established |
| 119 |
natasha/razdel
Rule-based token, sentence segmentation for Russian language |
|
Established |
| 120 |
thunlp/OpenHowNet
Core Data of HowNet and OpenHowNet Python API |
|
Established |
| 121 |
baidu/lac
百度NLP:分词,词性标注,命名实体识别,词重要性 |
|
Established |
| 122 |
blmoistawinde/HarvestText
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法 |
|
Established |
| 123 |
polyrabbit/WeCron
:heavy_check_mark: 微信上的定时提醒 - Cron on WeChat |
|
Established |
| 124 |
nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual... |
|
Established |
| 125 |
jenojp/negspacy
spaCy pipeline object for negating concepts in text |
|
Established |
| 126 |
jacksonllee/pylangacq
Language Acquisition Research Tools |
|
Established |
| 127 |
fastnlp/fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation. |
|
Established |
| 128 |
natasha/corus
Links to Russian corpora + Python functions for loading and parsing |
|
Established |
| 129 |
sileod/tasknet
Easy modernBERT fine-tuning and multi-task learning |
|
Established |
| 130 |
cbaziotis/ekphrasis
Ekphrasis is a text processing tool, geared towards text from social... |
|
Established |
| 131 |
andrewtavis/kwx
BERT, LDA, and TFIDF based keyword extraction in Python |
|
Established |
| 132 |
codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation |
|
Established |
| 133 |
charles9n/bert-sklearn
a sklearn wrapper for Google's BERT model |
|
Established |
| 134 |
polm/cutlet
Japanese to romaji converter in Python |
|
Established |
| 135 |
go-ego/gse
Go efficient multilingual NLP and text segmentation; support English,... |
|
Established |
| 136 |
smilelight/lightNLP
基于Pytorch和torchtext的自然语言处理深度学习框架。 |
|
Established |
| 137 |
rodrigopivi/Chatito
🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition... |
|
Established |
| 138 |
mcs07/ChemDataExtractor
Automatically extract chemical information from scientific documents |
|
Established |
| 139 |
delph-in/pydelphin
Python libraries for DELPH-IN |
|
Established |
| 140 |
maximtrp/bitermplus
Biterm Topic Model (BTM): modeling topics in short texts |
|
Established |
| 141 |
ufal/factgenie
Lightweight self-hosted span annotation tool |
|
Established |
| 142 |
explosion/spacy-streamlit
👑 spaCy building blocks and visualizers for Streamlit apps |
|
Established |
| 143 |
bakwc/JamSpell
Modern spell checking library - accurate, fast, multi-language |
|
Established |
| 144 |
natasha/yargy
Rule-based facts extraction for Russian language |
|
Established |
| 145 |
SamEdwardes/spacytextblob
A TextBlob sentiment analysis pipeline component for spaCy. |
|
Established |
| 146 |
gutfeeling/word_forms
Accurately generate all possible forms of an English word e.g "election" -->... |
|
Established |
| 147 |
JDongian/python-jamo
Hangul syllable decomposition and synthesis using jamo. |
|
Established |
| 148 |
tanaos/artifex
Small Language Model Inference, Fine-Tuning and Observability. No GPU, no... |
|
Established |
| 149 |
bobxwu/TopMost
A Topic Modeling System Toolkit (ACL 2024 Demo) |
|
Established |
| 150 |
vunb/vntk
Vietnamese NLP Toolkit for Node |
|
Established |
| 151 |
alirezatheh/perke
A keyphrase extractor for Persian |
|
Established |
| 152 |
smilelight/lightKG
基于Pytorch和torchtext的知识图谱深度学习框架。 |
|
Established |
| 153 |
ownthink/Jiagu
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类 |
|
Established |
| 154 |
BrikerMan/Kashgari
Kashgari is a production-level NLP Transfer learning framework built on top... |
|
Established |
| 155 |
thunlp/OpenAttack
An Open-Source Package for Textual Adversarial Attack. |
|
Established |
| 156 |
Ayanami0730/deep_research_bench
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents |
|
Established |
| 157 |
alvinwan/timefhuman
Extract datetimes and durations from natural language text as Python... |
|
Established |
| 158 |
bjascob/LemmInflect
A python module for English lemmatization and inflection. |
|
Established |
| 159 |
polm/unidic-py
Unidic packaged for installation via pip. |
|
Established |
| 160 |
FraBle/python-sutime
Python wrapper for Stanford CoreNLP's SUTime |
|
Established |
| 161 |
yongzhuo/Keras-TextClassification
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP,... |
|
Established |
| 162 |
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings... |
|
Established |
| 163 |
natasha/natasha
Solves basic Russian NLP tasks, API for lower level Natasha projects |
|
Established |
| 164 |
centre-for-humanities-computing/DaCy
DaCy: The State of the Art Danish NLP pipeline using SpaCy |
|
Established |
| 165 |
SeanLee97/xmnlp
xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能 |
|
Established |
| 166 |
wooorm/franc
Natural language detection |
|
Established |
| 167 |
cannlytics/cannlytics
🔥 Cannlytics = cannabis + analytics. Data pipelines, user interfaces, and... |
|
Established |
| 168 |
baidu/Senta
Baidu's open-source Sentiment Analysis System. |
|
Established |
| 169 |
LSYS/LexicalRichness
:smile_cat: :speech_balloon: A module to compute textual lexical richness... |
|
Established |
| 170 |
GaoQ1/rasa_nlu_gq
turn natural language into structured data(支持中文,自定义了N种模型,支持不同的场景和任务) |
|
Established |
| 171 |
ines/spacy-js
🎀 JavaScript API for spaCy with Python REST API |
|
Established |
| 172 |
920232796/bert_seq2seq
pytorch实现 Bert... |
|
Established |
| 173 |
asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text... |
|
Established |
| 174 |
MLNLP-World/SimBiber
MLNLP社区用来帮助缩短参考文献的工具。A tool for simplifying bibtex with official info |
|
Established |
| 175 |
urduhack/urduhack
An NLP library for the Urdu language. It comes with a lot of battery... |
|
Established |
| 176 |
huspacy/huspacy
HuSpaCy: industrial-strength Hungarian natural language processing |
|
Established |
| 177 |
ku-nlp/rhoknp
Yet another Python binding for Juman++/KNP/KWJA |
|
Established |
| 178 |
yongzhuo/nlp_xiaojiang
自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence... |
|
Established |
| 179 |
fidelity/textwiser
[AAAI 2021] TextWiser: Text Featurization Library |
|
Established |
| 180 |
greyblake/whatlang-rs
Natural language detection library for Rust. Try demo online: https://whatlang.org/ |
|
Established |
| 181 |
philenius/ngx-annotate-text
This Angular component library is perfect for tasks like visualizing named... |
|
Established |
| 182 |
FerreroJeremy/ln2sql
A tool to query a database in natural language |
|
Established |
| 183 |
Lilykos/pyphonetics
A Python 3 phonetics library. |
|
Established |
| 184 |
nert-nlp/streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword... |
|
Established |
| 185 |
PyThaiNLP/attacut
A Fast and Accurate Neural Thai Word Segmenter |
|
Established |
| 186 |
raghakot/keras-text
Text Classification Library in Keras |
|
Established |
| 187 |
hltcoe/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local... |
|
Established |
| 188 |
kakaobrain/word2word
Easy-to-use word-to-word translations for 3,564 language pairs. |
|
Established |
| 189 |
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda |
|
Established |
| 190 |
eyurtsev/kor
LLM(😽) |
|
Established |
| 191 |
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g.,... |
|
Established |
| 192 |
FraBle/python-duckling
Python wrapper for wit.ai's Duckling Clojure library |
|
Established |
| 193 |
NateScarlet/holiday-cn
📅🇨🇳中国法定节假日数据 自动每日抓取国务院公告 |
|
Established |
| 194 |
taishi-i/awesome-japanese-nlp-resources
A curated list of resources dedicated to Python libraries, LLMs,... |
|
Established |
| 195 |
pysentimiento/pysentimiento
A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks |
|
Established |
| 196 |
bretttolbert/verbecc
Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French,... |
|
Established |
| 197 |
Rostlab/nalaf
NLP framework in python for entity recognition and relationship extraction |
|
Established |
| 198 |
proycon/pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language... |
|
Established |
| 199 |
shineware/KOMORAN
Korean Morphological Analyzer by shineware |
|
Established |
| 200 |
snipsco/snips-nlu
Snips Python library to extract meaning from text |
|
Established |