All NLP Tools
11,854 tools ranked by quality score · Page 13 of 119
| # | Tool | Score | Tier |
|---|---|---|---|
| 1201 |
axa-group/Parsr
Transforms PDF, Documents and Images into Enriched Structured Data |
|
Emerging |
| 1202 |
jbarrow/allennlp_tutorial
Tutorial on how to use AllenNLP for sequence modeling (including... |
|
Emerging |
| 1203 |
ebenso/TextSummarizer
TextRank implementation for C# |
|
Emerging |
| 1204 |
snopt/snopt-matlab
Matlab interface for sparse nonlinear optimizer SNOPT |
|
Emerging |
| 1205 |
loresoft/NetSpell
Spell Checker for .NET |
|
Emerging |
| 1206 |
yumeng5/WeSHClass
[AAAI 2019] Weakly-Supervised Hierarchical Text Classification |
|
Emerging |
| 1207 |
abadojack/whatlanggo
Natural language detection library for Go |
|
Emerging |
| 1208 |
m-elbably/symspell-ex
Distributed spelling correction & fuzzy search based on symmetric delete... |
|
Emerging |
| 1209 |
UniversalDataTool/react-nlp-annotate
Interface for making NLP annotations. |
|
Emerging |
| 1210 |
wetneb/pynif
A small Python library for NLP Interchange Format (NIF) for NER(D) systems |
|
Emerging |
| 1211 |
doppio/word2num
A Python package for converting numbers expressed in natural language to... |
|
Emerging |
| 1212 |
barissayil/SentimentAnalysis
Sentiment analysis neural network trained by fine-tuning BERT, ALBERT, or... |
|
Emerging |
| 1213 |
philgooch/abbreviation-extraction
Python3 implementation of the Schwartz-Hearst algorithm for extracting... |
|
Emerging |
| 1214 |
Langboat/Mengzi
Mengzi Pretrained Models |
|
Emerging |
| 1215 |
aaBadri/nlp-papers
Must-read papers on Natural Language Processing (NLP) |
|
Emerging |
| 1216 |
GLambard/SMILES-X
Autonomous characterization of molecular compounds from small datasets... |
|
Emerging |
| 1217 |
Lipairui/textgo
Text preprocessing, representation, similarity calculation, text search and... |
|
Emerging |
| 1218 |
ku-nlp/jumanpp
Juman++ (a Morphological Analyzer Toolkit) |
|
Emerging |
| 1219 |
cahya-wirawan/rwkv-tokenizer
A fast RWKV Tokenizer written in Rust |
|
Emerging |
| 1220 |
mirfan899/Urdu
Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks. |
|
Emerging |
| 1221 |
sudhamstarun/Understanding-Financial-Reports-using-Natural-Language-Processing
Investigate how mutual funds leverage credit derivatives by studying their... |
|
Emerging |
| 1222 |
medkit-lib/medkit
Toolkit for a learning health system |
|
Emerging |
| 1223 |
roomylee/nlp-papers-with-arxiv
Statistics and accepted paper list of NLP conferences with arXiv link |
|
Emerging |
| 1224 |
dayyass/QaNER
Unofficial implementation of QaNER: Prompting Question Answering Models for... |
|
Emerging |
| 1225 |
zamgi/lingvo--Ner-ru
Named entity recognition (NER) in Russian texts / Определение именованных... |
|
Emerging |
| 1226 |
princeton-nlp/CoFiPruning
[ACL 2022] Structured Pruning Learns Compact and Accurate Models... |
|
Emerging |
| 1227 |
3778/icd-prediction-mimic
Predicting ICD Codes from Clinical Notes |
|
Emerging |
| 1228 |
LanguageMachines/libfolia
FoLiA library for C++ |
|
Emerging |
| 1229 |
vene/marseille
Mining Argument Structures with Expressive Inference (Linear and LSTM Engines) |
|
Emerging |
| 1230 |
lukasruff/CVDD-PyTorch
A PyTorch implementation of Context Vector Data Description (CVDD), a method... |
|
Emerging |
| 1231 |
miurahr/pykakasi
Lightweight converter from Japanese Kana-kanji sentences into Kana-Roman. |
|
Emerging |
| 1232 |
masakhane-io/masakhane-community
All our community docs! Start here! Lets put Africa on the NLP Map |
|
Emerging |
| 1233 |
EagleW/Writing-editing-Network
Code for Paper Abstract Writing through Editing Mechanism |
|
Emerging |
| 1234 |
AliOsm/arabic-text-diacritization
Benchmark Arabic text diacritization dataset |
|
Emerging |
| 1235 |
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute. |
|
Emerging |
| 1236 |
huggingface/node-question-answering
Fast and production-ready question answering in Node.js |
|
Emerging |
| 1237 |
ankane/mitie-ruby
Named-entity recognition for Ruby |
|
Emerging |
| 1238 |
guokr/Caver
Caver: a toolkit for multilabel text classification. |
|
Emerging |
| 1239 |
mchesterkadwell/named-entity-recognition
Notebooks for teaching Named Entity Recognition at the Cultural Heritage... |
|
Emerging |
| 1240 |
bentrevett/pytorch-pos-tagging
A tutorial on how to implement models for part-of-speech tagging using... |
|
Emerging |
| 1241 |
hankcs/text-classification-svm
The missing SVM-based text classification module implementing HanLP's interface |
|
Emerging |
| 1242 |
tsafavi/codex
CoDEx: A set of knowledge graph Completion Datasets Extracted from Wikidata... |
|
Emerging |
| 1243 |
CyberZHG/keras-xlnet
Implementation of XLNet that can load pretrained checkpoints |
|
Emerging |
| 1244 |
gcunhase/NLPMetrics
Python code for various NLP metrics |
|
Emerging |
| 1245 |
wx-chevalier/NLP-Notes
人工智能与深度学习实战 - 自然语言处理篇 |
|
Emerging |
| 1246 |
ikawaha/kagome-dict
Dictionary Library for Kagome v2 |
|
Emerging |
| 1247 |
danlou/LMMS
Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings |
|
Emerging |
| 1248 |
SapienzaNLP/ewiser
A Word Sense Disambiguation system integrating implicit and explicit... |
|
Emerging |
| 1249 |
kevinscaria/InstructABSA
Instructional learning for Aspect Based Sentiment Analysis [NAACL-2024] |
|
Emerging |
| 1250 |
metterian/peep-talk
A Situational Conversation-Based English Education Platform |
|
Emerging |
| 1251 |
StarCC0/starcc-py
简繁转换 簡繁轉換 Python implementation of StarCC, the next generation of... |
|
Emerging |
| 1252 |
LG-1/video_music_book_datasets
NLP NER datasets video/music/book bio |
|
Emerging |
| 1253 |
Vishnunkumar/doc_transformers
Document processing using transformers |
|
Emerging |
| 1254 |
shibing624/judger
自动作文评分工具,支持中文、英文作文智能评分,支持评分模型自训练,支持WEKA处理模型数据,支持自定义评分算法。java开发。 |
|
Emerging |
| 1255 |
som-shahlab/trove
Weakly supervised medical named entity classification |
|
Emerging |
| 1256 |
samzshi0529/HanziNLP
A NLP package for Chinese text:Preprocessing, Tokenization, Chinese Fonts,... |
|
Emerging |
| 1257 |
SDM-TIB/falcon2.0
Falcon 2.0 is a joint entity and relation linking tool over Wikidata. |
|
Emerging |
| 1258 |
ARBML/tkseem
Arabic Tokenization Library. It provides many tokenization algorithms. |
|
Emerging |
| 1259 |
TheHamkerCat/python-arq
Asynchronous Python Wrapper For A.R.Q API. |
|
Emerging |
| 1260 |
cjymz886/find-Chinese-medical-words
发现新词 无监督词库生成 医学词库生成 发现未登录词 |
|
Emerging |
| 1261 |
zhpmatrix/BERTem
论文实现(ACL2019):《Matching the Blanks: Distributional Similarity for Relation Learning》 |
|
Emerging |
| 1262 |
kssteven418/LTP
[KDD'22] Learned Token Pruning for Transformers |
|
Emerging |
| 1263 |
21han/nlp_qa_project
Natural Language Processing Question Answering Final Project |
|
Emerging |
| 1264 |
zliucr/coach
Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling (ACL-2020) |
|
Emerging |
| 1265 |
Koziev/GrammarEngine
Грамматический Словарь Русского Языка (+ английский, японский, etc) |
|
Emerging |
| 1266 |
ajitrajasekharan/unsupervised_NER
Self-supervised NER prototype - updated version (69 entity types - 17 broad... |
|
Emerging |
| 1267 |
1429904852/Aspect-Based-Sentiment-Analysis
A paper list for aspect based sentiment analysis. |
|
Emerging |
| 1268 |
mit-nlp/MITIE
MITIE: library and tools for information extraction |
|
Emerging |
| 1269 |
frankplus/meena-chatbot
Google's Meena transformer chatbot implementation |
|
Emerging |
| 1270 |
didasy/tldr
Text summarizer for golang using LexRank |
|
Emerging |
| 1271 |
fregu856/CS224n_project
Neural Image Captioning in TensorFlow. |
|
Emerging |
| 1272 |
oliverguhr/fullstop-deep-punctuation-prediction
A model that predicts the punctuation of English, Italian, French and German texts. |
|
Emerging |
| 1273 |
shawnh2/QA-CivilAviationKG
基于民航业知识图谱的自动问答系统 |
|
Emerging |
| 1274 |
bionlplab/radtext
Python Radiology Text Analysis System |
|
Emerging |
| 1275 |
cosmoquester/2021-dialogue-summary-competition
[2021 훈민정음 한국어 음성•자연어 인공지능 경진대회] 대화요약 부문 알라꿍달라꿍 팀의 대화요약 학습 및 추론 코드를 공유하기 위한 레포입니다. |
|
Emerging |
| 1276 |
gnana70/tamil_ocr
OCR Tamil is a powerful tool that can detect and recognize text in Tamil... |
|
Emerging |
| 1277 |
erickrf/multiffn-nli
Implementation of the multi feed-forward network architecture by Parikh et... |
|
Emerging |
| 1278 |
datquocnguyen/jLDADMM
A Java package for the LDA and DMM topic models |
|
Emerging |
| 1279 |
patrickschur/language-detection
A language detection library for PHP. Detects the language from a given text string. |
|
Emerging |
| 1280 |
mynameisvinn/EmailParser
remove signature blocks from emails |
|
Emerging |
| 1281 |
rdspring1/PyTorch_GBW_LM
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset |
|
Emerging |
| 1282 |
cocoa-ai/SentimentCoreMLDemo
😃 iOS11 demo application for sentiment polarity analysis. |
|
Emerging |
| 1283 |
NonvolatileMemory/AAAI_2019_EXAM
Official implementation of "Explicit Interaction Model towards Text Classification" |
|
Emerging |
| 1284 |
daiquocnguyen/Graph-Transformer
Universal Graph Transformer Self-Attention Networks (TheWebConf WWW 2022)... |
|
Emerging |
| 1285 |
cjymz886/text_bert_cnn
在bert模型的pre_training基础上进行text_cnn文本分类 |
|
Emerging |
| 1286 |
yeyupiaoling/PunctuationModel
中文标点符号模型,可以给文本添加标点符号。 |
|
Emerging |
| 1287 |
andy840314/QANet-pytorch-
A Pytorch implementation of QANet |
|
Emerging |
| 1288 |
napsternxg/DeepSequenceClassification
Deep neural network based model for sequence to sequence classification |
|
Emerging |
| 1289 |
snipsco/snips-nlu-ontology
Ontology of Snips NLU |
|
Emerging |
| 1290 |
AMontgomerie/CEFR-English-Level-Predictor
NLP system for predicting the reading difficulty level of a text in terms of... |
|
Emerging |
| 1291 |
pemistahl/lingua
The most accurate natural language detection library for Java and the JVM,... |
|
Emerging |
| 1292 |
Receiling/UniRE
Source code for "UniRE: A Unified Label Space for Entity Relation... |
|
Emerging |
| 1293 |
rlayers/pawpaw
Text Processing & Segmentation Framework |
|
Emerging |
| 1294 |
houbb/pinyin
The high performance pinyin tool for java.(java 高性能中文转拼音工具。支持同音字。) |
|
Emerging |
| 1295 |
thunlp/paragraph2vec
Paragraph Vector Implementation |
|
Emerging |
| 1296 |
AI4Bharat/Indic-BERT-v1
Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and... |
|
Emerging |
| 1297 |
houbb/nlp-hanzi-similar
The hanzi similar tool.(汉字相似度计算工具,中文形近字算法。可用于手写汉字识别纠正,文本混淆等。) |
|
Emerging |
| 1298 |
ikegami-yukino/neologdn
Japanese text normalizer for mecab-neologd |
|
Emerging |
| 1299 |
clipperhouse/uax29
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split... |
|
Emerging |
| 1300 |
olivettigroup/materials-synthesis-generative-models
Public release of data and code for materials synthesis generation |
|
Emerging |