Vietnamese NLP Tools
Comprehensive NLP resources, toolkits, and datasets specifically for Vietnamese language processing tasks. Includes Vietnamese-specific tools, corpora, and task-specific models. Does NOT include general multilingual NLP tools, language-agnostic frameworks, or non-Vietnamese language resources.
There are 50 vietnamese nlp tools tracked. 1 score above 50 (established tier). The highest-rated is vunb/vntk at 57/100 with 218 stars and 302 monthly downloads.
Get all 50 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=vietnamese-nlp-tools&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
vunb/vntk
Vietnamese NLP Toolkit for Node |
|
Established |
| 2 |
VinAIResearch/PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging,... |
|
Emerging |
| 3 |
vncorenlp/VnCoreNLP
A Vietnamese natural language processing toolkit (NAACL 2018) |
|
Emerging |
| 4 |
IBM/transition-amr-parser
SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments... |
|
Emerging |
| 5 |
sheng-z/stog
AMR Parsing as Sequence-to-Graph Transduction |
|
Emerging |
| 6 |
nert-nlp/AMR-Bibliography
Organized inventory of research using the Abstract Meaning Representation |
|
Emerging |
| 7 |
duyvuleo/VNTC
A Large-scale Vietnamese News Text Classification Corpus |
|
Emerging |
| 8 |
Nguyendat-bit/VieTokenizer
Vietnamese Tokenizer package based on deeplearning methods |
|
Emerging |
| 9 |
undertheseanlp/NLP-Vietnamese-progress
Repository to track the progress in Vietnamese Natural Language Processing,... |
|
Emerging |
| 10 |
anhthuan1999/Vietnamese-News-Classification
We use LSTM, BiLSTM, BERT and SVM with TF-IDF, Word2vec and Bag-of-words to... |
|
Emerging |
| 11 |
henryle97/Spelling_Correction_Vietnamese
Vietnamese spelling error correction with Seq2Seq model |
|
Emerging |
| 12 |
mailong25/bert-vietnamese-question-answering
Vietnamese question answering system with BERT |
|
Emerging |
| 13 |
undertheseanlp/chatbot
Vietnamese Chatbot |
|
Emerging |
| 14 |
vTuanpham/Vietnamese_QA_System
Vietnamese long form question answering system with documents retrieval. |
|
Emerging |
| 15 |
WhySchools/VMDS-vietnamese-misspell-dataset-from-Social-media
Vietnamese Misspell Dataset - Tập dữ liệu chính tả tiếng Việt trên mạng xã hội |
|
Emerging |
| 16 |
VinAIResearch/PhoNER_COVID19
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021) |
|
Experimental |
| 17 |
plandes/amr
AMR annotation and feature generation |
|
Experimental |
| 18 |
undertheseanlp/word_tokenize
Vietnamese Word Tokenize |
|
Experimental |
| 19 |
bmd1905/vietnamese-correction
A project improves the quality and accuracy of the Vietnamese language. |
|
Experimental |
| 20 |
phongnt570/UETsegmenter
A toolkit for Vietnamese word segmentation |
|
Experimental |
| 21 |
tkhangg0910/ViConBERT
Official Codebase for ViConBERT: Context-Gloss Aligned Vietnamese Word... |
|
Experimental |
| 22 |
duongntbk/restore_vietnamese_diacritics
A Transformer based NLP solution to restore diacritics for Vietnamese text... |
|
Experimental |
| 23 |
nschneid/amr-tutorial
Abstract Meaning Representation (AMR) tutorial slides |
|
Experimental |
| 24 |
tienthanhdhcn/Vietnamese-Accent-Prediction
A simple/fast/accurate accent prediction for non-accented Vietnamese text |
|
Experimental |
| 25 |
VietHoang1512/vietnamese-spell-correct-and-text-classify
A spell corrector and text classifier using Deep Neural Network |
|
Experimental |
| 26 |
datnnt1997/bert_vn_ner
PyTorch solution of Vietnamese Named Entity Recognition task with Google... |
|
Experimental |
| 27 |
PB3002/ViMedical_Disease
A Vietnamese dataset of over 12 thousands questions about common disease... |
|
Experimental |
| 28 |
v-bible/crawler
A collection of web crawlers to crawl Catholic resources in Vietnamese language |
|
Experimental |
| 29 |
wonrax/phobert-base-vietnamese-sentiment
PhoBERT fine-tuned for sentiment analysis |
|
Experimental |
| 30 |
pbcquoc/vietnamese_word_seperate
Seperate vietnamese using lstm |
|
Experimental |
| 31 |
ds4v/vietnamese-pos-tagging
Gán nhãn từ loại Tiếng Việt sử dụng mô hình Hidden Markov kết hợp thuật toán Viterbi |
|
Experimental |
| 32 |
Avi197/Phobert-Named-Entity-Reconigtion
Applied Phobert model by VinAI research for Vietnamese NER task on various dataset |
|
Experimental |
| 33 |
bug-breeder/vant
AI-powered Vietnamese Input Method for macOS — Rust core +... |
|
Experimental |
| 34 |
dangvansam/phobert-text-classification
Phân loại văn bản Tiếng Việt sử dụng pretrained model - PhoBERT |
|
Experimental |
| 35 |
209sontung/Vietnamese-stock-article-classification
Sentiment-based classification for stock article title using PhoBert |
|
Experimental |
| 36 |
chanind/penman-js
Abstract Meaning Representation (AMR) parser and generator for Javascript |
|
Experimental |
| 37 |
tien02/ensemble-roberta-fasttext-vietnamese
Ensemble PhoBERT with FastText Embedding to improve performance on... |
|
Experimental |
| 38 |
telexyz/data
Tổng hợp ngữ liệu tiếng Việt |
|
Experimental |
| 39 |
NamSyntax/Vietnamese-QA
Vietnamese-QA is very simple with XLM-RoBERTa fine-tuned on the Vietnamese... |
|
Experimental |
| 40 |
xndien2004/ViAMR
[VLSP 2025] ViAMR: Fine-tuning LLMs for Abstract Meaning Representation in Vietnamese |
|
Experimental |
| 41 |
AnhHoang0529/Small-LexNormViHSD
A Dataset for Vietnamese Lexical Normalization |
|
Experimental |
| 42 |
kh4nh12/ViTASA
A novel dataset and method for Vietnamese Target-Aspect-Sentiment joint... |
|
Experimental |
| 43 |
phkhanhtrinh23/vietnamese_ner_bert
Vietnamese Named-Entity Recognition. |
|
Experimental |
| 44 |
nicolay-r/ViLongT5
LongT5-based model pre-trained on a large amount of unlabeled Vietnamese... |
|
Experimental |
| 45 |
vietbtx/ViTextnormASR
Our source code for the paper "Transformer-based Joint Learning Approach for... |
|
Experimental |
| 46 |
ngxtnhi/ViLexNorm
A Lexical Normalization Corpus for Vietnamese Social Media Text |
|
Experimental |
| 47 |
dinhanhx/vcc
Vietnamese Conceptual Caption |
|
Experimental |
| 48 |
manhtt-079/vipubmed-deberta
ViPubmedDeBERTa: A Pre-trained Model for Vietnamese Biomedical Text (PACLIC 2023) |
|
Experimental |
| 49 |
Vinfall/CnGal-to-VNDB
A naïve tool to detect missing CnGal releases on VNDB |
|
Experimental |
| 50 |
ndthuan/vi-word-segmenter
HTTP wrapper of the VnCoreNLP library - A Vietnamese natural language... |
|
Experimental |