kobkrit/nlp_thai_resources
More than 50 collections of Thai Natural Language Processing libraries. Updated daily.
Aggregates 50+ Thai NLP tools spanning tokenization (DeepCut and CutKum, reported at 98%+ F-measure), POS tagging, named entity recognition, and word embeddings, with implementations in Python, Java, Rust, and JavaScript. Emphasizes deep learning approaches (RNNs, LSTMs, CNNs) for segmentation tasks alongside traditional algorithms such as maximal matching and HMM taggers. Includes downloadable corpora (InterBEST, ORCHID) and multilingual resources such as the LEXiTRON Thai-English dictionary to support end-to-end NLP pipelines.
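As a rough illustration of the maximal matching approach mentioned above, here is a minimal sketch in Python. It assumes the fewest-words formulation (choose the segmentation with the fewest tokens, preferring dictionary words over unknown single characters). The function name `maximal_matching`, the `max_word_len` parameter, and the tiny dictionary are hypothetical and are not taken from any library in the collection.

```python
def maximal_matching(text, dictionary, max_word_len=20):
    """Segment text into the fewest tokens, using dictionary words where possible.

    Characters not covered by any dictionary word become single-character
    tokens. Illustrative sketch only, not the method of any listed tool.
    """
    n = len(text)
    inf = float("inf")
    # best[i] = (token count, unknown-token count, start of last token) for text[:i]
    best = [(inf, inf, -1)] * (n + 1)
    best[0] = (0, 0, -1)

    for end in range(1, n + 1):
        for start in range(max(0, end - max_word_len), end):
            piece = text[start:end]
            known = piece in dictionary
            # Unknown pieces are only allowed as single characters.
            if not known and end - start > 1:
                continue
            tokens, unknown, _ = best[start]
            candidate = (tokens + 1, unknown + (0 if known else 1), start)
            if candidate < best[end]:
                best[end] = candidate

    # Walk back through the stored start indices to recover the tokens.
    tokens_out = []
    pos = n
    while pos > 0:
        start = best[pos][2]
        tokens_out.append(text[start:pos])
        pos = start
    return tokens_out[::-1]


if __name__ == "__main__":
    toy_dictionary = {"ฉัน", "กิน", "ข้าว"}  # hypothetical three-word dictionary
    print(maximal_matching("ฉันกินข้าว", toy_dictionary))
```

A greedy longest-match tokenizer would commit to the longest word at each position. The sketch above instead compares whole segmentations, which is why it can prefer two dictionary words over one longer word followed by an unknown character.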
391 stars. No commits in the last 6 months.
Stars: 391
Forks: 79
Language: —
License: —
Category:
Last pushed: Apr 09, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/kobkrit/nlp_thai_resources"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
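The same request can be sketched in Python using only the standard library. This is a minimal example under two assumptions that the page does not confirm: that the endpoint returns JSON, and that the URL follows a `category/owner/repo` pattern generalized from the single curl example above. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example; the path pattern beyond it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL for one repository (pattern inferred, not documented)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record for a repository, assuming the response body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    data = fetch_quality("nlp", "kobkrit", "nlp_thai_resources")
    print(data)
```

Since unauthenticated access is limited to 100 requests per day, it may be worth caching responses locally rather than calling the endpoint repeatedly.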
Higher-rated alternatives
gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
RichardLitt/low-resource-languages
Resources for conservation, development, and documentation of low resource (human) languages.
ksopyla/awesome-nlp-polish
A curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models,...
datanada/Awesome-Korean-NLP
A curated list of resources for NLP (Natural Language Processing) for Korean
oroszgy/awesome-hungarian-nlp
A curated list of NLP resources for Hungarian