Korean Text Processing NLP Tools
Tools and libraries specifically for Korean language tokenization, morphological analysis, and text preprocessing. Does NOT include general multilingual NLP tools, language identification, or Korean-specific applications like sentiment analysis or named entity recognition.
There are 48 korean text processing tools tracked. 2 score above 70 (verified tier). The highest-rated is lovit/soynlp at 75/100 with 984 stars and 122,443 monthly downloads. 1 of the top 10 are actively maintained.
Get all 48 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=korean-text-processing&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
lovit/soynlp
한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다. |
|
Verified |
| 2 |
bab2min/kiwipiepy
Python API for Kiwi |
|
Verified |
| 3 |
hyunwoongko/kss
KSS: Korean String processing Suite |
|
Established |
| 4 |
bab2min/Kiwi
Kiwi(지능형 한국어 형태소 분석기) |
|
Established |
| 5 |
JDongian/python-jamo
Hangul syllable decomposition and synthesis using jamo. |
|
Established |
| 6 |
shineware/KOMORAN
Korean Morphological Analyzer by shineware |
|
Established |
| 7 |
naver/claf
CLaF: Open-Source Clova Language Framework |
|
Emerging |
| 8 |
konlpy/konlpy
Python package for Korean natural language processing. |
|
Emerging |
| 9 |
haven-jeon/PyKoSpacing
Automatic Korean word spacing with Python |
|
Emerging |
| 10 |
lovit/soyspacing
띄어쓰기 오류 교정 라이브러리입니다. CRF 와 같은 머신러닝 알고리즘이 아닌, 직관적인 접근법으로 띄어쓰기를 교정합니다. |
|
Emerging |
| 11 |
open-korean-text/open-korean-text
Open Korean Text Processor - An Open-source Korean Text Processor |
|
Emerging |
| 12 |
rokoroku/node-twitter-korean-text
(Deprecated) use open-korean-text |
|
Emerging |
| 13 |
abdalimran/pykotokenizer
PyKoTokenizer is a Korean text tokenizer for Korean Natural Language... |
|
Emerging |
| 14 |
uosdmlab/spark-nkp
Natural Korean Processor for Apache Spark |
|
Emerging |
| 15 |
bage79/nlp4kor
Natural Language Processing for Korean with Deep Learning |
|
Emerging |
| 16 |
koshort/koshort
(deprecated) :cat: koshort is a Python package for Korean internet spoken... |
|
Emerging |
| 17 |
aws-samples/sm-kornlp
A collection of Korean NLP hands-on labs on Amazon SageMaker |
|
Emerging |
| 18 |
Kyubyong/KoParadigm
KoParadigm: Korean Inflectional Paradigm Generator |
|
Experimental |
| 19 |
bab2min/kiwi-gui
C# API for Kiwi |
|
Experimental |
| 20 |
pnuailab/parser
한국어 문장 분석 시스템 BCD-KL-Parser |
|
Experimental |
| 21 |
L0Z1K/para-Kor
Create paraphrasing korean sentence with GPT-3 |
|
Experimental |
| 22 |
open-korean-text/open-korean-text-4clj
Open Korean Text Processor wrapper for Clojure |
|
Experimental |
| 23 |
QuoQA-NLP/KoQuillBot
✍️ Korean Paraphrasing Tool Using Round-trip Translation |
|
Experimental |
| 24 |
bit2r/bitNLP
Tools that support "Natural Language Processing" for Korean text analytics. |
|
Experimental |
| 25 |
fuzzythecat/awesome-spacer
Automatic Korean word spacing with TensorFlow 2.0 + Keras |
|
Experimental |
| 26 |
ttytu/UKTA-web
Unififed Korean Text Analyzer including morpheme analysis, lexical features,... |
|
Experimental |
| 27 |
Huffon/nlp-startups
국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록 |
|
Experimental |
| 28 |
seoyeon9646/KorSEC
KorSEC : Korean Space Error Correction |
|
Experimental |
| 29 |
JoonkyuChoi/polyglot-ko-1.3b-lite
Lite Korean language model |
|
Experimental |
| 30 |
mcognetta/ThreeHotKoreanModeling
A repo for parameter-efficient Korean character-level language modeling. |
|
Experimental |
| 31 |
A-baoYang/NLP-techniques-chinese
For learning. Collecting techniques of each step from knowledge graph... |
|
Experimental |
| 32 |
Seokii/Korean_NLP_Tutorial
한국어 자연어처리 튜토리얼 |
|
Experimental |
| 33 |
kyle-bong/K-TACC
문맥을 고려한 한국어 텍스트 데이터 증강 |
|
Experimental |
| 34 |
cspaliwal/KSS
🚀 Build your own kernel from scratch with KSS, an open-source guide that... |
|
Experimental |
| 35 |
iKnowLab-Projects/ko-flan
한국어 FLAN 데이터 구축과 모델 학습을 위한 프로젝트 |
|
Experimental |
| 36 |
nakosung/hangul-asm
Hangul disasm/asm |
|
Experimental |
| 37 |
sangdee/kss-java
Korean Sentence Splitter |
|
Experimental |
| 38 |
JAICHANGPARK/flutter_kiwi_nlp
Kiwi 기반 한국어 형태소 분석 Flutter 플러그인입니다. Native-first Flutter plugin for Korean... |
|
Experimental |
| 39 |
shineware/tutorials
KOMORAN Tutorials |
|
Experimental |
| 40 |
brandazine/elasticsearch-kiwi-analysis-plugin
Kiwi 형태소 분석기 ElasticSearch 플러그인 (Unofficial) |
|
Experimental |
| 41 |
pipidog/CNLP
A toolbox for Chinese Natural Language Processing |
|
Experimental |
| 42 |
shineware/RKOMORAN
RKOMORAN is KOMORAN wrapper for R users |
|
Experimental |
| 43 |
bit2r/bitTA
기능이 bitNLP로 이관되었습니다. bitNLP를 사용하시기 바랍니다. |
|
Experimental |
| 44 |
binjang/NIKL-dictionary-parser
Unofficial parser for NIKL Dictionary files. |
|
Experimental |
| 45 |
takuti/hive-udf-tokenize_ko
Korean NLP on Hive |
|
Experimental |
| 46 |
oh-gnues-iohc/korean-qa-paraphrase
This repository contains datasets and training resources for paraphrasing... |
|
Experimental |
| 47 |
dilohn/ch-en-scaffolding
research with machine learning to determine best scaffolding for bilingual students |
|
Experimental |
| 48 |
oh-gnues-iohc/korean-noise-augmentation
Generate noise by separating Korean input into consonants and vowels, and... |
|
Experimental |