Korean Text Processing NLP Tools

Tools and libraries specifically for Korean language tokenization, morphological analysis, and text preprocessing. Does NOT include general multilingual NLP tools, language identification, or Korean-specific applications like sentiment analysis or named entity recognition.

There are 48 korean text processing tools tracked. 2 score above 70 (verified tier). The highest-rated is lovit/soynlp at 75/100 with 984 stars and 122,443 monthly downloads. 1 of the top 10 are actively maintained.

Get all 48 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=korean-text-processing&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 lovit/soynlp

한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.

75
Verified
2 bab2min/kiwipiepy

Python API for Kiwi

70
Verified
3 hyunwoongko/kss

KSS: Korean String processing Suite

63
Established
4 bab2min/Kiwi

Kiwi(지능형 한국어 형태소 분석기)

60
Established
5 JDongian/python-jamo

Hangul syllable decomposition and synthesis using jamo.

57
Established
6 shineware/KOMORAN

Korean Morphological Analyzer by shineware

55
Established
7 naver/claf

CLaF: Open-Source Clova Language Framework

47
Emerging
8 konlpy/konlpy

Python package for Korean natural language processing.

44
Emerging
9 haven-jeon/PyKoSpacing

Automatic Korean word spacing with Python

43
Emerging
10 lovit/soyspacing

띄어쓰기 오류 교정 라이브러리입니다. CRF 와 같은 머신러닝 알고리즘이 아닌, 직관적인 접근법으로 띄어쓰기를 교정합니다.

41
Emerging
11 open-korean-text/open-korean-text

Open Korean Text Processor - An Open-source Korean Text Processor

40
Emerging
12 rokoroku/node-twitter-korean-text

(Deprecated) use open-korean-text

39
Emerging
13 abdalimran/pykotokenizer

PyKoTokenizer is a Korean text tokenizer for Korean Natural Language...

36
Emerging
14 uosdmlab/spark-nkp

Natural Korean Processor for Apache Spark

35
Emerging
15 bage79/nlp4kor

Natural Language Processing for Korean with Deep Learning

35
Emerging
16 koshort/koshort

(deprecated) :cat: koshort is a Python package for Korean internet spoken...

32
Emerging
17 aws-samples/sm-kornlp

A collection of Korean NLP hands-on labs on Amazon SageMaker

30
Emerging
18 Kyubyong/KoParadigm

KoParadigm: Korean Inflectional Paradigm Generator

29
Experimental
19 bab2min/kiwi-gui

C# API for Kiwi

28
Experimental
20 pnuailab/parser

한국어 문장 분석 시스템 BCD-KL-Parser

28
Experimental
21 L0Z1K/para-Kor

Create paraphrasing korean sentence with GPT-3

28
Experimental
22 open-korean-text/open-korean-text-4clj

Open Korean Text Processor wrapper for Clojure

27
Experimental
23 QuoQA-NLP/KoQuillBot

✍️ Korean Paraphrasing Tool Using Round-trip Translation

27
Experimental
24 bit2r/bitNLP

Tools that support "Natural Language Processing" for Korean text analytics.

27
Experimental
25 fuzzythecat/awesome-spacer

Automatic Korean word spacing with TensorFlow 2.0 + Keras

27
Experimental
26 ttytu/UKTA-web

Unififed Korean Text Analyzer including morpheme analysis, lexical features,...

25
Experimental
27 Huffon/nlp-startups

국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록

25
Experimental
28 seoyeon9646/KorSEC

KorSEC : Korean Space Error Correction

25
Experimental
29 JoonkyuChoi/polyglot-ko-1.3b-lite

Lite Korean language model

24
Experimental
30 mcognetta/ThreeHotKoreanModeling

A repo for parameter-efficient Korean character-level language modeling.

24
Experimental
31 A-baoYang/NLP-techniques-chinese

For learning. Collecting techniques of each step from knowledge graph...

24
Experimental
32 Seokii/Korean_NLP_Tutorial

한국어 자연어처리 튜토리얼

24
Experimental
33 kyle-bong/K-TACC

문맥을 고려한 한국어 텍스트 데이터 증강

23
Experimental
34 cspaliwal/KSS

🚀 Build your own kernel from scratch with KSS, an open-source guide that...

23
Experimental
35 iKnowLab-Projects/ko-flan

한국어 FLAN 데이터 구축과 모델 학습을 위한 프로젝트

22
Experimental
36 nakosung/hangul-asm

Hangul disasm/asm

22
Experimental
37 sangdee/kss-java

Korean Sentence Splitter

20
Experimental
38 JAICHANGPARK/flutter_kiwi_nlp

Kiwi 기반 한국어 형태소 분석 Flutter 플러그인입니다. Native-first Flutter plugin for Korean...

20
Experimental
39 shineware/tutorials

KOMORAN Tutorials

20
Experimental
40 brandazine/elasticsearch-kiwi-analysis-plugin

Kiwi 형태소 분석기 ElasticSearch 플러그인 (Unofficial)

19
Experimental
41 pipidog/CNLP

A toolbox for Chinese Natural Language Processing

18
Experimental
42 shineware/RKOMORAN

RKOMORAN is KOMORAN wrapper for R users

15
Experimental
43 bit2r/bitTA

기능이 bitNLP로 이관되었습니다. bitNLP를 사용하시기 바랍니다.

14
Experimental
44 binjang/NIKL-dictionary-parser

Unofficial parser for NIKL Dictionary files.

13
Experimental
45 takuti/hive-udf-tokenize_ko

Korean NLP on Hive

11
Experimental
46 oh-gnues-iohc/korean-qa-paraphrase

This repository contains datasets and training resources for paraphrasing...

11
Experimental
47 dilohn/ch-en-scaffolding

research with machine learning to determine best scaffolding for bilingual students

11
Experimental
48 oh-gnues-iohc/korean-noise-augmentation

Generate noise by separating Korean input into consonants and vowels, and...

10
Experimental