Document Chunking NLP Tools
There are 3 document chunking tools tracked. 1 score above 50 (established tier). The highest-rated is mirth/chonky at 50/100 with 407 stars and 312 monthly downloads.
Get all 3 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=document-chunking&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
mirth/chonky
Fully neural approach for text chunking |
|
Established |
| 2 |
sentencizer/sentencizer
A sentence splitting (sentence boundary disambiguation) library for Go. It... |
|
Emerging |
| 3 |
prajwal10001/semantic-chunker-langchain
Token-aware, LangChain-compatible semantic chunker with PDF, markdown, and... |
|
Experimental |