chonkie-inc/chonkiejs
🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library
Supports multiple chunking strategies (recursive, token-based, semantic, and neural) through a modular package architecture, with optional HuggingFace tokenizer integration for improved accuracy. Built specifically for RAG pipelines, it provides on-the-fly chunking with token counting capabilities and includes cloud-based options via api.chonkie.ai for advanced algorithms without local dependencies.
318 stars.
Stars
318
Forks
9
Language
TypeScript
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/chonkie-inc/chonkiejs"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
chonkie-inc/chonkie
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust...
speedyk-005/chunklet-py
One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs,...
andreshere00/Splitter_MR
Chunk your data into markdown text blocks for your LLM applications
jchunk-io/jchunk
JChunk is a lightweight and flexible library designed to provide multiple strategies for text...
messkan/rag-chunk
A Python CLI to test, benchmark, and find the best RAG chunking strategy for your Markdown documents.