chonkie-inc/chonkiejs

🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library

/ 100

Emerging

Supports multiple chunking strategies (recursive, token-based, semantic, and neural) through a modular package architecture, with optional HuggingFace tokenizer integration for improved accuracy. Built specifically for RAG pipelines, it provides on-the-fly chunking with token counting capabilities and includes cloud-based options via api.chonkie.ai for advanced algorithms without local dependencies.

318 stars.

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 8 / 25

How are scores calculated?

Stars

318

Forks

Language

TypeScript

License

MIT

Compare

chonkiejs and chonkie

Higher-rated alternatives

chonkie-inc/chonkie

🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust...

speedyk-005/chunklet-py

One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs,...

andreshere00/Splitter_MR

Chunk your data into markdown text blocks for your LLM applications

jchunk-io/jchunk

JChunk is a lightweight and flexible library designed to provide multiple strategies for text...

messkan/rag-chunk

A Python CLI to test, benchmark, and find the best RAG chunking strategy for your Markdown documents.

Explore RAG Tools

All categories Trending RAG directory Insights