chonkie and Sentences-Chunker
These are competitors offering alternative approaches to document chunking for RAG systems, with Chonkie providing a production-ready, feature-rich library while Sentences-Chunker offers a specialized alternative focused on intelligent semantic segmentation.
About chonkie
chonkie-inc/chonkie
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines
Provides pluggable chunking strategies—recursive, semantic, code-aware, and LLM-based—with composable pipeline workflows that chain multiple chunkers and refineries together. Integrates with 32+ tools across tokenizers (GPT-2, BPE), embeddings (OpenAI, Sentence Transformers), vector databases, and LLMs, while supporting 56 languages out-of-the-box through modular dependency installation.
About Sentences-Chunker
smart-models/Sentences-Chunker
Cutting-edge tool designed to intelligently segment text documents into optimally-sized chunks
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work