chonkie and RAG-chunker

These are competitors in the document-chunking space, as both aim to split and prepare documents for RAG pipelines, though Chonkie is a mature, production-ready library while RAG-chunker appears to be an early-stage project with minimal adoption.

chonkie
83
Verified
RAG-chunker
25
Experimental
Maintenance 25/25
Adoption 15/25
Maturity 25/25
Community 18/25
Maintenance 2/25
Adoption 2/25
Maturity 9/25
Community 12/25
Stars: 3,829
Forks: 256
Downloads:
Commits (30d): 53
Language: Python
License: MIT
Stars: 2
Forks: 1
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
No risk flags
Stale 6m No Package No Dependents

About chonkie

chonkie-inc/chonkie

🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines

Provides pluggable chunking strategies—recursive, semantic, code-aware, and LLM-based—with composable pipeline workflows that chain multiple chunkers and refineries together. Integrates with 32+ tools across tokenizers (GPT-2, BPE), embeddings (OpenAI, Sentence Transformers), vector databases, and LLMs, while supporting 56 languages out-of-the-box through modular dependency installation.

About RAG-chunker

AceAtDev/RAG-chunker

The easiest and most effective way tool to retrain a RAG LLM/GEN AI/Agent on your data

Scores updated daily from GitHub, PyPI, and npm data. How scores work