chonkie and chunky

These are complementary tools: Chonkie handles the core chunking and ingestion for RAG pipelines, while Chunky provides validation, visualization, and editing capabilities for inspecting and refining the chunks that Chonkie produces.

chonkie: 83 (Verified)
Maintenance 25/25
Adoption 15/25
Maturity 25/25
Community 18/25
Stars: 3,829
Forks: 256
Downloads:
Commits (30d): 53
Language: Python
License: MIT
No risk flags

chunky: 33 (Emerging)
Maintenance 13/25
Adoption 6/25
Maturity 9/25
Community 5/25
Stars: 17
Forks: 1
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Risk flags: No Package, No Dependents

About chonkie

chonkie-inc/chonkie

🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines

Provides pluggable chunking strategies—recursive, semantic, code-aware, and LLM-based—with composable pipeline workflows that chain multiple chunkers and refineries together. Integrates with 32+ tools across tokenizers (GPT-2, BPE), embeddings (OpenAI, Sentence Transformers), vector databases, and LLMs, while supporting 56 languages out-of-the-box through modular dependency installation.
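
To make the "recursive" strategy mentioned above concrete, here is a minimal standard-library sketch of the general idea: split on the coarsest separator first and fall back to finer ones only when a piece is still too large. This is an illustration under stated assumptions, not Chonkie's implementation or API; the function name `recursive_chunk`, the `SEPARATORS` list, and the character-based `max_chars` limit are all hypothetical (a real pipeline would more likely count tokens).

```python
from typing import List

# Hypothetical separator hierarchy (an assumption for this sketch):
# paragraphs first, then single lines, then words.
SEPARATORS = ["\n\n", "\n", " "]


def recursive_chunk(text: str, max_chars: int = 200, level: int = 0) -> List[str]:
    """Split text into chunks of at most max_chars characters each."""
    if len(text) <= max_chars:
        # Small enough already; drop whitespace-only pieces.
        return [text] if text.strip() else []
    if level >= len(SEPARATORS):
        # No separators left: fall back to a hard cut every max_chars characters.
        return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]

    sep = SEPARATORS[level]
    chunks: List[str] = []
    buffer = ""
    for part in text.split(sep):
        # Greedily merge neighbouring parts while they still fit in one chunk.
        candidate = buffer + sep + part if buffer else part
        if len(candidate) <= max_chars:
            buffer = candidate
            continue
        if buffer.strip():
            chunks.append(buffer)
        buffer = ""
        if len(part) <= max_chars:
            buffer = part
        else:
            # The part alone is too large: retry with the next, finer separator.
            chunks.extend(recursive_chunk(part, max_chars, level + 1))
    if buffer.strip():
        chunks.append(buffer)
    return chunks


if __name__ == "__main__":
    sample = "First paragraph. It has two sentences.\n\nSecond paragraph is here."
    for chunk in recursive_chunk(sample, max_chars=50):
        print(repr(chunk))
```

The greedy merge keeps chunks close to the size limit instead of emitting one chunk per paragraph or word, which is the usual motivation for this family of chunkers.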

About chunky

GiovanniPasq/chunky

Validate, visualize, edit, and export chunks for RAG pipelines.
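
As a rough illustration of what validating and exporting chunks can involve, the sketch below flags empty, oversized, and duplicate chunks and serializes the rest of the data to JSON. It uses only the Python standard library and does not reflect Chunky's actual checks, interface, or export format; `validate_chunks`, `chunks_to_json`, and the issue labels are hypothetical names chosen for this example.

```python
import json
from typing import Dict, List


def validate_chunks(chunks: List[str], max_chars: int) -> List[Dict[str, object]]:
    """Return one record per problem found; an empty list means every check passed."""
    issues: List[Dict[str, object]] = []
    seen = set()
    for index, chunk in enumerate(chunks):
        if not chunk.strip():
            issues.append({"index": index, "issue": "empty"})
        elif len(chunk) > max_chars:
            issues.append({"index": index, "issue": "too_long"})
        elif chunk in seen:
            issues.append({"index": index, "issue": "duplicate"})
        seen.add(chunk)
    return issues


def chunks_to_json(chunks: List[str]) -> str:
    """Serialize chunks as a JSON array of {"index", "text"} objects."""
    records = [{"index": i, "text": c} for i, c in enumerate(chunks)]
    return json.dumps(records, ensure_ascii=False, indent=2)


if __name__ == "__main__":
    sample = ["ok", "", "x" * 20, "ok"]
    print(validate_chunks(sample, max_chars=10))
    print(chunks_to_json(sample))
```

Reporting issues by index rather than raising on the first failure keeps the result easy to display or review, which suits an inspect-then-edit workflow.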

Scores updated daily from GitHub, PyPI, and npm data.