jchunk-io/jchunk
JChunk is a lightweight and flexible library designed to provide multiple strategies for text chunking within Java applications
Provides fixed-size, recursive character-based, and semantic chunking strategies as modular Maven dependencies, enabling RAG pipelines to select splitting approaches based on use case. The library's pluggable architecture allows independent selection and composition of chunking strategies, with semantic chunking offering context-aware splitting beyond simple delimiter or size-based methods.
Stars
17
Forks
4
Language
Java
License
Apache-2.0
Category
Last pushed
Mar 03, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/jchunk-io/jchunk"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
chonkie-inc/chonkie
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust...
andreshere00/Splitter_MR
Chunk your data into markdown text blocks for your LLM applications
speedyk-005/chunklet-py
One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs,...
chonkie-inc/chonkiejs
🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library
thom-heinrich/chonkify
Extractive document compression for RAG and agent pipelines. +69% vs LLMLingua, +175% vs...