semantic-chunking and MSchunker
semantic-chunking scores
Maintenance: 10/25
Adoption: 20/25
Maturity: 25/25
Community: 13/25
MSchunker scores
Maintenance: 6/25
Adoption: 5/25
Maturity: 18/25
Community: 0/25
semantic-chunking stats
Stars: 134
Forks: 14
Downloads: 5,194
Commits (30d): 0
Language: JavaScript
License: MIT
MSchunker stats
Stars: 1
Forks: —
Downloads: 41
Commits (30d): 0
Language: Python
License: MIT
No risk flags
No Dependents
About semantic-chunking
jparkerweb/semantic-chunking
🍱 semantic-chunking ⇢ semantically create chunks from large documents for passing to LLM workflows
Performs semantic chunking by embedding sentences with ONNX models and grouping them based on cosine similarity scores, with configurable thresholds and optional chunk rebalancing. Supports multiple embedding models including quantized variants (q4, q8), and can return chunk embeddings for RAG workflows. Deployable as a Node.js library, microservice API, or Docker container with an included web UI for interactive configuration.
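The grouping step described above can be illustrated with a minimal Python sketch. This is not the library's code or API (semantic-chunking is a JavaScript package that uses ONNX embedding models). The names `toy_embed`, `cosine`, and `semantic_chunks` are hypothetical, the bag-of-words vector is a stand-in for a real sentence embedding, and the threshold value is arbitrary. Chunk rebalancing is not shown.

```python
import math
from collections import Counter


def toy_embed(sentence):
    # Assumption: a bag-of-words count vector stands in for a real
    # sentence-embedding model.
    return Counter(sentence.lower().split())


def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def semantic_chunks(sentences, threshold=0.3, embed=toy_embed):
    # Append a sentence to the current chunk while it stays similar
    # to the previous sentence; otherwise start a new chunk.
    chunks = []
    prev_vec = None
    for sentence in sentences:
        vec = embed(sentence)
        if chunks and cosine(prev_vec, vec) >= threshold:
            chunks[-1].append(sentence)
        else:
            chunks.append([sentence])
        prev_vec = vec
    return [" ".join(chunk) for chunk in chunks]


sentences = [
    "cats chase mice",
    "cats chase birds",
    "stocks fell today",
    "stocks rose today",
]
result = semantic_chunks(sentences, threshold=0.3)
```

With these inputs the two sentences about cats share two of three words, as do the two about stocks, so the sketch produces two chunks. Raising the threshold makes the grouping stricter and yields more, smaller chunks.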
About MSchunker
cspnms/MSchunker
Smart text chunker for LLM preprocessing (sections → paragraphs → sentences → hard splits).
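The fallback order in that description (sections → paragraphs → sentences → hard splits) can be sketched as a recursive splitter. This is an illustration under stated assumptions, not MSchunker's implementation. The function name `chunk`, the `max_chars` limit, and the separator rules in `LEVELS` are all hypothetical, and the sketch does not merge small neighbouring pieces back together.

```python
import re

# Hypothetical split levels, coarsest first; the real rules may differ.
LEVELS = [
    re.compile(r"\n{3,}"),         # sections: assumed separated by 2+ blank lines
    re.compile(r"\n{2}"),          # paragraphs: separated by one blank line
    re.compile(r"(?<=[.!?])\s+"),  # sentences: whitespace after . ! or ?
]


def chunk(text, max_chars, level=0):
    text = text.strip()
    if not text:
        return []
    if len(text) <= max_chars:
        return [text]
    if level >= len(LEVELS):
        # Last resort: fixed-width hard split.
        return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]
    pieces = []
    for part in LEVELS[level].split(text):
        # Anything still too long falls through to the next, finer level.
        pieces.extend(chunk(part, max_chars, level + 1))
    return pieces


doc = (
    "First point. Second point follows.\n\n"
    "Short paragraph.\n\n\n"
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
)
result = chunk(doc, max_chars=25)
```

Here the first section is too long and splits into paragraphs. Its first paragraph is still too long and splits into sentences. The 26-letter run has no finer boundary, so it is hard-split at 25 characters.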
Scores updated daily from GitHub, PyPI, and npm data.