jparkerweb/semantic-chunking

🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows

61
/ 100
Established

Performs semantic chunking by embedding sentences with ONNX models and grouping them based on cosine similarity scores, with configurable thresholds and optional chunk rebalancing. Supports multiple embedding models including quantized variants (q4, q8), and can return chunk embeddings for RAG workflows. Deployable as a Node.js library, microservice API, or Docker container with an included web UI for interactive configuration.

134 stars and 5,194 monthly downloads. Used by 1 other package. Available on npm.

Maintenance 10 / 25
Adoption 20 / 25
Maturity 18 / 25
Community 13 / 25

How are scores calculated?

Stars

134

Forks

14

Language

JavaScript

License

MIT

Last pushed

Feb 03, 2026

Monthly downloads

5,194

Commits (30d)

0

Dependencies

5

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/jparkerweb/semantic-chunking"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.