aryn-ai/sycamore

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

54 / 100 · Established

Sycamore uses Aryn DocParse, a GPU-powered document segmentation API with a DETR vision model trained on 80k+ enterprise documents, to partition complex PDFs, images, tables, and infographics while preserving semantic structure. It is built on a scalable DocSet abstraction with functional Python transforms for data extraction, enrichment, and cleaning, and it loads the results into vector databases (OpenSearch, Elasticsearch, Pinecone, DuckDB, Qdrant, Weaviate), using a Ray backend for distributed processing.

592 stars. Actively maintained with 5 commits in the last 30 days.

No Package · No Dependents
Maintenance 16 / 25
Adoption 10 / 25
Maturity 9 / 25
Community 19 / 25


Stars: 592
Forks: 68
Language: Python
License: Apache-2.0
Last pushed: Mar 12, 2026
Commits (30d): 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/aryn-ai/sycamore"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
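
The same request can be made from Python. The sketch below uses only the standard library and the URL pattern from the curl command above. The helper names `quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which the page does not state.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example; the trailing owner/repo segments vary.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    owner_part = urllib.parse.quote(owner, safe="")
    repo_part = urllib.parse.quote(repo, safe="")
    return f"{BASE_URL}/{owner_part}/{repo_part}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for a repository.

    Assumption: the response body is JSON. If it is not, json.load raises
    a ValueError and the caller should inspect the raw body instead.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("aryn-ai", "sycamore")` issues one unauthenticated request, which counts against the 100 requests/day limit, so results are worth caching when polling several repositories.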