AstraBert/ingest-anything

From data to vector database effortlessly

50 / 100
Established

Supports diverse file formats (DOCX, CSV, JSON, XML, code files) and web content through format-specific pipelines: text files are converted with PdfItDown and then chunked with Chonkie, while code files go through the semantic-aware CodeChunker. Integrates with LlamaIndex vector stores (Qdrant, Weaviate) and multiple embedding providers (Sentence Transformers, OpenAI, Cohere), and includes an agentic RAG interface for automated document ingestion and querying workflows.
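
The format-specific routing described above can be pictured with a minimal sketch. This is not the library's API: `plan_pipeline`, the extension sets, and the stage labels are hypothetical names chosen for illustration, and the real project may classify files differently. Web content is left out because the description does not say how it is processed.

```python
from pathlib import Path

# Hypothetical extension sets (assumptions, not taken from the library).
CODE_EXTENSIONS = {".py", ".js", ".ts", ".go", ".rs", ".java"}
TEXT_EXTENSIONS = {".docx", ".csv", ".json", ".xml"}


def plan_pipeline(path: str) -> list[str]:
    """Return the processing stages a file would pass through.

    Stage labels only name the tools mentioned in the description;
    no real library calls are made here.
    """
    suffix = Path(path).suffix.lower()
    if suffix in CODE_EXTENSIONS:
        # Code files go straight to the code-aware chunker.
        return ["CodeChunker"]
    if suffix in TEXT_EXTENSIONS:
        # Text-like files are converted first, then chunked.
        return ["PdfItDown", "Chonkie"]
    raise ValueError(f"no pipeline known for {suffix!r}")
```

Calling `plan_pipeline("report.docx")` yields the conversion-then-chunking route, while `plan_pipeline("main.py")` yields the code route. In the real tool, the resulting chunks would then be embedded and written to the chosen vector store.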

No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 2 / 25
Adoption 9 / 25
Maturity 24 / 25
Community 15 / 25
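
The four category scores above add up to the overall 50 / 100. The page does not state its formula, so treating the total as a plain sum of the categories is an assumption; the short check below only confirms that the listed numbers are consistent with it.

```python
# Category scores as listed on this page, each out of 25.
subscores = {"Maintenance": 2, "Adoption": 9, "Maturity": 24, "Community": 15}
MAX_PER_CATEGORY = 25

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(subscores.values())
maximum = MAX_PER_CATEGORY * len(subscores)

print(f"{total} / {maximum}")  # prints "50 / 100"
```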


Stars 89
Forks 12
Language Python
License MIT
Last pushed May 17, 2025
Commits (30d) 0
Dependencies 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/AstraBert/ingest-anything"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
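
For scripted access, the same request can be made from Python using only the standard library. This is a minimal sketch under two assumptions that the page does not confirm: that the endpoint returns JSON, and that other projects follow the same category/owner/repository path pattern as the curl example. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, mirroring the path in the curl example."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the response body is JSON."""
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("vector-db", "AstraBert", "ingest-anything")` issues the same GET request as the curl command. At 100 keyless requests per day, it is worth caching the result locally rather than refetching on every run.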