lexy-ai/lexy

Data pipelines for AI applications

/ 100

Emerging

Provides document ingestion with configurable cloud storage (S3/GCS), task-based processing via Celery workers, and structured data extraction from unstructured content. Built as a containerized REST API with PostgreSQL persistence and optional embedding integration (OpenAI), accessible through a Python SDK or Swagger interface. Enables modular pipeline construction for RAG, agent context management, and document indexing workflows.

Available on PyPI.

Maintenance 10 / 25

Adoption 9 / 25

Maturity 18 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

Apache-2.0

Category

agentic-workflow-orchestration

Last pushed

Mar 02, 2026

Monthly downloads

Commits (30d)

Dependencies

GitHub PyPI

Agentic Workflow Orchestration · 87 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/lexy-ai/lexy"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

airweave-ai/airweave

Open-source context retrieval layer for AI agents

lotus-data/lotus

AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings....

similigh/simili-bot

AI-powered GitHub issue intelligence - semantic duplicate detection, cross-repo search, and...

superduper-io/superduper

Superduper: End-to-end framework for building custom AI applications and agents.

supabase/headless-vector-search

Supabase Toolkit to perform vector similarity search on your knowledge base embeddings.

Explore Embedding Tools

All categories Trending Embeddings directory Insights