awslabs/project-lakechain
:zap: Cloud-native, AI-powered, document processing pipelines on AWS.
Provides 60+ composable middleware components for text extraction, transcription, summarization, and RAG pipelines, deployable as infrastructure-as-code using AWS CDK. Automatically scales document processing across Lambda, ECS, and SageMaker with GPU/CPU options, handling millions of documents while scaling to zero when idle. Supports custom middlewares for extensibility and includes 50+ ready-made examples covering video analysis, podcast generation, and multimodal AI workflows.
186 stars.
Stars
186
Forks
27
Language
TypeScript
License
Apache-2.0
Category
Last pushed
Jan 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mlops/awslabs/project-lakechain"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.