Layout-Parser/layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

/ 100

Emerging

Provides pre-trained deep learning models (EfficientDet, Detectron2-based) for layout detection via unified APIs, with specialized data structures for spatial filtering and region-based operations on document elements. Integrates OCR backends like Tesseract and supports loading/serializing layouts from JSON, CSV, and PDF formats. Designed as an open platform for community contribution of detection models and document analysis pipelines.

5,678 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

5,678

Forks

525

Language

Python

License

Apache-2.0

Related frameworks

Psarpei/Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and...

ses4255/Versatile-OCR-Program

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

Sudhanshu1304/table-transformer

🔍 Table Extraction Tool: A powerful open-source solution combining OCR and computer vision for...

asagar60/TableNet-pytorch

Pytorch Implementation of TableNet

JG1VPP/MuTabNet

ICDAR 2024 Table OCR Model

Explore ML Frameworks

All categories Trending ML Framework directory Insights