Table Extraction OCR ML Frameworks
Tools for detecting, extracting, and recognizing tables from document images using computer vision and OCR techniques. Does NOT include general document processing, chart/figure extraction, or standalone OCR for unstructured text.
There are 23 table extraction ocr frameworks tracked. The highest-rated is Layout-Parser/layout-parser at 46/100 with 5,678 stars.
Get all 23 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=table-extraction-ocr&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis |
|
Emerging |
| 2 |
Psarpei/Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for... |
|
Emerging |
| 3 |
ses4255/Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math,... |
|
Emerging |
| 4 |
Sudhanshu1304/table-transformer
🔍 Table Extraction Tool: A powerful open-source solution combining OCR and... |
|
Emerging |
| 5 |
asagar60/TableNet-pytorch
Pytorch Implementation of TableNet |
|
Emerging |
| 6 |
JG1VPP/MuTabNet
ICDAR 2024 Table OCR Model |
|
Emerging |
| 7 |
VikParuchuri/tabled
Detect and extract tables to markdown and csv |
|
Emerging |
| 8 |
MrZilinXiao/Hyper-Table-OCR
A carefully-designed OCR pipeline for universal boarded table recognition... |
|
Emerging |
| 9 |
Cvrane/ChartReader
Fully automated end-to-end framework to extract data from bar plots and... |
|
Experimental |
| 10 |
MasterAI-EAM/GraphMaster
Fully automated end to end framework to extract data from complex charts and... |
|
Experimental |
| 11 |
abdoelsayed2016/TNCR_Dataset
Deep learning, Convolutional neural networks, Image processing, Document... |
|
Experimental |
| 12 |
muhd-umer/pyramidtabnet
Official PyTorch implementation of PyramidTabNet: Transformer-based Table... |
|
Experimental |
| 13 |
Kaisanya/NanoTabVLM
📊 Transform images of tables into accurate HTML text with NanoTabVLM, a... |
|
Experimental |
| 14 |
CaseDrive/publaynet-models
Trained Detectron2 object detection models for document layout analysis... |
|
Experimental |
| 15 |
AshishSalaskar1/TableNet_Implementation
Extract Tabular data from scanned document images and save the tabular data... |
|
Experimental |
| 16 |
stuartemiddleton/glosat_table_dataset
GloSAT Historical Measurement Table Dataset |
|
Experimental |
| 17 |
SAP-samples/clustertabnet
Implementation of the table detection and table structure recognition deep... |
|
Experimental |
| 18 |
FutureRising007/Table_Structure_Recognition
Table Structure Recognition |
|
Experimental |
| 19 |
Wa1den-jy/Topic-on-Table-Recognition
This is a survey on the topic of table recognition |
|
Experimental |
| 20 |
shahin-ro/Table-Detection
Python tool for table extraction & Persian OCR. Uses OpenCV for table... |
|
Experimental |
| 21 |
kanhao100/IndustrialDigitDatasetGenerator
A tool for generating number image datasets in industrial scenarios. |
|
Experimental |
| 22 |
NeelDevenShah/Document-Layout-Analysis-for-Figure-Localization-in-Technical-PDFs
This repository provides three state-of-the-art models fine-tuned on the... |
|
Experimental |
| 23 |
Gwinke/TIDE-4-Bachelor-Project-2025
TIDE-4: An End-to-End Pipeline for Scientific Table Extraction in... |
|
Experimental |