aphp/edspdf

EDS-PDF is a generic, pure-Python framework for extracting text from PDF documents. It provides the machinery to classify text blocks as body or metadata, using either rule-based or machine-learning-based approaches.
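To make the rule-based side of that description concrete, here is a minimal sketch of classifying text blocks with a rectangular mask. It does not use EDS-PDF's own API: `TextBlock`, `Mask`, `overlap_ratio`, `classify`, and the 0.5 threshold are all hypothetical names and values chosen for illustration, and coordinates are assumed to be relative page positions between 0 and 1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextBlock:
    """A piece of extracted text with its bounding box (hypothetical type)."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


@dataclass(frozen=True)
class Mask:
    """A labelled page region; blocks mostly inside it receive its label."""
    label: str
    x0: float
    y0: float
    x1: float
    y1: float


def overlap_ratio(block: TextBlock, mask: Mask) -> float:
    """Return the fraction of the block's area that lies inside the mask."""
    width = max(0.0, min(block.x1, mask.x1) - max(block.x0, mask.x0))
    height = max(0.0, min(block.y1, mask.y1) - max(block.y0, mask.y0))
    area = (block.x1 - block.x0) * (block.y1 - block.y0)
    return (width * height) / area if area > 0 else 0.0


def classify(block: TextBlock, mask: Mask, threshold: float = 0.5,
             default: str = "metadata") -> str:
    """Assign the mask's label when enough of the block falls inside it."""
    return mask.label if overlap_ratio(block, mask) >= threshold else default


# Illustrative data: a header near the top of the page and a body paragraph.
blocks = [
    TextBlock("Hospital letterhead", 0.1, 0.02, 0.9, 0.08),
    TextBlock("The patient was admitted on Monday.", 0.1, 0.3, 0.9, 0.5),
]
body_mask = Mask("body", 0.0, 0.2, 1.0, 0.9)
labels = [classify(block, body_mask) for block in blocks]
```

A machine-learning approach would replace `classify` with a trained model that predicts the label from the block's text and layout features; the mask rule is only the simplest case.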

/ 100

Established

No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 13 / 25

Maturity 25 / 25

Community 12 / 25


Stars

Forks

Language: Python

License: BSD-3-Clause

Category: document-intelligence-extraction

Last pushed: Feb 12, 2025

Monthly downloads: 189

Commits (30d)

Dependencies


Document Intelligence Extraction · 99 frameworks

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/aphp/edspdf"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
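The same request can be sketched in Python with only the standard library. This assumes the endpoint returns JSON, which the page does not state, and that other projects follow the same owner/repo URL pattern; `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
import urllib.request

# Base path taken from the curl example; the owner/repo pattern is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality endpoint URL for a given repository."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and decode it, assuming a JSON response body."""
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Example call (performs a network request, so it is left commented out):
# print(json.dumps(fetch_quality("aphp", "edspdf"), indent=2))
```

With the keyless limit of 100 requests/day, it is worth caching results locally when polling several repositories.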

Related frameworks

paperless-ngx/paperless-ngx

A community-supported supercharged document management system: scan, index and archive all your documents

GoogleCloudPlatform/document-ai-samples

Sample applications and demos for Document AI, the end-to-end document processing platform on...

aws-solutions/document-understanding-solution

Example of integrating & using Amazon Textract, Amazon Comprehend, Amazon Comprehend Medical,...

naiveHobo/InvoiceNet

Deep neural network to extract intelligent information from invoice documents.

jonaswinkler/paperless-ng

A supercharged version of paperless: scan, index and archive all your physical documents
