paperless-ngx and paperless-ng
Paperless-ngx is a community-maintained fork that supersedes paperless-ng, with the latter being an abandoned predecessor that is no longer actively developed.
About paperless-ngx
paperless-ngx/paperless-ngx
A community-supported supercharged document management system: scan, index and archive all your documents
Leverages OCR (Tesseract) to extract searchable text from scanned documents, supports bulk importing via IMAP email integration and file watchers, and provides a REST API alongside a Django/Angular web interface. Built on PostgreSQL for document metadata with containerized Docker deployment, enabling self-hosted document workflows with full-text search, tagging, and automatic document classification.
About paperless-ng
jonaswinkler/paperless-ng
A supercharged version of paperless: scan, index and archive all your physical documents
Performs automatic OCR and full-text indexing on documents (PDF, images, Office formats via Apache Tika), with machine learning-powered auto-tagging of correspondents and document types. Provides a modern single-page web frontend with relevance-ranked full-text search, email ingestion with filtering rules, and parallel document processing optimized for multi-core systems. Stores documents plainly on disk with configurable naming schemes, integrates with network scanners via FTP or mobile apps, and ships as a Docker Compose deployment.
Scores updated daily from GitHub, PyPI, and npm data. How scores work