presidio and Redactify

Redactify builds upon Presidio as a downstream wrapper, combining it with additional NLP techniques (transformer-based NER and regex) to create a higher-level redaction interface, making them complements rather than competitors.

presidio
80
Verified
Redactify
27
Experimental
Maintenance 23/25
Adoption 10/25
Maturity 25/25
Community 22/25
Maintenance 6/25
Adoption 4/25
Maturity 9/25
Community 8/25
Stars: 7,198
Forks: 960
Downloads:
Commits (30d): 22
Language: Python
License: MIT
Stars: 8
Forks: 1
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
No risk flags
No Package No Dependents

About presidio

microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Built on modular components (Analyzer for detection, Anonymizer for transformation, Image-Redactor for visual PII), Presidio combines NER, regex patterns, and checksum validation with context-aware logic across multiple languages. Deployable as Python/PySpark libraries, Docker containers, or Kubernetes clusters, it supports external model integration and handles specialized formats like DICOM medical images alongside standard text and structured data.

About Redactify

rushilpatel21/Redactify

Redactify is an efficient data redaction tool that secures sensitive text using advanced NLP and rule-based methods. It combines transformer-based NER, regex, and Presidio analysis to detect and mask personal information through full redaction or partial masking—ensuring compliance while preserving data utility.

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work