presidio and Redactify
Redactify builds upon Presidio as a downstream wrapper, combining it with additional NLP techniques (transformer-based NER and regex) to create a higher-level redaction interface, making them complements rather than competitors.
About presidio
microsoft/presidio
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
Built on modular components (Analyzer for detection, Anonymizer for transformation, Image-Redactor for visual PII), Presidio combines NER, regex patterns, and checksum validation with context-aware logic across multiple languages. Deployable as Python/PySpark libraries, Docker containers, or Kubernetes clusters, it supports external model integration and handles specialized formats like DICOM medical images alongside standard text and structured data.
About Redactify
rushilpatel21/Redactify
Redactify is an efficient data redaction tool that secures sensitive text using advanced NLP and rule-based methods. It combines transformer-based NER, regex, and Presidio analysis to detect and mask personal information through full redaction or partial masking—ensuring compliance while preserving data utility.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work