Welding-Torch/Excel-Anonymizer
A Python script that anonymizes an Excel file and synthesizes new data in its place.
Uses Microsoft Presidio to detect PII (names, phone numbers, emails, locations, dates, credit card numbers) and Faker to generate synthetic data, arranged as a two-stage pipeline: identification, then deterministic replacement. It processes Excel files directly from the CLI, installs the required spaCy NLP model automatically, and produces irreversibly anonymized output that cannot be decrypted or reverse-engineered.
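The two-stage shape described above (detect, then replace) can be illustrated with a minimal standard-library sketch. This is not the project's code: the regex detectors stand in for Presidio, the hash-seeded generator stands in for Faker, and every name here (`PATTERNS`, `detect`, `fake_value`, `anonymize_cell`, `anonymize_rows`) is hypothetical. Rows are plain Python lists rather than a real workbook, so no Excel library is needed to try it.

```python
import hashlib
import random
import re

# Stage 1 stand-in: simple regex detectors. The real project is described as
# using Microsoft Presidio; these two patterns are illustrative assumptions.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "PHONE": re.compile(r"\+?\d[\d\s().-]{7,}\d"),
}


def detect(text):
    """Return (start, end, entity_type) spans found in text, sorted by start."""
    spans = []
    for entity_type, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            spans.append((match.start(), match.end(), entity_type))
    return sorted(spans)


def fake_value(original, entity_type, salt=""):
    """Stage 2 stand-in for Faker: the same input always yields the same fake.

    The RNG is seeded from a hash of the original, so nothing is stored that
    maps fakes back to originals. A secret salt is advisable in practice,
    because unsalted hashes of low-entropy values can be guessed.
    """
    digest = hashlib.sha256((salt + original).encode("utf-8")).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    if entity_type == "EMAIL":
        return f"user{rng.randrange(10**6):06d}@example.com"
    return f"555-{rng.randrange(10**7):07d}"


def anonymize_cell(text):
    """Replace every detected span in one cell, leaving other text intact."""
    out, cursor = [], 0
    for start, end, entity_type in detect(text):
        if start < cursor:  # skip a span that overlaps an earlier match
            continue
        out.append(text[cursor:start])
        out.append(fake_value(text[start:end], entity_type))
        cursor = end
    out.append(text[cursor:])
    return "".join(out)


def anonymize_rows(rows):
    """Anonymize string cells; numbers, None, and other types pass through."""
    return [
        [anonymize_cell(cell) if isinstance(cell, str) else cell for cell in row]
        for row in rows
    ]
```

Calling `anonymize_rows([["Alice", "alice@corp.com", 42]])` leaves `"Alice"` and `42` untouched here, because this sketch has no name detector, and swaps the email for an `@example.com` address. Repeated occurrences of the same email map to the same fake value, which is one plausible reading of "deterministic replacement".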
No commits in the last 6 months.
Stars: 84
Forks: 5
Language: Python
License: MIT
Category:
Last pushed: Sep 25, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Welding-Torch/Excel-Anonymizer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DataFog/datafog-python
Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines...
vmenger/deduce
Deduce: de-identification method for Dutch medical text
aphp/eds-pseudo
EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports
seanpedrick-case/doc_redaction
Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface....
thoughtbot/top_secret
Filter sensitive information from free text before sending it to external services or APIs, such...