HydroXai/pii-masker

PII Masker is an open-source tool for protecting sensitive data by automatically detecting and masking PII using advanced AI, powered by DeBERTa-v3. It provides high-precision detection, scalable performance, and a simple Python API for seamless integration into workflows, ensuring privacy compliance in various industries.

31
/ 100
Emerging

Supports both structured PII extraction and masked text output through a two-stage pipeline combining tokenization, DeBERTa-v3 inference, and entity recognition. Integrates with Milvus vector database for scalable storage and includes a Longformer variant offering 4096-token context (vs. 1024) with ~4% improved detection accuracy, plus planned OCR-based video frame analysis for multi-modal PII protection.

157 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 13 / 25

How are scores calculated?

Stars

157

Forks

15

Language

Jupyter Notebook

License

Last pushed

Dec 03, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/HydroXai/pii-masker"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.