HydroXai/pii-masker
PII Masker is an open-source tool for protecting sensitive data by automatically detecting and masking PII using advanced AI, powered by DeBERTa-v3. It provides high-precision detection, scalable performance, and a simple Python API for seamless integration into workflows, ensuring privacy compliance in various industries.
Supports both structured PII extraction and masked text output through a two-stage pipeline combining tokenization, DeBERTa-v3 inference, and entity recognition. Integrates with Milvus vector database for scalable storage and includes a Longformer variant offering 4096-token context (vs. 1024) with ~4% improved detection accuracy, plus planned OCR-based video frame analysis for multi-modal PII protection.
157 stars. No commits in the last 6 months.
Stars
157
Forks
15
Language
Jupyter Notebook
License
—
Category
Last pushed
Dec 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/HydroXai/pii-masker"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
LLAMATOR-Core/llamator
Red Teaming python-framework for testing chatbots and GenAI systems.
sleeepeer/PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented...
JuliusHenke/autopentest
CLI enabling more autonomous black-box penetration tests using Large Language Models (LLMs)
kelkalot/simpleaudit
Allows to red-team your AI systems through adversarial probing. It is simple, effective, and...
SecurityClaw/SecurityClaw
A modular, skill-based autonomous Security Operations Center (SOC) agent that monitors...