redhuntlabs/Octopii
An AI-powered Personal Identifiable Information (PII) scanner.
Combines OCR (Tesseract), regex pattern matching, and NLP (spaCy/NLTK) to extract and classify PII types from images, PDFs, and documents with face detection via Haar cascades. Supports scanning from local filesystems, S3 buckets, and Apache open directory listings, outputting structured JSON with detected identifiers, contact information, and geolocation data.
725 stars. No commits in the last 6 months.
Stars
725
Forks
63
Language
Python
License
—
Category
Last pushed
Jan 22, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/redhuntlabs/Octopii"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
nmap/nmap
Nmap - the Network Mapper. Github mirror of official SVN repository.
e-m-b-a/emba
EMBA - The firmware security analyzer
ait-testbed/attackbed
The AttackBed is a simulated enterprise network with numerous vulnerabilities. Attacks in this...
ritesh-gupta-git/AI-Powered-Vulnerability-Management
AI-VMF: AI-Powered Vulnerability Management Framework demo (ensemble exploit prediction + risk scoring)
scorpiondefense/cyberweapons
Automated Cyber Offense