brian-lou/Training-Data-Extraction-Attack-on-LLMs

This project explores training data extraction attacks on the LLaMa 7B, GPT-2XL, and GPT-2-IMDB models to discover memorized content using perplexity, perturbation scoring metrics, and large scale search queries.

/ 100

Emerging

No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 9 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Category

membership-inference-attacks

Last pushed

Jun 15, 2023

Commits (30d)

GitHub

Membership Inference Attacks · 45 frameworks

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/brian-lou/Training-Data-Extraction-Attack-on-LLMs"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

google/scaaml

SCAAML: Side Channel Attacks Assisted with Machine Learning

Koukyosyumei/AIJack

Security and Privacy Risk Simulator for Machine Learning (arXiv:2312.17667)

pralab/secml

A Python library for Secure and Explainable Machine Learning

AI-SDC/SACRO-ML

Collection of tools and resources for managing the statistical disclosure control of trained...

oss-slu/mithridatium

Mithridatium is a research-driven project aimed at detecting backdoors and data poisoning in...

Explore ML Frameworks

All categories Trending ML Framework directory Insights