liu00222/Open-Prompt-Injection
This repository provides a benchmark for prompt injection attacks and defenses in LLM-integrated applications.
Provides modular factory methods for constructing attacks (combining multiple injection tasks), defenses (DataSentinel detection, PromptLocate localization), and LLM-integrated applications across multiple models (PaLM2, Llama, GPT). Introduces the attack success value (ASV) as a quantitative evaluation metric and includes a detection-plus-localization pipeline that identifies contaminated prompts and recovers the original data. Supports configuration-driven experimentation with pre-built task datasets (sentiment analysis, spam detection) and fine-tuned checkpoint integration via LoRA adapters.
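The ASV metric mentioned above can be sketched as the fraction of injected prompts for which the model's response completes the attacker's injected task. This is a minimal illustrative sketch, not the repository's implementation: the function name and the substring-based success check are assumptions, and real evaluation would use each task's own scoring logic.

```python
def attack_success_value(responses, injected_targets):
    """ASV: fraction of responses that accomplish the injected task.

    Success here is approximated by a case-insensitive substring match
    against the injected task's expected answer (an assumption; the
    benchmark scores each task with its own evaluator).
    """
    assert len(responses) == len(injected_targets)
    hits = sum(
        1
        for resp, target in zip(responses, injected_targets)
        if target.strip().lower() in resp.strip().lower()
    )
    return hits / len(responses)


# Example: a spam-detection injected task where the attacker wants "spam".
responses = ["spam", "This looks legitimate.", "Answer: spam"]
targets = ["spam", "spam", "spam"]
print(attack_success_value(responses, targets))  # 2/3
```

A higher ASV under a given defense means the defense is less effective at blocking that attack strategy.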
Stars
406
Forks
64
Language
Python
License
MIT
Category
Last pushed
Oct 29, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/liu00222/Open-Prompt-Injection"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
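The curl command above can also be issued from Python. This is a hedged sketch using only the endpoint path shown in the example; the structure of the JSON response (field names such as "stars") is an assumption based on the listing and may differ.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering"


def repo_quality_url(owner: str, name: str) -> str:
    """Build the per-repository endpoint URL."""
    return f"{BASE}/{owner}/{name}"


def fetch_repo_quality(owner: str, name: str) -> dict:
    """Fetch and decode the JSON quality record for one repository."""
    with urllib.request.urlopen(repo_quality_url(owner, name)) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Usage (performs a real network request, subject to the rate limit):
# data = fetch_repo_quality("liu00222", "Open-Prompt-Injection")
# print(data)
```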
Related tools
cybozu/prompt-hardener
Prompt Hardener analyzes prompt-injection-originated risk in LLM-based agents and applications.
R3dShad0w7/PromptMe
PromptMe is an educational project that showcases security vulnerabilities in large language...
lakeraai/pint-benchmark
A benchmark for prompt injection detection systems.
StavC/Here-Comes-the-AI-Worm
Here Comes the AI Worm: Preventing the Propagation of Adversarial Self-Replicating Prompts...
grepstrength/WideOpenAI
Short list of indirect prompt injection attacks for OpenAI-based models.