wuyoscar/ISC-Bench
Internal Safety Collapse: turning LLMs into a "jailbroken state" without a jailbreak attack.
Provides both single-turn templates and an autonomous agent-based execution mode for systematically triggering safety failures in frontier LLMs through incomplete professional workflows (e.g., biomedical analysis, chemical synthesis) rather than adversarial prompts. The benchmark combines domain-specific task templates across biology, chemistry, and epidemiology with layered evaluation scaffolds, and pairs them with an ISC Arena leaderboard that tracks vulnerability patterns across models, supporting API-based reproducibility and community submissions via GitHub Issues.
677 stars. Actively maintained with 289 commits in the last 30 days.
Stars: 677
Forks: 127
Language: Python
License: —
Category:
Last pushed: Mar 28, 2026
Commits (30d): 289
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/wuyoscar/ISC-Bench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
yueliu1999/Awesome-Jailbreak-on-LLMs
Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods...
yiksiu-chan/SpeakEasy
[ICML 2025] Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
xirui-li/DrAttack
Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes...
tmlr-group/DeepInception
[arXiv:2311.03191] "DeepInception: Hypnotize Large Language Model to Be Jailbreaker"
Techiral/awesome-llm-jailbreaks
Latest AI jailbreak payloads and exploit techniques for GPT, Qwen, and other LLMs