wuyoscar/ISC-Bench

Internal Safety Collapse: Turning LLMs into a "Jailbroken State" Without "a Jailbreak Attack".

68
/ 100
Established

Provides both single-turn templates and autonomous agent-based execution modes for systematically triggering safety failures in frontier LLMs through incomplete professional workflows (e.g., biomedical analysis, chemical synthesis) rather than adversarial prompts. The benchmark uses domain-specific task templates across biology, chemistry, and epidemiology with layered evaluation scaffolds, paired with an ISC Arena leaderboard tracking vulnerability patterns across models via API-based reproducibility and community submissions through GitHub Issues.

677 stars. Actively maintained with 289 commits in the last 30 days.

No Package No Dependents
Maintenance 25 / 25
Adoption 10 / 25
Maturity 9 / 25
Community 24 / 25

How are scores calculated?

Stars

677

Forks

127

Language

Python

License

Last pushed

Mar 28, 2026

Commits (30d)

289

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/wuyoscar/ISC-Bench"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.