llamator and redteam-ai-benchmark

These are complementary tools: LLAMATOR-Core provides a framework for executing red team tests against chatbots and GenAI systems, while redteam-ai-benchmark supplies a structured evaluation methodology and benchmark dataset for assessing LLM vulnerabilities in offensive security contexts.

llamator
65
Established
redteam-ai-benchmark
32
Emerging
Maintenance 10/25
Adoption 16/25
Maturity 25/25
Community 14/25
Maintenance 6/25
Adoption 7/25
Maturity 9/25
Community 10/25
Stars: 201
Forks: 20
Downloads: 275
Commits (30d): 0
Language: Python
License:
Stars: 27
Forks: 3
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No risk flags
No Package No Dependents

About llamator

LLAMATOR-Core/llamator

Red Teaming python-framework for testing chatbots and GenAI systems.

Provides modular attack vectors targeting prompt injection, jailbreaks, system prompt leakage, and resource exhaustion across LLMs, RAGs, and vision models. Supports multiple client integrations including LangChain, OpenAI-compatible APIs, and web interfaces (Selenium, Telethon), with extensible custom attack definitions. Generates detailed audit trails in Excel/CSV formats and DOCX test reports mapped to OWASP LLM vulnerability classifications.

About redteam-ai-benchmark

toxy4ny/redteam-ai-benchmark

Red Team AI Benchmark: Evaluating Uncensored LLMs for Offensive Security

Scores updated daily from GitHub, PyPI, and npm data. How scores work