thu-coai/Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs.

Overall score: 43 / 100 (Emerging)

Contains 100k Chinese safety prompts across 7 typical scenarios (insult, discrimination, crimes, physical harm, mental health, privacy, ethics) and 6 instruction-attack types, each paired with a ChatGPT response for training safer models. The data is accessible via Hugging Face Datasets and organized in JSON format, and is designed primarily for fine-tuning rather than evaluation; the project recommends SafetyBench for benchmarking. It complements the broader Safety-Prompts ecosystem, including ShieldLM (a customizable safety detector), and integrates with Tsinghua's Chinese LLM safety-evaluation platform.
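Since the summary describes the data as JSON prompt/response pairs grouped by scenario, a minimal sketch of working with such a file is shown below. The record layout (a scenario name mapping to a list of `{"prompt", "response"}` objects) and the field names are assumptions for illustration, not the repo's documented schema:

```python
import json
from collections import Counter

# Hypothetical miniature of the dataset layout: scenario name -> list of
# {"prompt", "response"} records. Field names are assumed for illustration.
raw = '''
{
  "Insult": [
    {"prompt": "…", "response": "…"}
  ],
  "Privacy": [
    {"prompt": "…", "response": "…"},
    {"prompt": "…", "response": "…"}
  ]
}
'''

data = json.loads(raw)

# Count records per scenario, e.g. to balance a fine-tuning mix.
counts = Counter({scenario: len(records) for scenario, records in data.items()})
print(counts)

# Flatten into (prompt, response) pairs for supervised fine-tuning.
pairs = [(r["prompt"], r["response"]) for records in data.values() for r in records]
print(len(pairs))
```

The same loop works whether you download the JSON directly from the repo or export it from Hugging Face Datasets.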

1,135 stars. No commits in the last 6 months.

Flags: Stale (6 months) · No package · No dependents

Maintenance: 0 / 25
Adoption: 10 / 25
Maturity: 16 / 25
Community: 17 / 25


Stars: 1,135
Forks: 88
Language: (none listed)
License: Apache-2.0
Category: guardrails
Last pushed: Feb 27, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/thu-coai/Safety-Prompts"

Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000/day.
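For programmatic access, a small sketch that builds the endpoint URL (the path shape is taken from the curl command above) and fetches the payload. The response schema is not documented here, so `fetch_quality` simply decodes whatever JSON comes back; it is defined but not called in this example:

```python
import json
import urllib.request
from urllib.parse import quote

API_BASE = "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering"

def quality_url(owner: str, repo: str) -> str:
    # Build the quality-endpoint URL for a GitHub repo,
    # URL-encoding each path segment defensively.
    return f"{API_BASE}/{quote(owner, safe='')}/{quote(repo, safe='')}"

def fetch_quality(owner: str, repo: str) -> dict:
    # Fetch and decode the JSON payload; the schema is undocumented here.
    with urllib.request.urlopen(quality_url(owner, repo)) as resp:
        return json.load(resp)

print(quality_url("thu-coai", "Safety-Prompts"))
# e.g. data = fetch_quality("thu-coai", "Safety-Prompts")
```

Without an API key this stays within the 100 requests/day limit, so cache responses rather than refetching per page load.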