VSteinborn/politeness-attacks
Code and data for the paper "Politeness Stereotypes and Attack Vectors: Gender Stereotypes in Japanese and Korean Language Models" (arXiv 2023)
No commits in the last 6 months.
Stars: 1
Forks: not listed
Language: Python
License: GPL-3.0
Category: not listed
Last pushed: Jun 19, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/VSteinborn/politeness-attacks"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
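The same request can be sketched in Python using only the standard library. This is a minimal sketch, not the service's official client: the helper names `quality_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page.

```python
import json
import urllib.parse
import urllib.request

# Base URL taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the path.
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. The response schema is not
    documented on this page, so the parsed value is returned unchanged.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Anonymous access is limited to 100 requests/day, so call sparingly.
    print(fetch_quality("nlp", "VSteinborn", "politeness-attacks"))
```

Building the URL in a separate function keeps the network call isolated, so the path construction can be checked without spending any of the daily request quota.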
Higher-rated alternatives
dccuchile/wefe
WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes...
dreji18/Fairness-in-AI
Detecting Bias and ensuring Fairness in AI solutions
amazon-science/bold
Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language...
dhfbk/variationist
Variationist: Exploring Multifaceted Variation and Bias in Written Language Data (ACL 2024 demo track)
microsoft/SafeNLP
Safety Score for Pre-Trained Language Models