yamanalab/ec-darkpattern

[IEEE BigData 2022] Dark patterns in e-commerce: a dataset and its baseline evaluations

41
/ 100
Emerging

Provides a TSV dataset of 1,818 dark pattern texts paired with non-dark pattern samples scraped from e-commerce sites using Puppeteer. Implements dual baseline approaches: classical bag-of-words models (logistic regression, SVM, gradient boosting) and transformer-based architectures (BERT, RoBERTa, XLNet) for binary classification, with RoBERTa_large achieving 97.5% accuracy. Includes complete experimental pipelines and web scraping utilities for dataset collection and reproducibility.

No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

40

Forks

14

Language

Python

License

Apache-2.0

Last pushed

Mar 16, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/yamanalab/ec-darkpattern"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.