voxel51/reconstruction-error-ratios

Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!

15
/ 100
Experimental

This tool helps machine learning engineers and data scientists quickly assess the quality and difficulty of their image classification datasets. By analyzing images and their associated labels, it provides scores that reveal how challenging a dataset is and points out potential labeling errors. You input your image classification dataset, and it outputs overall dataset difficulty, class-level difficulty, and a list of potentially mislabeled images.

No commits in the last 6 months.

Use this if you need to understand the inherent difficulty of your image classification task or want to efficiently find and fix mistakes in your dataset labels.

Not ideal if your dataset does not consist of images with classification labels or if you are not working with computer vision models.

image-classification dataset-quality label-error-detection computer-vision data-curation
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 0 / 25

How are scores calculated?

Stars

28

Forks

Language

Python

License

Last pushed

Jan 10, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/voxel51/reconstruction-error-ratios"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.