visual-layer/fastdup

fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.

44 / 100 (Emerging)

Uses an optimized C++ engine for duplicate detection, outlier identification, and label quality assessment on labeled or unlabeled datasets. Provides an interactive web UI and static gallery visualizations, with native support for both image and video formats. Installs via pip, integrates into Python workflows, and exports results in formats compatible with major ML frameworks.
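As a rough illustration of the Python workflow described above, here is a minimal sketch, assuming fastdup has been installed with pip and that its `fastdup.create`, `run`, `similarity`, and `outliers` calls behave as in the project's published examples. The helper name `analyze_images` and the default `work_dir` are hypothetical, chosen only for this example.

```python
from pathlib import Path


def analyze_images(input_dir: str, work_dir: str = "fastdup_work"):
    """Run fastdup over a folder of images and return two result tables.

    `analyze_images` and the default `work_dir` are illustrative names,
    not part of fastdup itself.
    """
    # Fail early with a clear message before doing any heavy work.
    if not Path(input_dir).is_dir():
        raise FileNotFoundError(f"input directory not found: {input_dir}")

    # Imported lazily so this module still loads where fastdup is missing.
    import fastdup  # assumption: installed via `pip install fastdup`

    # Create an analysis object bound to the dataset and a scratch directory.
    fd = fastdup.create(work_dir=work_dir, input_dir=input_dir)

    # Index the dataset and compute similarities.
    fd.run()

    # Tables of similar image pairs and of outlier candidates.
    return fd.similarity(), fd.outliers()
```

Calling `analyze_images("path/to/images")` would return the similarity and outlier tables for further filtering; the gallery visualizations mentioned above are generated separately from the same analysis object.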

1,834 stars.

No Package, No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 9 / 25
Community 15 / 25


Stars: 1,834
Forks: 87
Language: Python
License: (not listed)
Last pushed: Feb 18, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/visual-layer/fastdup"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
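The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions that the page does not confirm: that the endpoint returns JSON, and that the path follows a category/owner/repo pattern as the single example URL suggests. The function names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen

# Base path taken from the example URL on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming a category/owner/repo path layout."""
    parts = (quote(category, safe=""), quote(owner, safe=""), quote(repo, safe=""))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for one repository.

    Assumes the response body is JSON; the schema is not documented here,
    so the parsed object is returned without interpretation.
    """
    request = Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

For this project the call would be `fetch_quality("ml-frameworks", "visual-layer", "fastdup")`. With the unauthenticated limit of 100 requests/day, it is worth caching results rather than re-fetching on every run.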