visual-layer/fastdup
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
Leverages an optimized C++ engine for duplicate detection, outlier identification, and label quality assessment across unlabeled or labeled datasets. Provides interactive web UI and static gallery visualizations, with native support for both image and video formats. Integrates seamlessly into Python workflows via pip and includes exportable results compatible with major ML frameworks.
1,834 stars.
Stars
1,834
Forks
87
Language
Python
License
—
Category
Last pushed
Feb 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/visual-layer/fastdup"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Cloud-CV/EvalAI
:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI
fireindark707/Python-Schema-Matching
A python tool using XGboost and sentence-transformers to perform schema matching task on tables.
graphbookai/graphbook
Visual AI development framework for training and inference of ML models, scaling pipelines, and...
Alir3z4/tb-query
A CLI tool and MCP (Model Context Protocol) server for querying and analyzing TensorBoard event...
RAILethicsHub/rail-score
Python SDK