Fardeen37/Data-Duplication-Remover-ML
A powerful machine learning based tool for detecting, analyzing, and removing duplicates in CSV datasets. Includes text similarity detection, numeric near-duplicate clustering, ML classification, visual analytics, and data cleaning. Features both Streamlit and Flask apps with ngrok support for easy deployment.
Stars
1
Forks
—
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Dec 27, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Fardeen37/Data-Duplication-Remover-ML"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Cloud-CV/EvalAI
:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI
fireindark707/Python-Schema-Matching
A python tool using XGboost and sentence-transformers to perform schema matching task on tables.
graphbookai/graphbook
Visual AI development framework for training and inference of ML models, scaling pipelines, and...
visual-layer/fastdup
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and...
josh-ashkinaze/plurals
Plurals: A System for Guiding LLMs Via Simulated Social Ensembles