Pinak-Datta/wiz-craft
A CLI-based dataset preprocessing tool for machine learning tasks. Features include data exploration, null value handling, one-hot encoding, and feature scaling, and download the modified dataset effortlessly.
This tool helps data scientists prepare raw datasets for machine learning models. You input a dataset with various data types and it outputs a cleaned, transformed dataset, ready for model training. It's designed for data scientists who need to quickly handle common data preparation tasks.
No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly inspect, clean, and transform your datasets using a command-line interface before training machine learning models.
Not ideal if you prefer a graphical user interface or need highly customized, advanced data manipulation techniques beyond standard preprocessing.
Stars
16
Forks
4
Language
Python
License
MIT
Category
Last pushed
Oct 18, 2023
Commits (30d)
0
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/Pinak-Datta/wiz-craft"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
alan-turing-institute/CleverCSV
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement...
kalininalab/DataSAIL
DataSAIL is a tool to split datasets while reducing information leakage.
Vetdatahub/VetDataHub
VetDataHub is an opensource veterinary datasets repository dedicated to advancing veterinary...
lennox55555/Savvy-CSV
Savvy CSV is an web application designed to effortlessly create the ideal CSV file. By...