dcai-course/dcai-lab

Lab assignments for Introduction to Data-Centric AI, MIT IAP 2024 👩🏽‍💻

44
/ 100
Emerging

Covers nine progressive labs spanning label error detection via Confident Learning, multi-annotator dataset curation, outlier identification, active learning, feature interpretability, and membership inference attacks. Jupyter notebooks combine hands-on implementation of data-centric techniques (data quality improvement, augmentation, prompt engineering) with black-box model evaluation, emphasizing how dataset engineering outperforms model-centric optimization. Integrates with standard ML libraries and LLMs, with Labs 8+ supporting Colab execution for zero-setup experimentation.

479 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 9 / 25
Community 25 / 25

How are scores calculated?

Stars

479

Forks

162

Language

Jupyter Notebook

License

AGPL-3.0

Last pushed

Feb 24, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/dcai-course/dcai-lab"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.