tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Provides standardized dataset loading with automatic downloading, caching, and conversion to `tf.data.Dataset` pipelines optimized for training performance. Includes 100+ curated public datasets with deterministic splitting and reproducible ordering across users. Supports custom dataset contributions through a declarative API, enabling community-driven dataset expansion beyond the built-in catalog.
4,550 stars. Actively maintained with 1 commit in the last 30 days.
Stars
4,550
Forks
1,597
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 12, 2026
Commits (30d)
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/tensorflow/datasets"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
open-edge-platform/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage...
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with...
explosion/ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
alan-turing-institute/CleverCSV
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement...
JovianHQ/opendatasets
A Python library for downloading datasets from Kaggle, Google Drive, and other online sources.