apple/ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

/ 100

Emerging

Synthesizes 650+ hours of first-order ambisonics audio by augmenting LibriSpeech recordings with 200k+ simulated acoustic conditions across 8k+ synthetic rooms. Includes rich spatial labels for source position, speaking direction, room acoustics, and geometry, plus optional distractor noise tracks. Provides PyTorch dataloader and Parquet-based metadata schema for straightforward integration into audio ML pipelines.

125 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 10 / 25

How are scores calculated?

Stars

125

Forks

Language

—

License

—

Related frameworks

Ijwi-ry-Ikirundi-AI/Kirundi_Dataset

🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi language. Building AI...

hstsethi/in-mob-prefix

Dataset, charts, models of 4 digit mobile number prefixes in India by state, operator name.

Jahangirbd23/WenetSpeech-Yue

📑 Explore WenetSpeech-Yue, a comprehensive Cantonese speech corpus with rich annotations,...

Nexdata-AI/359-Hours-Indonesian-Speech-Data-by-Mobile-Phone_Reading

Indonesian Speech Dataset

Nexdata-AI/207-Hours-Japanese-Speaking-English-Speech-Data-by-Mobile-Phone

Japanese Speaking English Speech Dataset

Explore ML Frameworks

All categories Trending ML Framework directory Insights