apple/ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

36
/ 100
Emerging

Synthesizes 650+ hours of first-order ambisonics audio by augmenting LibriSpeech recordings with 200k+ simulated acoustic conditions across 8k+ synthetic rooms. Includes rich spatial labels for source position, speaking direction, room acoustics, and geometry, plus optional distractor noise tracks. Provides PyTorch dataloader and Parquet-based metadata schema for straightforward integration into audio ML pipelines.

125 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 10 / 25

How are scores calculated?

Stars

125

Forks

8

Language

License

Last pushed

Oct 25, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/apple/ml-spatial-librispeech"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.