ynop/audiomate

Python library for handling audio datasets.

/ 100

Established

Provides unified access to 20+ audio datasets (ESC-50, LibriSpeech, Mozilla Common Voice, etc.) through a generic Corpus API, with built-in downloaders and format readers for Kaldi, DeepSpeech, and Wav2Letter. Includes dataset manipulation tools (validation, splitting, filtering, merging) and feature extraction pipelines for preparing audio data for machine learning workflows.

138 stars and 252 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 16 / 25

Maturity 25 / 25

Community 19 / 25

How are scores calculated?

Stars

138

Forks

Language

Python

License

MIT

Category

speech-corpora-datasets

Last pushed

Jul 06, 2023

Monthly downloads

252

Commits (30d)

Dependencies

GitHub PyPI

Speech Corpora Datasets · 63 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ynop/audiomate"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Related tools

davidmartinrius/speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...

common-voice/cv-dataset

Metadata and versioning details for the Common Voice dataset

reazon-research/ReazonSpeech

Massive open Japanese speech corpus

EgorLakomkin/KTSpeechCrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Explore Voice AI Tools

All categories Trending Voice AI directory Insights