reazon-research/ReazonSpeech
Massive open Japanese speech corpus
Provides multiple pre-trained ASR models (Kaldi2/sherpa-onnx, FastConformer-RNNT/NeMo, Conformer-Transducer/ESPnet) ranging from 120M–619M parameters, plus audio-visual speech recognition following Hugging Face Transformers conventions. Includes bilingual ja-en language detection, evaluation tools, and utilities for harvesting Japanese TV streams to expand the corpus.
373 stars.
Stars
373
Forks
34
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 19, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/reazon-research/ReazonSpeech"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
ynop/audiomate
Python library for handling audio datasets.
davidmartinrius/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
EgorLakomkin/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies