gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

58
/ 100
Established

Built on deep learning with modular architecture (CNN/feedforward → self-attention/LSTM → pooling layers), NISQA provides both single-ended quality assessment for transmitted speech and TTS naturalness prediction, with v2.0 adding multidimensional breakdowns (noisiness, coloration, discontinuity, loudness). Supports end-to-end training, finetuning, and transfer learning via PyTorch, configured through YAML files, plus includes a 14,000+ sample corpus with subjective quality labels for model development.

917 stars. No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 23 / 25

How are scores calculated?

Stars

917

Forks

150

Language

Python

License

MIT

Last pushed

Dec 01, 2024

Commits (30d)

0

Dependencies

13

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gabrielmittag/NISQA"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.