r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

/ 100

Established

Provides core DSP utilities for audio feature extraction, vocoding, and parameterization alongside PyTorch-based autograd modules for neural network training. Built on NumPy with optional PyTorch integration, it handles mel-spectrogram conversion, fundamental frequency estimation, and various vocoder implementations inspired by Merlin and Librosa.

399 stars and 3,871 monthly downloads. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 19 / 25

Maturity 25 / 25

Community 22 / 25

How are scores calculated?

Stars

399

Forks

Language

Python

License

—

Related tools

pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to automatically...

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Amey-Thakur/DEEPFAKE-AUDIO

🎙️ Deepfake Audio – A neural voice cloning studio powered by SV2TTS technology.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights