r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Provides core DSP utilities for audio feature extraction, vocoding, and parameterization alongside PyTorch-based autograd modules for neural network training. Built on NumPy with optional PyTorch integration, it handles mel-spectrogram conversion, fundamental frequency estimation, and various vocoder implementations inspired by Merlin and Librosa.
399 stars and 3,871 monthly downloads. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Stars
399
Forks
71
Language
Python
License
—
Category
Last pushed
Jun 29, 2024
Monthly downloads
3,871
Commits (30d)
0
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/r9y9/nnmnkwii"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
Amey-Thakur/DEEPFAKE-AUDIO
🎙️ Deepfake Audio – A neural voice cloning studio powered by SV2TTS technology.