tsurumeso/vocal-remover
Vocal Remover using Deep Neural Networks
Implements a U-Net convolutional architecture trained on paired instrumental/vocal audio for source separation, with support for CPU and GPU inference. Includes Test-Time Augmentation (TTA) for improved separation quality and optional post-processing that masks instrumental output based on vocal energy detection. Built on PyTorch with a trainable pipeline allowing users to fine-tune models on custom datasets.
1,744 stars. No commits in the last 6 months.
Stars
1,744
Forks
254
Language
Python
License
MIT
Category
Last pushed
Jul 23, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tsurumeso/vocal-remover"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
lucidrains/HS-TasNet
Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation using Hybrid...
justinsalamon/scaper
A library for soundscape synthesis and augmentation
stemrollerapp/stemroller
Isolate vocals, drums, bass, and other instrumental stems from any song
thepirat000/spleeter-api
Audio separation API using Spleeter from Deezer
gitmylo/audio-webui
A webui for different audio related Neural Networks