carlosholivan/audiolm-google-torch
Implementation of the AudioLM model by Google in PyTorch
AudioLM helps researchers and sound designers generate novel audio, like speech or music, directly from existing audio inputs. It takes a piece of sound as input and creates new, high-quality audio that matches the style or characteristics of the original. This is useful for anyone working with audio synthesis or creative sound generation.
No commits in the last 6 months.
Use this if you need to generate realistic and contextually relevant audio waveforms from a given audio sample, for tasks such as speech synthesis or sound design.
Not ideal if you're looking to simply compress audio or analyze existing audio for features without generating new content.
Stars: 8
Forks: 1
Language: not listed
License: not listed
Category:
Last pushed: Dec 18, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/carlosholivan/audiolm-google-torch"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
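For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not the service's official client: the helper names `quality_url` and `fetch_quality` are hypothetical, the path layout (category, owner, repo) is inferred from the curl command above, and the response is assumed to be JSON, which the page does not confirm.

```python
import json
import urllib.request

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    The category/owner/repo layout is inferred from that one URL (assumption).
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the endpoint returns a JSON body; no API key is sent,
    so the anonymous limit of 100 requests/day applies.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("voice-ai", "carlosholivan", "audiolm-google-torch")
# print(data)
```

Keeping URL construction separate from the network call makes it easy to check the path before spending any of the daily request quota.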
Higher-rated alternatives
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz...
drethage/speech-denoising-wavenet
A neural network for end-to-end speech denoising
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
iver56/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
lmnt-com/wavegrad
A fast, high-quality neural vocoder.