yeyupiaoling/YeAudio
Audio toolkit for Python
Provides multi-format audio I/O (WAV, MP3, MP4 video tracks) with NumPy-based sample manipulation, supporting both batch and streaming operations via `slice_from_file()`. Includes data augmentation modules (SpecAugment, speed/volume perturbation, reverb, noise injection) and specialized processors like Voice Activity Detection (VAD) for speech-focused tasks including ASR, TTS, speaker verification, and audio classification pipelines.
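The idea behind reading a slice from a file, rather than loading the whole recording, can be sketched with the standard library alone. This is a minimal illustration and not YeAudio's implementation: `read_wav_slice` and `write_sine_wav` are hypothetical helper names, and the sketch assumes a mono 16-bit PCM WAV file, whereas the library itself is described as handling more formats.

```python
import math
import os
import struct
import tempfile
import wave


def write_sine_wav(path, seconds=1.0, sample_rate=16000, freq=440.0):
    """Write a mono 16-bit sine tone so the example has a file to read."""
    n = int(seconds * sample_rate)
    values = [
        int(0.5 * 32767 * math.sin(2 * math.pi * freq * i / sample_rate))
        for i in range(n)
    ]
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)  # 2 bytes = 16-bit samples
        wf.setframerate(sample_rate)
        wf.writeframes(struct.pack("<%dh" % n, *values))


def read_wav_slice(path, start, end):
    """Return (samples, sample_rate) for the [start, end) range in seconds.

    Hypothetical helper: only the requested frames are read from disk.
    """
    with wave.open(path, "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles mono 16-bit WAV files")
        sample_rate = wf.getframerate()
        first = max(0, int(start * sample_rate))
        last = min(wf.getnframes(), int(end * sample_rate))
        if first >= last:
            raise ValueError("empty or out-of-range slice")
        wf.setpos(first)  # seek to the first requested frame
        raw = wf.readframes(last - first)
    count = len(raw) // 2
    return list(struct.unpack("<%dh" % count, raw)), sample_rate


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        wav_path = os.path.join(tmp, "tone.wav")
        write_sine_wav(wav_path)
        samples, rate = read_wav_slice(wav_path, 0.25, 0.75)
        print(len(samples), rate)
```

Seeking before reading keeps memory use proportional to the slice length, which is what makes this pattern useful for long recordings.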
Used by 2 other packages. Available on PyPI.
Stars: 16
Forks: 3
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 05, 2025
Monthly downloads: 534
Commits (30d): 0
Dependencies: 8
Reverse dependents: 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/yeyupiaoling/YeAudio"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
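The same request can be made from Python. The sketch below is built only from the URL shown in the curl command; reading the path as category/owner/repo is an assumption inferred from that URL, `quality_url` and `fetch_quality` are hypothetical helper names, and because the response schema is not documented here the body is returned as raw text rather than parsed.

```python
from urllib.parse import quote
from urllib.request import Request, urlopen

# Base path taken from the curl example; everything after it is
# assumed to be category/owner/repo.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the endpoint and return the response body as text.

    The response format is not specified in this listing, so no
    parsing is attempted.
    """
    request = Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(quality_url("voice-ai", "yeyupiaoling", "YeAudio"))
```

With the stated limit of 100 keyless requests per day, caching the response locally is a reasonable precaution for scripts that run often.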
Related tools
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
Picovoice/leopard
On-device speech-to-text engine powered by deep learning
zaigie/FunSpeech
Out-of-the-box speech service for local, private deployment; quickly sets up FunASR and CosyVoice2/3 backends