yeyupiaoling/YeAudio

Python的音频工具

/ 100

Established

Provides multi-format audio I/O (WAV, MP3, MP4 video tracks) with NumPy-based sample manipulation, supporting both batch and streaming operations via `slice_from_file()`. Includes data augmentation modules (SpecAugment, speed/volume perturbation, reverb, noise injection) and specialized processors like Voice Activity Detection (VAD) for speech-focused tasks including ASR, TTS, speaker verification, and audio classification pipelines.

Used by 2 other packages. Available on PyPI.

Maintenance 6 / 25

Adoption 14 / 25

Maturity 18 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Category

funasr-speech-recognition

Last pushed

Dec 05, 2025

Monthly downloads

534

Commits (30d)

Dependencies

Reverse dependents

GitHub PyPI

FunASR Speech Recognition · 46 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/yeyupiaoling/YeAudio"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Related tools

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

Picovoice/leopard

On-device speech-to-text engine powered by deep learning

zaigie/FunSpeech

开箱即用的本地私有化部署语音服务，快速搭建FunASR与CosyVoice2/3后端

Explore Voice AI Tools

All categories Trending Voice AI directory Insights