chenyme/Chenyme-AAVT

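This is a fully automatic (audio) video translation project. It uses Whisper to recognize speech, an AI large language model to translate the subtitles, and finally merges the subtitles with the video to produce the translated video.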

47 / 100 (Emerging)

Supports local model deployment and multiple LLM backends (ChatGPT, Claude, Gemini, DeepSeek) for translation, with voice activity detection (VAD) and GPU acceleration via CUDA. Beyond subtitling, it can auto-generate marketing content and offers a subtitle-only translation workflow. Built with a Streamlit WebUI and available via Docker or Colab for cloud deployment. It also supports custom fine-tuned Whisper models and word-level segmentation optimization.

2,973 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25
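
The four sub-scores listed above add up to the overall figure shown for this project. The following is a minimal sketch of that arithmetic only; it assumes a plain sum of four categories capped at 25 each, which matches the numbers on this page but is not confirmed as the site's actual scoring method. The names `SUB_SCORES` and `MAX_PER_CATEGORY` are illustrative.

```python
# Sub-scores as listed on this page (each out of 25).
SUB_SCORES = {
    "Maintenance": 2,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 19,
}
MAX_PER_CATEGORY = 25  # assumption: every category shares the same cap

# Assumption: the overall score is a plain sum of the sub-scores.
total = sum(SUB_SCORES.values())
maximum = MAX_PER_CATEGORY * len(SUB_SCORES)

print(f"{total} / {maximum}")
```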


Stars: 2,973
Forks: 239
Language: Python
License: MIT
Last pushed: Apr 07, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/chenyme/Chenyme-AAVT"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
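
The same request can be made from Python using only the standard library. This is a rough sketch under stated assumptions: the URL pattern (category, owner, repository) is inferred from the single curl example above, and the response is assumed to be JSON, which this page does not confirm. The helper names `build_quality_url` and `fetch_quality` are hypothetical, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the path layout is inferred from one example."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the endpoint returns a JSON body. No API key is sent,
    so the request counts against the keyless daily limit.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("voice-ai", "chenyme", "Chenyme-AAVT")` would issue the same request as the curl command. With the keyless limit of 100 requests per day, it may be worth caching the result locally rather than fetching on every run.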