pyvideotrans and Auto-Synced-Translated-Dubs
These two tools are competitors, as both aim to provide an end-to-end solution for translating video language and generating synchronized dubbed audio and subtitles using AI voice services.
About pyvideotrans
jianchang512/pyvideotrans
Translate the video from one language to another and embed dubbing & subtitles.
Combines speech recognition, LLM-based translation, and text-to-speech synthesis in a unified pipeline with support for speaker diarization and multi-role dubbing. Integrates pluggable ASR models (Faster-Whisper, Qwen, WhisperX), translation backends (DeepSeek, ChatGPT, Ollama), and TTS engines (Edge-TTS, F5-TTS, CosyVoice), enabling both cloud API and fully local offline workflows. Provides interactive editing checkpoints throughout the translation chain plus a CLI interface for headless batch processing and server deployment.
About Auto-Synced-Translated-Dubs
ThioJoe/Auto-Synced-Translated-Dubs
Automatically translates the text of a video based on a subtitle file, and then uses AI voice services to create a new dubbed & translated audio track where the speech is synced using the subtitle's timings.
Supports multiple translation engines (Google Translate, DeepL) and TTS providers (Google Cloud, Azure, Eleven Labs) with configurable voice selection per language. Uses subtitle timings to calculate required speech duration, then either time-stretches synthesized audio or performs two-pass synthesis to match original pacing—Azure TTS can specify duration natively, eliminating stretching. Includes companion tools for batch YouTube operations: adding audio tracks to video files, translating titles/descriptions, and syncing auto-generated transcripts via the YouTube API.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work