GeminiASR and gemini-speech2srt
These are **competitors**: both tools independently convert audio/video to SRT subtitles using the Google Gemini API, performing the same transcription-to-subtitle function without dependency on each other.
About GeminiASR
cxyfer/GeminiASR
A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.
Supports batch processing with configurable chunking (default 900-second segments) and multi-threaded parallel transcription, with optional API key rotation across multiple Google keys to bypass rate limits. Features a flexible four-tier configuration system (CLI args > environment variables > TOML files > defaults) and supports custom prompts to guide transcription quality. Compatible with OpenAI-compatible endpoints and proxy services like gemini-balance, enabling use of alternative model providers while maintaining the same SRT output interface.
About gemini-speech2srt
jianchang512/gemini-speech2srt
使用 Gemini AI 转写音视频为 SRT 字幕
Implements intelligent audio segmentation using VAD (Voice Activity Detection) to split media into chunks before sending each to Gemini AI, ensuring precise subtitle timing that avoids the axis drift occurring with full-file processing. Provides both a Windows GUI executable and cross-platform Python deployment, with configurable prompts and proxy support for regions where Gemini access is restricted.
Scores updated daily from GitHub, PyPI, and npm data. How scores work