cxyfer/GeminiASR

A Python tool that uses the Google Gemini API to transcribe video or audio files into SRT subtitle files.
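For readers unfamiliar with the output format, here is a minimal sketch of how one SRT cue is laid out: an index, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` time range, and the text. The helper names `srt_timestamp` and `srt_cue` are hypothetical and written for illustration; they are not taken from GeminiASR.

```python
def srt_timestamp(total_ms: int) -> str:
    """Format a millisecond offset as an SRT timestamp (HH:MM:SS,mmm)."""
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{ms:03d}"


def srt_cue(index: int, start_ms: int, end_ms: int, text: str) -> str:
    """Build one SRT cue: index line, time range line, then the subtitle text."""
    time_range = f"{srt_timestamp(start_ms)} --> {srt_timestamp(end_ms)}"
    return f"{index}\n{time_range}\n{text}\n"


if __name__ == "__main__":
    # Hypothetical sample cue, not real transcription output.
    print(srt_cue(1, 0, 2500, "Hello, world."))
```

Cues in a full file are separated by a blank line, and indices normally start at 1.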


Emerging

Supports batch processing with configurable chunking (900-second segments by default) and multi-threaded parallel transcription, with optional rotation across multiple Google API keys to work around rate limits. Configuration follows a four-tier precedence (CLI arguments > environment variables > TOML files > defaults), and custom prompts can guide transcription quality. It also works with OpenAI-compatible endpoints and proxy services such as gemini-balance, so alternative model providers can be used while the SRT output interface stays the same.
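The ideas in this description (layered configuration, fixed-length chunking, and key rotation) can be sketched roughly as follows. This is an illustration under stated assumptions, not GeminiASR's actual code: the function names, the `workers` setting, and the round-robin rotation policy are all hypothetical. Only the 900-second default and the precedence order come from the description.

```python
from collections import ChainMap
from itertools import cycle

# Hypothetical defaults; only the 900-second chunk length is from the description.
DEFAULTS = {"chunk_seconds": 900, "workers": 4}


def resolve_config(cli, env, toml_file, defaults=DEFAULTS):
    """Merge four layers; earlier layers win (CLI > env > TOML > defaults)."""
    # Drop unset (None) values so they do not shadow lower-priority layers.
    layers = [
        {key: value for key, value in layer.items() if value is not None}
        for layer in (cli, env, toml_file)
    ]
    return dict(ChainMap(*layers, defaults))


def plan_chunks(duration, chunk_seconds):
    """Return (start, end) pairs in whole seconds covering [0, duration)."""
    return [
        (start, min(start + chunk_seconds, duration))
        for start in range(0, duration, chunk_seconds)
    ]


def assign_keys(chunks, api_keys):
    """Pair each chunk with an API key, cycling through the keys round-robin."""
    return list(zip(chunks, cycle(api_keys)))


if __name__ == "__main__":
    config = resolve_config(
        cli={"chunk_seconds": None, "workers": 8},   # only workers set on the CLI
        env={"chunk_seconds": 600},                  # chunk length from environment
        toml_file={"chunk_seconds": 300, "workers": 2},
    )
    chunks = plan_chunks(2000, config["chunk_seconds"])
    for chunk, key in assign_keys(chunks, ["key-a", "key-b"]):
        print(chunk, key)
```

Each (chunk, key) pair could then be handed to a worker thread, for example through `concurrent.futures.ThreadPoolExecutor`, with each chunk's start offset added to its cue timestamps when the per-chunk results are merged into one SRT file.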

No package, no dependents

Maintenance 6 / 25

Adoption 6 / 25

Maturity 9 / 25

Community 15 / 25


Language: Python

License: MIT


Higher-rated alternatives

mozilla-ai/document-to-podcast

Blueprint by Mozilla.ai for generating podcasts from documents using local AI

iMicknl/azure-podcast-generator

Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.

BandarLabs/gitpodcast

Convert any git repository into an engaging podcast

puntorigen/podcast_tts

A class for generating realistic audio (TTS) for podcasts and dialogues.

ismailperim/reportcast

Transform reports into podcasts with AI - Nobody reads your reports. But they'll listen.
