cxyfer/GeminiASR

A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.

36
/ 100
Emerging

Supports batch processing with configurable chunking (default 900-second segments) and multi-threaded parallel transcription, with optional API key rotation across multiple Google keys to bypass rate limits. Features a flexible four-tier configuration system (CLI args > environment variables > TOML files > defaults) and supports custom prompts to guide transcription quality. Compatible with OpenAI-compatible endpoints and proxy services like gemini-balance, enabling use of alternative model providers while maintaining the same SRT output interface.

No Package No Dependents
Maintenance 6 / 25
Adoption 6 / 25
Maturity 9 / 25
Community 15 / 25

How are scores calculated?

Stars

17

Forks

5

Language

Python

License

MIT

Last pushed

Jan 02, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/cxyfer/GeminiASR"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.