jianchang512/gemini-speech2srt

使用 Gemini AI 转写音视频为 SRT 字幕

/ 100

Emerging

Implements intelligent audio segmentation using VAD (Voice Activity Detection) to split media into chunks before sending each to Gemini AI, ensuring precise subtitle timing that avoids the axis drift occurring with full-file processing. Provides both a Windows GUI executable and cross-platform Python deployment, with configurable prompts and proxy support for regions where Gemini access is restricted.

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 18 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Compare

gemini-speech2srt and GeminiASR

Higher-rated alternatives

mozilla-ai/document-to-podcast

Blueprint by Mozilla.ai for generating podcasts from documents using local AI

iMicknl/azure-podcast-generator

Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.

BandarLabs/gitpodcast

Convert any git repository into an engaging podcast

puntorigen/podcast_tts

A class for generating realistic audio (TTS) for podcasts and dialogues.

ismailperim/reportcast

Transform reports into podcasts with AI - Nobody reads your reports. But they'll listen.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights