antor44/livestream_video
playlist4whisper manages media-stream playlists for livestream_video.sh, plays media, and transcribes audio with AI, offering configurable timeshift, multi-instance and multi-user support, translation, and TTS. Compatible with Linux, Windows (WSL2), and macOS.
Uses whisper.cpp with integrated Voice Activity Detection (Silero model) for efficient local transcription, and supports real-time translation via the Google Gemini API with configurable context levels (0-3) that trade literal accuracy against context-aware fluency. A Python GUI wraps the bash scripts that manage media playback through VLC/streamlink/yt-dlp, enabling subtitle generation, timeshift recording, and session transcript export.
Stars: 16
Forks: 5
Language: Python
License: GPL-3.0
Category:
Last pushed: Mar 20, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/antor44/livestream_video"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
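The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns JSON (the page does not document the response shape) and that the three trailing path segments are a category, a repository owner, and a repository name, as the curl example suggests. The helper names `build_quality_url` and `fetch_quality` are hypothetical and are not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL in the same shape as the curl example.

    Assumption: the path is <category>/<owner>/<repo>.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response (hypothetical helper).

    Assumption: the body is JSON; the keys it contains are not documented here.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "antor44", "livestream_video")` would issue the same GET request as the curl command. Because the keyless tier is limited to 100 requests/day, caching the result locally is advisable when polling repeatedly.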
Higher-rated alternatives
jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a given YouTube video....
MatteoFasulo/Whisper-TikTok
From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model
pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
ArthurFDLR/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper
chenyme/Chenyme-AAVT
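This is a fully automated (audio) video translation project. It uses Whisper for speech recognition and an AI large language model to translate the subtitles, then merges the subtitles into the video to produce the translated video.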