nezhar/speech-condenser
A tool for summarizing dialogues from videos or audio
Implements a containerized multi-stage pipeline combining audio extraction, speaker diarization (via PyAnnote), speech-to-text transcription, and abstractive summarization using Hugging Face models. Supports both local video files and YouTube URLs, with each processing stage isolated in Docker/Podman containers for reproducibility. Outputs speaker-attributed dialogue summaries organized by speaker turns.
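The speaker-attribution step described above can be illustrated with a minimal sketch. It is not the repository's code. It assumes the diarization stage yields timed speaker segments and the transcription stage yields timed text segments, and every name in it (`SpeakerSegment`, `TextSegment`, `assign_speakers`, `group_turns`) is hypothetical. Each transcript segment goes to the speaker whose segment overlaps it most, and consecutive segments from the same speaker are merged into one turn. The summarization stage is left out.

```python
from dataclasses import dataclass


@dataclass
class SpeakerSegment:
    """Hypothetical diarization output: who speaks during [start, end)."""
    start: float
    end: float
    speaker: str


@dataclass
class TextSegment:
    """Hypothetical transcription output: what is said during [start, end)."""
    start: float
    end: float
    text: str


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(diarization, transcript):
    """Label each transcript segment with the speaker that overlaps it most."""
    attributed = []
    for t in transcript:
        best_speaker, best_overlap = "UNKNOWN", 0.0
        for d in diarization:
            shared = overlap(d.start, d.end, t.start, t.end)
            if shared > best_overlap:
                best_speaker, best_overlap = d.speaker, shared
        attributed.append((best_speaker, t.text))
    return attributed


def group_turns(attributed):
    """Merge consecutive segments from the same speaker into one turn."""
    turns = []
    for speaker, text in attributed:
        if turns and turns[-1][0] == speaker:
            turns[-1] = (speaker, turns[-1][1] + " " + text)
        else:
            turns.append((speaker, text))
    return turns


# Made-up sample data, for illustration only.
diarization = [
    SpeakerSegment(0.0, 4.0, "SPEAKER_00"),
    SpeakerSegment(4.0, 9.0, "SPEAKER_01"),
]
transcript = [
    TextSegment(0.2, 2.0, "Hello there."),
    TextSegment(2.1, 3.9, "How are you?"),
    TextSegment(4.2, 8.5, "Fine, thanks."),
]

turns = group_turns(assign_speakers(diarization, transcript))
for speaker, text in turns:
    print(f"{speaker}: {text}")
```

In a pipeline like the one described, each turn's text would then go to the summarization model. Grouping by turn before summarizing keeps the output attributed to speakers.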
No commits in the last 6 months.
Stars: 84
Forks: 10
Language: Python
License: —
Category:
Last pushed: Aug 29, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/nezhar/speech-condenser"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
byjlw/video-analyzer
Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition
XnneHangLab/XnneHangLab
A subtitle extractor that can't chat is not a good Bilibili downloader~
harry0703/AudioNotes
Quickly extract audio and video content and organize it into structured Markdown notes
bakaburg1/minutemaker
Generate meeting minutes from an audio recording or a transcript using speech-to-text and LLMs.
yufan-aslp/AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party...