nezhar/speech-condenser

A tool for summarizing dialogues from videos or audio

/ 100

Emerging

Implements a containerized multi-stage pipeline combining audio extraction, speaker diarization (via PyAnnote), speech-to-text transcription, and abstractive summarization using Hugging Face models. Supports both local video files and YouTube URLs, with each processing stage isolated in Docker/Podman containers for reproducibility. Outputs speaker-attributed dialogue summaries organized by speaker turns.

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

byjlw/video-analyzer

Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition

XnneHangLab/XnneHangLab

不会聊天的字幕提取器不是一个好 B 站下载器~

harry0703/AudioNotes

快速提取音视频内容，整理成一份结构化的markdown笔记

bakaburg1/minutemaker

Generate meeting minutes starting from an audio recording or a transcripts using speech-to-text and LLMs.

yufan-aslp/AliMeeting

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights