YaoFANGUK/video-subtitle-extractor

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

/ 100

Emerging

Combines PaddlePaddle-based OCR with keyframe extraction and text region detection to process videos in three modes—fast (lightweight model), automatic (adaptive), and precise (frame-by-frame)—while supporting 87 languages and batch processing without external APIs. Integrates VideoSubFinder as the subtitle detection engine on Windows/Linux, supports GPU acceleration via CUDA/DirectML/ONNX, and includes post-processing features like watermark removal and typo correction via JSON configuration mapping.

8,505 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

8,505

Forks

871

Language

Python

License

Apache-2.0

Higher-rated alternatives

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

oseiskar/autosubsync

Automatically synchronize subtitles with audio using machine learning

FL33TW00D/whisper-turbo

Cross-Platform, GPU Accelerated Whisper 🏎️

machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

Explore Voice AI Tools

All categories Trending Voice AI directory Insights