pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
Leverages OpenAI's Whisper model within a multi-stage NLP pipeline that chunks video into manageable audio segments, transcribes via transformer inference, then applies spell-correction (Neuspell), sentence boundary detection (pySBD), and keyword extraction (YAKE) to produce publication-ready text. Integrates with Hugging Face's transformers library and supports both CLI and Python API interfaces, with optional Google Colab deployment for GPU-accelerated processing without local hardware requirements.
220 stars and 19 monthly downloads. No commits in the last 6 months. Available on PyPI.
Stars
220
Forks
29
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Oct 29, 2024
Monthly downloads
19
Commits (30d)
0
Dependencies
25
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/pszemraj/vid2cleantxt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a given YouTube video....
MatteoFasulo/Whisper-TikTok
From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model
ArthurFDLR/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper
chenyme/Chenyme-AAVT
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
JJWRoeloffs/transcribe_align_textgrid
A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids...