jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!
Supports multi-language transcript retrieval with priority-based fallback, subtitle translation, and metadata filtering to distinguish between manually-created and auto-generated captions. Extracts transcripts by directly parsing YouTube's client-side data structures rather than relying on browser automation, returning structured objects with timing information and optional HTML formatting preservation. Provides both programmatic Python API and CLI interfaces for integration into data pipelines or content analysis workflows.
7,078 stars and 27,416,843 monthly downloads. Used by 25 other packages. Actively maintained with 3 commits in the last 30 days. Available on PyPI.
Stars
7,078
Forks
734
Language
Python
License
MIT
Category
Last pushed
Mar 09, 2026
Monthly downloads
27,416,843
Commits (30d)
3
Dependencies
2
Reverse dependents
25
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jdepoix/youtube-transcript-api"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
MatteoFasulo/Whisper-TikTok
From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model
pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
chenyme/Chenyme-AAVT
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
JJWRoeloffs/transcribe_align_textgrid
A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids...
antor44/livestream_video
playlist4whisper manages media streams playlists for livestream_video.sh, plays media, and...