kaya70875/ytfetcher
⚡ Build structured YouTube datasets at scale — effortlessly fetch transcripts and rich metadata for NLP, ML, and AI workflows.
Provides pluggable fetching strategies (channel, playlist, video IDs, search queries) with concurrent transcript retrieval, SQLite caching, and multi-language fallback support. Built-in filtering on metadata (duration, view count, title) reduces processing overhead by discarding videos before transcript extraction. Exports structured data to CSV, JSON, or TXT, with optional comment fetching and an option to accept only manually created transcripts. Usable through both a CLI and a Python API.
Available on PyPI.
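The metadata pre-filtering idea described above can be illustrated with a minimal sketch. This is not ytfetcher's actual API: `VideoMeta`, `prefilter`, and the thresholds below are hypothetical names and values chosen for illustration only. The point is that cheap checks on duration, view count, and title run first, so the expensive transcript step only sees the videos that pass.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class VideoMeta:
    """Hypothetical metadata record; not a ytfetcher type."""
    video_id: str
    title: str
    duration_s: int
    view_count: int


Predicate = Callable[[VideoMeta], bool]


def prefilter(videos: Iterable[VideoMeta], predicates: Iterable[Predicate]) -> List[VideoMeta]:
    """Keep only the videos that satisfy every predicate."""
    checks = list(predicates)
    return [v for v in videos if all(check(v) for check in checks)]


# Made-up sample data for the sketch.
videos = [
    VideoMeta("a1", "Intro to RAG", 620, 15000),
    VideoMeta("b2", "Channel trailer", 45, 90000),
    VideoMeta("c3", "RAG deep dive", 3600, 800),
]

selected = prefilter(
    videos,
    [
        lambda v: v.duration_s >= 60,        # drop very short clips
        lambda v: v.view_count >= 1000,      # drop low-traffic videos
        lambda v: "rag" in v.title.lower(),  # keep on-topic titles
    ],
)

# Only the surviving IDs would be passed on to transcript retrieval.
print([v.video_id for v in selected])
```

Expressing each filter as a plain predicate keeps the filters composable; for the library's real filter helpers, consult the project's own documentation.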
Stars: 62
Forks: 10
Language: Python
License: —
Category:
Last pushed: Mar 08, 2026
Monthly downloads: 524
Commits (30d): 0
Dependencies: 9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/kaya70875/ytfetcher"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
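The same request can be made from Python using only the standard library. The sketch below assumes the endpoint returns JSON, which the page does not state; `build_url` and `fetch_quality` are hypothetical helper names, and no API key handling is shown because the key mechanism is not described here.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/rag"


def build_url(owner: str, repo: str) -> str:
    """Build the per-repository endpoint URL, percent-encoding each path segment."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request; mind the daily request limit.
    print(fetch_quality("kaya70875", "ytfetcher"))
```

Keeping URL construction separate from the network call makes the former easy to check offline and keeps accidental requests from counting against the daily quota.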
Related tools
NVIDIA-AI-Blueprints/video-search-and-summarization
Blueprint for ingesting massive volumes of live or archived videos and extracting insights for...
HKUDS/VideoRAG
[KDD'2026] "VideoRAG: Chat with Your Videos"
jonaskahn/asktube
AskTube - An AI-powered YouTube video summarizer and QA assistant powered by Retrieval Augmented...
wassim249/YT-Navigator
YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel...
video-db/StreamRAG
Video Search and Streaming Agent 🕵️‍♂️