xndien2004/LLM_Powered_Video_Search
[SOICT 2024] LLM-Powered Video Search: A Comprehensive Multimedia Retrieval System
Combines CLIP vision embeddings with FAISS indexing and TF-IDF for hybrid multimodal search across text (ASR/OCR/captions), images, and metadata attributes. LLM integration processes natural language queries to contextually route requests across these retrieval modalities. Built on Django with TransnetV2 for automated keyframe extraction and supports Docker deployment for the complete pipeline.
No commits in the last 6 months.
Stars
65
Forks
1
Language
Jupyter Notebook
License
—
Category
Last pushed
Aug 16, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/xndien2004/LLM_Powered_Video_Search"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
Sinapsis-AI/sinapsis-chatbots
Monorepo for sinapsis templates supporting LLM based Agents
aimclub/ProtoLLM
Framework for prototyping of LLM-based applications
Azure-Samples/azureai-foundry-finetuning-raft
A recipe that will walk you through using either Meta Llama 3.1 405B or OpenAI GPT-4o deployed...
pkargupta/taxoadapt
Dynamically constructs and adapts an LLM-generated taxonomy to a given corpus across multiple dimensions.