Zhennor/Multimodal-Video-Retrieval-Engine-with-Vision-and-Text
A video search engine combining OCR, ASR, CLIP, Image Captioning, Object & Color Detection. It enables accurate retrieval based on text, speech, images, objects, and colors in video content.
26
/ 100
Experimental
No commits in the last 6 months.
No License
Stale 6m
No Package
No Dependents
Maintenance
0 / 25
Adoption
3 / 25
Maturity
8 / 25
Community
15 / 25
Stars
4
Forks
4
Language
—
License
—
Category
Last pushed
Jan 27, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/Zhennor/Multimodal-Video-Retrieval-Engine-with-Vision-and-Text"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.