hayabhay/frogbase
Transform audio-visual content into navigable knowledge.
Archived. Orchestrates a complete pipeline linking media sources (yt_dlp, local files) through OpenAI Whisper transcription, SentenceTransformers embeddings, and hnswlib vector search, enabling semantic queries across multi-modal content. Includes both a Python API and a Streamlit UI, so developers can build search applications and non-technical users can run it locally without coding.
778 stars. No commits in the last 6 months.
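To make the pipeline concrete, here is a minimal sketch of the kind of flow FrogBase orchestrates, written directly against the underlying libraries named above (yt_dlp, openai-whisper, sentence-transformers, hnswlib). It is not the FrogBase API itself; the URL, model names, and index parameters are illustrative assumptions.

import yt_dlp
import whisper
import hnswlib
from sentence_transformers import SentenceTransformer

# 1. Download audio from a media source (hypothetical URL).
url = "https://www.youtube.com/watch?v=example"
with yt_dlp.YoutubeDL({"format": "bestaudio/best", "outtmpl": "audio.%(ext)s"}) as ydl:
    ydl.download([url])

# 2. Transcribe with Whisper; segments carry text plus start/end timestamps.
asr = whisper.load_model("base")
result = asr.transcribe("audio.webm")  # actual extension depends on the source
segments = [(s["text"].strip(), s["start"], s["end"]) for s in result["segments"]]

# 3. Embed each segment with a SentenceTransformers model.
encoder = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = encoder.encode([text for text, _, _ in segments])

# 4. Index the embeddings in hnswlib for approximate nearest-neighbour search.
index = hnswlib.Index(space="cosine", dim=embeddings.shape[1])
index.init_index(max_elements=len(segments), ef_construction=200, M=16)
index.add_items(embeddings, list(range(len(segments))))

# 5. Semantic query: embed the query and return the closest segments with timestamps.
query_vec = encoder.encode(["where do they explain vector search?"])
labels, distances = index.knn_query(query_vec, k=3)
for idx in labels[0]:
    text, start, end = segments[idx]
    print(f"[{start:.1f}s - {end:.1f}s] {text}")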
Stars: 778
Forks: 92
Language: Python
License: MIT
Category:
Last pushed: Oct 25, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/hayabhay/frogbase"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
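The same request in Python, for convenience (a minimal sketch; the response schema is not documented on this page, so the snippet simply prints the raw JSON):

import requests

resp = requests.get(
    "https://pt-edge.onrender.com/api/v1/quality/embeddings/hayabhay/frogbase",
    timeout=10,
)
resp.raise_for_status()
print(resp.json())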
Higher-rated alternatives
ssrajadh/sentrysearch
Semantic search over videos using Gemini Embedding 2.
zilliz-bootcamp/audio_search
This project uses PANNs for audio tagging and sound event detection, and finally gets audio...
kyegomez/Pegasus
PegasusX: The Future of Multimodal Embeddings 🦄 🦄
ashvardanian/SwiftSemanticSearch
Real-time on-device text-to-image and image-to-image Semantic Search with video stream camera...
tomfalainen/word_spotting
Semantic and Verbatim Word Spotting in Torch