zilliz-bootcamp/audio_search
This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.
Leverages pretrained PANNs models for multi-label audio classification and event detection, converting audio files into fixed-size embeddings that capture acoustic patterns. Integrates MySQL for metadata storage alongside Milvus's vector database to enable efficient similarity search across large audio collections via REST APIs. Provides a complete web-based pipeline supporting batch audio ingestion and real-time query matching for audio retrieval tasks.
No commits in the last 6 months.
Stars
28
Forks
7
Language
Python
License
MIT
Category
Last pushed
Aug 10, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/zilliz-bootcamp/audio_search"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ssrajadh/sentrysearch
Semantic search over videos using Gemini Embedding 2.
hayabhay/frogbase
Transform audio-visual content into navigable knowledge.
kyegomez/Pegasus
PegasusX: The Future of Multimodal Embeddings 🦄 🦄
ashvardanian/SwiftSemanticSearch
Real-time on-device text-to-image and image-to-image Semantic Search with video stream camera...
tomfalainen/word_spotting
Semantic and Verbatim Word Spotting in Torch