Akhilesh-00/Multimodal-Audio-RAG-CrossLingual-QA
An AI that listens to a Tamil song and lets you ask questions about its lyrics in English. AudioRAG uses a cross-lingual Retrieval-Augmented Generation (RAG) pipeline, powered by Whisper and Gemini, to provide accurate answers directly from the audio content.
Stars
2
Forks
—
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Jan 04, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Akhilesh-00/Multimodal-Audio-RAG-CrossLingual-QA"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
talkdai/dialog
RAG LLM Ops App for easy deployment and testing
michelderu/build-your-own-rag-chatbot
Workshop to build and deploy your own Chat Agent using Retrieval Augmented Generation with Astra DB
ronantakizawa/cacheaugmentedgeneration
A Demo of Cache-Augmented Generation (CAG) in an LLM
nicolaric/rahmenabkommen-gpt
"Ask your question about the new framework agreement between Switzerland and the EU." Answers...
ARUNAGIRINATHAN-K/pdf-RAG-question-answering
Upload PDFs → ask questions → get grounded answers.