AbhaySingh71/Multimodal-Agentic-Assistant-Clara
Clara: An agentic multimodal AI assistant that can see through your webcam, listen to your voice, think with Gemini, and speak back using ElevenLabs. Built with LangGraph, OpenCV, Groq, and Gradio.
No commits in the last 6 months.
Stars
1
Forks
—
Language
Python
License
MIT
Category
Last pushed
Aug 29, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AbhaySingh71/Multimodal-Agentic-Assistant-Clara"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
asiff00/On-Device-Speech-to-Speech-Conversational-AI
This is an on-CPU real-time conversational system for two-way speech communication with AI...
VideotronicMaker/LM-Studio-Voice-Conversation
Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for...
syntithenai/hermod
voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by...
voice-engine/make-a-smart-speaker
A collection of resources to make a smart speaker
FR33TR1ST/VoiceAssistant
A VoiceAsistant with WhisperAI speech recognition