Aavato-c/AI-voice-annotation-from-webcam
A set of scripts to capture pictures from a webcam feed and then getting the description of them from openai. After that, convert the text to speech using Elevenlabs.
No commits in the last 6 months.
Stars
2
Forks
—
Language
Python
License
MIT
Category
Last pushed
Oct 02, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Aavato-c/AI-voice-annotation-from-webcam"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
met4citizen/TalkingHead
Talking Head (3D): A JavaScript class for real-time lip-sync using full-body 3D avatars.
livekit/livekit
End-to-end realtime stack for connecting humans and AI
Henry-23/VideoChat
实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable...
zslrmhb/Omniverse-Virtual-Assisstant
Audio2Face Avatar with Riva SDK functionality
dmisol/flexatar-virtual-webcam
Personalized Virtual Webcam for WebRTC