Ejb503/ai-voice-generation
A free AI voice generator demo that uses Deepgram and Groq to turn text into speech. This repository contains a simple and efficient implementation using NextJS. Contributions welcome.
Implements a full voice chat pipeline, streamed end-to-end in the browser: speech recognition via the Web Speech API (SpeechRecognition), LLM inference through Groq's Llama 3, and text-to-speech (TTS) via Deepgram. The transcript from speech recognition feeds directly into a prompt-based LLM call, so behavior can be customized through system prompts and context manipulation. Built on NextJS with an Express backend, it serves as a cost-effective alternative to proprietary voice APIs, and includes notes on swapping components (e.g., Whisper for SpeechRecognition) for production stability.
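The chain described above (transcript, then LLM, then TTS) can be sketched roughly as follows. This is an illustrative sketch, not the repository's code: every type and function name here (ChatMessage, HttpRequest, Stages, buildMessages, groqRequest, deepgramSpeakRequest, runTurn) is hypothetical, and the default model ids "llama3-8b-8192" and "aura-asteria-en" are assumptions that may not match what the project uses or what the providers currently offer. The request builders assume Groq's OpenAI-compatible chat completions endpoint and Deepgram's speak endpoint.

```typescript
// Hypothetical names throughout; a sketch under stated assumptions.
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

interface HttpRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// System prompt first, then prior context, then the new transcript.
function buildMessages(
  systemPrompt: string,
  history: ChatMessage[],
  transcript: string
): ChatMessage[] {
  return [
    { role: "system", content: systemPrompt },
    ...history,
    { role: "user", content: transcript.trim() },
  ];
}

// Describes a POST to Groq's OpenAI-compatible endpoint (model id is an assumption).
function groqRequest(
  apiKey: string,
  messages: ChatMessage[],
  model = "llama3-8b-8192"
): HttpRequest {
  return {
    url: "https://api.groq.com/openai/v1/chat/completions",
    headers: { Authorization: `Bearer ${apiKey}`, "Content-Type": "application/json" },
    body: JSON.stringify({ model, messages }),
  };
}

// Describes a POST to Deepgram's text-to-speech endpoint (voice id is an assumption).
function deepgramSpeakRequest(
  apiKey: string,
  text: string,
  voice = "aura-asteria-en"
): HttpRequest {
  return {
    url: `https://api.deepgram.com/v1/speak?model=${encodeURIComponent(voice)}`,
    headers: { Authorization: `Token ${apiKey}`, "Content-Type": "application/json" },
    body: JSON.stringify({ text }),
  };
}

// Each stage is injected, so a provider can be swapped without touching the chain.
interface Stages {
  complete: (messages: ChatMessage[]) => Promise<string>; // LLM reply text
  speak: (text: string) => Promise<ArrayBuffer>; // synthesized audio bytes
}

// One conversational turn: transcript in, reply text and audio out.
async function runTurn(
  stages: Stages,
  systemPrompt: string,
  history: ChatMessage[],
  transcript: string
): Promise<{ reply: string; audio: ArrayBuffer; history: ChatMessage[] }> {
  const messages = buildMessages(systemPrompt, history, transcript);
  const reply = await stages.complete(messages);
  const audio = await stages.speak(reply);
  // Carry the exchange forward (minus the system prompt) as context for the next turn.
  const nextHistory: ChatMessage[] = [
    ...messages.slice(1),
    { role: "assistant", content: reply },
  ];
  return { reply, audio, history: nextHistory };
}
```

In this sketch the transcript is just a string, so it could come from SpeechRecognition in the browser or from a server-side transcriber such as Whisper. API keys would normally stay on the backend, with the browser calling the project's own server rather than the providers directly.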
No commits in the last 6 months.
Stars: 37
Forks: 5
Language: TypeScript
License: MIT
Category:
Last pushed: Jun 12, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/Ejb503/ai-voice-generation"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ohmstone/pocket-tts-deno
WASM ONNX build of Pocket TTS with voice cloning adapted from pocket-tts-server to run as a Deno...
altkriz/voicecraft
Bring Your Words to Life Transform your text into lifelike speech using cutting-edge AI voices....
AniruddhaAdak/EchoCraft
EchoCraft : Transform your audio into text with precision and style
VoltsyGM/OpenVoice
🔊 Clone voices accurately while controlling style and tone in multiple languages seamlessly with...
dobizz/pyplayht
PlayHT API Python wrapper