rk-vashista/TTS-Story_Generator
A versatile app that converts images into short stories and lifelike audio locally. It combines Hugging Face's image captioning, Groq's story generation, and Parler TTS for local text-to-speech synthesis. Ideal for AI-driven projects with fast, reliable on-device TTS.
No commits in the last 6 months.
Stars
—
Forks
1
Language
Python
License
MIT
Category
Last pushed
Sep 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/rk-vashista/TTS-Story_Generator"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
edwko/OuteTTS
Interface for OuteTTS models.
fluxions-ai/vui
100M parameter lightweight conversational text-to-speech model with breaths, laughter,...
OpenVoiceOS/ovos-audio-transformer-plugin-ggwave
data over sound plugin
mbzuai-oryx/LLMVoX
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
inboxpraveen/LLM-Minutes-of-Meeting
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates...