zsoltfrks/multimodal-story-generator
A rather simple story generator from images with text-to-speech integration using HuggingFace open source assets to create a multimodal app. University project.
Stars
—
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 14, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/zsoltfrks/multimodal-story-generator"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rainygirl/rspeaker
말귀를 알아듣고 뉴스도 요약해 읽어줍니다
pncnmnp/phoenix10.1
Creates personalized radio stations with your own radio jockey!
rudra00434/SoulPlayer
My own music application build with Django , Tailwind CSS and Spacy (specifically used for voice search )
lifeiteng/TTS-TextAnalyzer
TTS Text Analyzer
troykelly/live-news-break
An advanced tool designed for creating automated news bulletins. It generates dynamic news...