Afhrodite/Audio-LLM-Playground
A collection of audio transcription and summarization tools developed during IBM's Coursera course on Generative AI. Combines OpenAI Whisper for speech recognition and IBM Watsonx’s LLAMA2 for natural language processing. Cleaned up and organized for clarity and reuse.
No commits in the last 6 months.
Stars
—
Forks
—
Language
Python
License
Apache-2.0
Category
Last pushed
Jul 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Afhrodite/Audio-LLM-Playground"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
edwko/OuteTTS
Interface for OuteTTS models.
fluxions-ai/vui
100M parameter lightweight conversational text-to-speech model with breaths, laughter,...
OpenVoiceOS/ovos-audio-transformer-plugin-ggwave
data over sound plugin
mbzuai-oryx/LLMVoX
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
inboxpraveen/LLM-Minutes-of-Meeting
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates...