ahmedbesbes/audiolizr
A BentoML-powered API to transcribe audio and make sense of it
Extracts keywords with the YAKE algorithm, generates abstractive summaries with a T5 Transformer model, and identifies named entities with spaCy. BentoML runners orchestrate the pipeline: transcription runs first, then the three metadata extraction steps run in parallel on the transcript. The service accepts YouTube video URLs and audio files, and it can be containerized with Docker with GPU support (CUDA 11.6) for deployment on AWS EC2 instances.
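The "transcribe first, then extract metadata in parallel" flow described above can be sketched with plain asyncio. This is only an illustration of the ordering, not the repository's code: it does not use BentoML, and the four coroutines (transcribe, extract_keywords, summarize, find_entities) are hypothetical stand-ins for the speech-to-text, YAKE, T5, and spaCy steps.

```python
import asyncio


# Hypothetical stand-ins for the real models; each returns a toy result
# so the control flow can be run without any ML dependencies.
async def transcribe(audio: str) -> str:
    await asyncio.sleep(0)  # placeholder for the speech-to-text call
    return f"transcript of {audio}"


async def extract_keywords(text: str) -> list:
    await asyncio.sleep(0)  # placeholder for YAKE
    return [word for word in text.split() if len(word) > 3][:3]


async def summarize(text: str) -> str:
    await asyncio.sleep(0)  # placeholder for T5
    return text[:20]


async def find_entities(text: str) -> list:
    await asyncio.sleep(0)  # placeholder for spaCy NER
    return [word for word in text.split() if word[0].isupper()]


async def analyze(audio: str) -> dict:
    # Step 1: transcription must finish before anything else can start.
    text = await transcribe(audio)
    # Step 2: the three extraction steps only need the transcript,
    # so they are scheduled concurrently.
    keywords, summary, entities = await asyncio.gather(
        extract_keywords(text), summarize(text), find_entities(text)
    )
    return {
        "transcript": text,
        "keywords": keywords,
        "summary": summary,
        "entities": entities,
    }
```

Calling `asyncio.run(analyze("talk.wav"))` returns a dictionary with the transcript and the three metadata fields. In a real service each stub would wrap a model call.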
No commits in the last 6 months.
Stars: 39
Forks: 2
Language: Python
License: —
Category:
Last pushed: Dec 21, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ahmedbesbes/audiolizr"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
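The same request can be made from Python with the standard library. This is a minimal sketch under two assumptions that the page does not confirm: that the URL path is made of a category, an owner, and a repository name (inferred from the single curl example), and that the endpoint returns JSON. The helper names quality_url and fetch_quality are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    # Assumption: the path is <category>/<owner>/<repo>, as in the curl example.
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(part, safe="") for part in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    # Assumption: the response body is JSON; adjust the parsing if it is not.
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

For this repository, `fetch_quality("transformers", "ahmedbesbes", "audiolizr")` requests the same URL as the curl command.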
Higher-rated alternatives
biodatlab/thonburian-whisper
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:
Arkapravo-Ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
haiodo/oaitt
An OpenAI compatible transcriber using transformers and whisperx.
purvanshjoshi/IndiVoice-DeepASR
Deep Learning framework for Indian-accented Speech-to-Text using Whisper and LoRA. Includes...
boned-fruitwood759/whisperx-asr-with-fastapi
🎤 Enable real-time speech recognition with WhisperX using FastAPI for efficient, scalable audio...