ahmedbesbes/audiolizr

A bentoML-powered API to transcribe audio and make sense of it

/ 100

Experimental

Extracts keywords using the Yake algorithm, generates abstractive summaries via T5 Transformer, and identifies named entities with spaCy—all orchestrated through BentoML's concurrent runner architecture that processes transcription sequentially before parallel metadata extraction. Accepts YouTube video URLs and audio files, containerizes to Docker with GPU support (CUDA 11.6), enabling deployment on AWS EC2 instances for scalable inference workloads.

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:

Arkapravo-Ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

haiodo/oaitt

An OpenAI compatible transcriber using transformers and whisperx.

purvanshjoshi/IndiVoice-DeepASR

Deep Learning framework for Indian-accented Speech-to-Text using Whisper and LoRA. Includes...

boned-fruitwood759/whisperx-asr-with-fastapi

🎤 Enable real-time speech recognition with WhisperX using FastAPI for efficient, scalable audio...

Explore Transformer Models

All categories Trending Transformer directory Insights