HenestrosaDev/audiotext

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

41 / 100 (Emerging)

Supports three transcription backends (Google Speech-to-Text API, OpenAI's Whisper API, and WhisperX), with configurable parameters such as model size, compute type, and batch size for local WhisperX processing. Built with Python and CustomTkinter for the desktop UI, it batch-processes audio files and directories and offers fine-grained subtitle customization, including word-level highlighting and line-width constraints. It supports 99 languages, with optional translation when using the Whisper-based methods.

345 stars. No commits in the last 6 months.

Stale 6m · No Package · No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 15 / 25


Stars: 345
Forks: 32
Language: Python
License:
Last pushed: Oct 15, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/HenestrosaDev/audiotext"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
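
The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the endpoint returns a JSON body, which the listing does not confirm, and the helper names `quality_url` and `fetch_quality` are hypothetical, introduced here only for illustration. The URL layout (category, owner, repository) is inferred from the single curl example above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout after it
# (category/owner/repo) is inferred from that one URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL] + parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumption: the response body is JSON. Its schema is not documented
    on this page, so the decoded value is returned unchanged.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("voice-ai", "HenestrosaDev", "audiotext")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, a script that polls many repositories may want to cache results locally.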