tsmdt/whisply
💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and subtitle generation using OpenAI’s Whisper on CPU, Nvidia GPU and Apple MLX.
Uses hardware-specific Whisper implementations (`faster-whisper` for CPUs and Nvidia GPUs, `mlx-whisper` for Apple Silicon) with automatic device detection, and integrates `whisperX` and `pyannote` for word-level speaker diarization and customizable subtitle generation. Supports multiple export formats (JSON, SRT, VTT, HTML, RTTM) and batch processing via the CLI, a browser app, or config files for scalable transcription workflows.
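The device-to-backend mapping described above can be sketched roughly as follows. This is an illustrative sketch only, not whisply's actual code: the function names `choose_backend` and `detect_backend`, the device labels, and the `has_cuda` flag are all hypothetical, and a real tool would probe CUDA availability itself.

```python
import platform


def choose_backend(system: str, machine: str, has_cuda: bool) -> tuple[str, str]:
    """Map a platform description to a (backend, device) pair.

    Hypothetical helper: mirrors the mapping in the description
    (mlx-whisper on Apple Silicon, faster-whisper elsewhere).
    """
    # Apple Silicon reports system "Darwin" and machine "arm64".
    if system == "Darwin" and machine == "arm64":
        return ("mlx-whisper", "mlx")
    # Nvidia GPU present: faster-whisper on CUDA.
    if has_cuda:
        return ("faster-whisper", "cuda")
    # Fallback: faster-whisper on CPU.
    return ("faster-whisper", "cpu")


def detect_backend(has_cuda: bool = False) -> tuple[str, str]:
    """Detect the current platform; the caller supplies the CUDA check result."""
    return choose_backend(platform.system(), platform.machine(), has_cuda)
```

Keeping the decision in a pure function makes it easy to check each hardware case without access to that hardware; `detect_backend()` only adds the platform lookup.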
108 stars and 1,597 monthly downloads. Available on PyPI.
Stars: 108
Forks: 16
Language: Python
License: MIT
Category:
Last pushed: Mar 18, 2026
Monthly downloads: 1,597
Commits (30d): 0
Dependencies: 17
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tsmdt/whisply"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
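The same request can be made from Python with the standard library. This is a minimal sketch under two assumptions: that the path always follows the category/owner/repo layout seen in the single example URL, and that the endpoint returns JSON. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the layout beyond it is assumed.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response (assumed to be JSON; schema not documented here)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "tsmdt", "whisply")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, caching the result locally is advisable for repeated lookups.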
Related tools
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
linto-ai/linto-stt
An automatic speech recognition API
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker...
ringger/transcribe-critic
Multi-source transcript merging inspired by textual criticism — LLM adjudicates multiple...