jianchang512/stt

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具，输出json、srt字幕、纯文字格式

/ 100

Established

Leverages faster-whisper's optimized inference engine with support for five model sizes (tiny to large-v3), enabling CPU/GPU processing with automatic CUDA acceleration on NVIDIA hardware. Provides REST API endpoints with OpenAI API compatibility, plus a web UI for drag-and-drop processing with multi-language support (15+ languages) and configurable output formats. Includes optional FFmpeg integration for video frame extraction and can operate completely offline after model download.

4,331 stars.

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

4,331

Forks

463

Language

Python

License

GPL-3.0

Compare

stt and realtime-stt

Related tools

Jaymon/transcribe

Convert images or audio files to plain text on the command line

cyberofficial/Synthalingua

Synthalingua - Real Time Translation

developers-cosmos/Mimasa

Real time multilingual face translator

lperezmo/real-time-translator

A quick app to translate speech in real time using the Whisper API for transcribing audio,...

HenestrosaDev/audiotext

A desktop application that transcribes audio from files, microphone input or YouTube videos with...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights