meizhong986/WhisperJAV

ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV

/ 100

Established

Implements a multi-stage noise-suppression architecture combining scene-based VAD segmentation, domain-specific linguistic normalization (handling Japanese onomatopoeia and dialect tokenization), and defensive decoding with log-probability thresholding to eliminate hallucinations in extended audio. Supports pluggable ASR backends (Whisper, Faster-Whisper, Qwen3-ASR, anime-whisper, HuggingFace Kotoba) through a decoupled ChronosJAV pipeline that separates text generation from forced-alignment timestamp inference. Offers seven processing modes with configurable scene detection, speech enhancement, and segmentation strategies, plus two-pass ensemble mode to merge outputs across different model architectures.

1,216 stars. Actively maintained with 148 commits in the last 30 days.

No Package No Dependents

Maintenance 25 / 25

Adoption 10 / 25

Maturity 9 / 25

Community 19 / 25

How are scores calculated?

Stars

1,216

Forks

110

Language

HTML

License

MIT

Related tools

itsmevictor/clean-transcribe

A simple CLI to transcribe Youtube videos or local audio/video files and produce LLM-cleaned...

BryceWG/BiBi-Keyboard

说点啥（BiBi Keyboard）:一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR voice input method...

vivekuppal/transcribe

Transcribe is a real time transcription, conversation, Language learning platform. It provides...

DevEmperor/Dictate

A powerful Whisper AI keyboard for reliable speech transcription

sindresorhus/awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

Explore Voice AI Tools

All categories Trending Voice AI directory Insights