meizhong986/WhisperJAV
ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV
Implements a multi-stage noise-suppression architecture combining scene-based VAD segmentation, domain-specific linguistic normalization (handling Japanese onomatopoeia and dialect tokenization), and defensive decoding with log-probability thresholding to eliminate hallucinations in extended audio. Supports pluggable ASR backends (Whisper, Faster-Whisper, Qwen3-ASR, anime-whisper, HuggingFace Kotoba) through a decoupled ChronosJAV pipeline that separates text generation from forced-alignment timestamp inference. Offers seven processing modes with configurable scene detection, speech enhancement, and segmentation strategies, plus two-pass ensemble mode to merge outputs across different model architectures.
1,216 stars. Actively maintained with 148 commits in the last 30 days.
Stars
1,216
Forks
110
Language
HTML
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
148
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/meizhong986/WhisperJAV"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
itsmevictor/clean-transcribe
A simple CLI to transcribe Youtube videos or local audio/video files and produce LLM-cleaned...
BryceWG/BiBi-Keyboard
说点啥(BiBi Keyboard):一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR voice input method...
vivekuppal/transcribe
Transcribe is a real time transcription, conversation, Language learning platform. It provides...
DevEmperor/Dictate
A powerful Whisper AI keyboard for reliable speech transcription
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI