meizhong986/WhisperJAV

ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV

63
/ 100
Established

Implements a multi-stage noise-suppression architecture combining scene-based VAD segmentation, domain-specific linguistic normalization (handling Japanese onomatopoeia and dialect tokenization), and defensive decoding with log-probability thresholding to eliminate hallucinations in extended audio. Supports pluggable ASR backends (Whisper, Faster-Whisper, Qwen3-ASR, anime-whisper, HuggingFace Kotoba) through a decoupled ChronosJAV pipeline that separates text generation from forced-alignment timestamp inference. Offers seven processing modes with configurable scene detection, speech enhancement, and segmentation strategies, plus two-pass ensemble mode to merge outputs across different model architectures.

1,216 stars. Actively maintained with 148 commits in the last 30 days.

No Package No Dependents
Maintenance 25 / 25
Adoption 10 / 25
Maturity 9 / 25
Community 19 / 25

How are scores calculated?

Stars

1,216

Forks

110

Language

HTML

License

MIT

Last pushed

Mar 12, 2026

Commits (30d)

148

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/meizhong986/WhisperJAV"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.