HKAB/whisper-finetune-vietnamese
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
Integrates a custom BeamSearchWithLM decoder leveraging KenLM n-gram language models to improve decoding accuracy beyond standard greedy inference. Provides complete finetuning pipelines with distributed training support, alongside pre-trained checkpoints and notebooks for n-gram generation and comparative benchmarking against Wav2vec2 baselines. Reduces WER from 50% to 33% on Vietnamese speech through supervised finetuning on 100-hour in-domain data.
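To illustrate how an n-gram language model can steer beam search away from a greedy result, here is a minimal sketch. It is not the repository's BeamSearchWithLM code. The function `beam_search_with_lm`, the `toy_lm` scorer, the bigram table, and the per-step probabilities are all hypothetical. A real setup would query a KenLM model instead of a dictionary, and Whisper's token distribution would depend on the prefix decoded so far rather than being fixed per step.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

Prefix = Tuple[str, ...]


def beam_search_with_lm(
    step_log_probs: Sequence[Dict[str, float]],
    lm_score: Callable[[Prefix, str], float],
    beam_size: int = 2,
    lm_weight: float = 0.5,
) -> List[str]:
    """Beam search that adds a weighted LM log-score to each acoustic log-prob.

    Simplifying assumption: each step's token distribution is given up front
    and does not depend on the prefix.
    """
    beams: List[Tuple[Prefix, float]] = [((), 0.0)]
    for dist in step_log_probs:
        candidates: List[Tuple[Prefix, float]] = []
        for prefix, score in beams:
            for token, acoustic_lp in dist.items():
                fused = score + acoustic_lp + lm_weight * lm_score(prefix, token)
                candidates.append((prefix + (token,), fused))
        # Keep only the highest-scoring hypotheses for the next step.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_size]
    return list(beams[0][0])


# Hypothetical bigram table standing in for a KenLM model.
BIGRAMS = {("xin", "chào"): 0.9}


def toy_lm(prefix: Prefix, token: str) -> float:
    """Return a log-probability for `token` given the previous word."""
    prev = prefix[-1] if prefix else "<s>"
    return math.log(BIGRAMS.get((prev, token), 0.1))


# Made-up acoustic scores: the model slightly prefers the unaccented "chao".
steps = [
    {"xin": math.log(0.9), "xinh": math.log(0.1)},
    {"chao": math.log(0.55), "chào": math.log(0.45)},
]

without_lm = beam_search_with_lm(steps, toy_lm, beam_size=2, lm_weight=0.0)
with_lm = beam_search_with_lm(steps, toy_lm, beam_size=2, lm_weight=1.0)
```

With `lm_weight=0.0` the search follows the acoustic scores alone and matches the greedy choice. With `lm_weight=1.0` the bigram score for "xin chào" outweighs the small acoustic preference for "chao". In practice the weight is a tuning parameter.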
No commits in the last 6 months.
Stars: 37
Forks: 12
Language: Jupyter Notebook
License: not specified
Category
Last pushed: Oct 06, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/HKAB/whisper-finetune-vietnamese"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
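The same request can be made from Python. The sketch below uses only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the path layout (`category/owner/repo`) is inferred from the single URL shown above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the path layout is inferred from the curl example."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (assumes a JSON response body)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


url = build_quality_url("nlp", "HKAB", "whisper-finetune-vietnamese")
```

Calling `fetch_quality("nlp", "HKAB", "whisper-finetune-vietnamese")` performs the network request, and unauthenticated calls count against the 100 requests/day limit.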
Higher-rated alternatives
512z/podlens
Free Podwise: AI Podcast & YouTube Transcription & Understanding Agent | Podcast + YouTube transcription / learning / visualization AI tool
AEmotionStudio/ComfyUI-FFMPEGA
Intelligent FFMPEG agent node for ComfyUI - transforms natural language video editing prompts...
Sammybams/whisper-to-text-with-azure
A telegram bot that performs transcription, translation and summarization on your audio files in...
MLH-Fellowship/transcribio
A web application that allows educators to easily generate transcripts for their video lectures...
salihfurkaan/AutoSub-CLI
A tool that simplifies the process of adding subtitles to videos by leveraging the power of NLP...