biodatlab/thonburian-whisper

Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:

42
/ 100
Emerging

Fine-tuned on a composite dataset spanning CommonVoice, Gowajee, Thai Elderly Speech, and dialect corpora, with variants optimized for noise robustness and domain-specific applications (financial, medical). Integrates directly with Hugging Face transformers pipeline for inference, and offers both full-scale and distilled model checkpoints (166M-1.5B parameters) with measurable WER benchmarks and inference speed trade-offs optimized for different deployment scenarios.

186 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 14 / 25

How are scores calculated?

Stars

186

Forks

20

Language

Jupyter Notebook

License

MIT

Last pushed

Jul 29, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/biodatlab/thonburian-whisper"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.