vasistalodagala/whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

/ 100

Emerging

Provides embedding extraction utilities across different Whisper layer depths and supports distributed multi-GPU training with step-based or epoch-based scheduling. Integrates with Hugging Face's seq2seq training pipeline and datasets library, while enabling custom dataset ingestion through standardized audio/text file formats and optional JAX-accelerated evaluation for faster inference.

361 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

How are scores calculated?

Stars

361

Forks

Language

Python

License

MIT

Compare

whisper-finetune and Whisper-Finetune

Higher-rated alternatives

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

yeyupiaoling/Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data,...

xenova/whisper-web

ML-powered speech recognition directly in your browser

Pikurrot/whisper-gui

A simple GUI to use Whisper.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights