kahne/SpeechTransProgress

Tracking the progress in end-to-end speech translation

/ 100

Emerging

Comprehensive resource compiling multilingual speech translation datasets (CoVoST 2, CVSS, mTEDx, MUST-C) spanning 20+ language pairs with both text and speech targets, alongside implementations in major frameworks like ESPNet-ST and Fairseq S2T. Tracks benchmark progress through peer-reviewed papers and tutorials covering direct speech-to-text translation without intermediate ASR, end-to-end architectures that jointly model acoustic and linguistic knowledge. Provides curated bibliography of recent advances including large language model fine-tuning approaches and comparative evaluations across diverse domain corpora ranging from TED talks to parliamentary proceedings.

261 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 15 / 25

How are scores calculated?

Stars

261

Forks

Language

—

License

CC0-1.0

Higher-rated alternatives

voicegain/platform

Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)

davidamacey/OpenTranscribe

Self-hosted AI-powered transcription platform with speaker diarization, search, and...

aws-samples/amazon-transcribe-live-call-analytics

Amazon Transcribe Live Call Analytics (LCA) Sample Solution

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while...

jim-schwoebel/voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights