athena-team/athena

An open-source implementation of a sequence-to-sequence based speech processing engine

/ 100

Established

Built on TensorFlow 2.0+, Athena supports multiple speech processing tasks (ASR, TTS, VAD, KWS) with hybrid attention/CTC architectures, WFST-based decoding, and distributed multi-GPU training via Horovod. It includes a Kaldi-free feature extraction library (Athena_transform) and a C++ runtime for inference that can be deployed as a local server. The framework provides end-to-end and streaming model variants, with reference recipes for several datasets (AISHELL-1, LibriSpeech, GigaSpeech) and a zoo of pre-trained models.

970 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

Stars

970

Forks

199

Language

C++

License

Apache-2.0

Related tools

githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)

rakeshvar/rnn_ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal...
