revdotcom/fstalign

An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.

/ 100

Emerging

Built on OpenFST finite-state composition with lazy traversal algorithms, it aligns reference and hypothesis token sequences while supporting both NLP-formatted and CTM input formats for detailed error analysis. Version 2.0 introduces optimized graph traversal that dramatically accelerates alignment on lengthy or error-rich sequences, plus intelligent constraint handling that prevents punctuation-word misalignments and favors case-insensitive substitutions. Provides configurable beam search, strict punctuation enforcement, and JSON output for integration into speech recognition evaluation pipelines.

171 stars.

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 9 / 25

Community 10 / 25

How are scores calculated?

Stars

171

Forks

Language

C++

License

Apache-2.0

Higher-rated alternatives

daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as...

pykaldi/pykaldi

A Python wrapper for Kaldi

Explore Voice AI Tools

All categories Trending Voice AI directory Insights