tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

/ 100

Emerging

Processes audio in configurable frame windows (25ms default at 10ms stride) with pre-emphasis filtering and outputs 12 normalized features per frame (frequency, power, width, dissonance for up to 3 formants), using Numba-accelerated computation and HDF5 storage for batch processing. Integrates with the SER_Datasets_Import ecosystem for bulk feature extraction from speech emotion recognition datasets, with helper functions for reading HDF files and computing dataset statistics.

No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 10 / 25

Maturity 18 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Category

keyword-speech-recognition

Last pushed

Jun 03, 2022

Monthly downloads

Commits (30d)

Dependencies

GitHub PyPI

Keyword Speech Recognition · 112 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tabahi/formantfeatures"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

shenasa-ai/speech2text

A Deep-Learning-Based Persian Speech Recognition System

Explore Voice AI Tools

All categories Trending Voice AI directory Insights