SuyashMore/MevonAI-Speech-Emotion-Recognition
Identify the emotion of multiple speakers in an Audio Segment
Combines speaker diarization (partitioning audio by speaker identity) with MFCC feature extraction and a CNN classifier to isolate each speaker's segments and classify their emotion independently. The pipeline uses librosa for audio processing and a TensorFlow-Keras 2D convolutional network trained on the RAVDESS dataset, writing per-speaker emotion predictions to CSV. Includes Docker deployment and processes local .wav files, making it suitable for call-center feedback analysis workflows.
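The repository itself relies on librosa's `librosa.feature.mfcc`; as a rough illustration of what that feature step computes (power spectrum, mel filterbank, log, DCT), here is a NumPy-only sketch for a single frame. This is not the project's actual code, and the frame/filterbank parameters are illustrative assumptions.

```python
# Illustrative MFCC-style feature sketch (NOT the repo's code).
# The real pipeline calls librosa.feature.mfcc on each diarized segment.
import numpy as np

def mfcc_sketch(signal, sr=16000, n_fft=512, n_mels=26, n_mfcc=13):
    """Compute MFCC-like coefficients for one frame of audio."""
    # Power spectrum of one Hann-windowed frame
    frame = signal[:n_fft] * np.hanning(n_fft)
    spectrum = np.abs(np.fft.rfft(frame)) ** 2

    # Triangular mel filterbank between 0 Hz and Nyquist
    mel_max = 2595 * np.log10(1 + (sr / 2) / 700)
    mel_pts = np.linspace(0, mel_max, n_mels + 2)
    hz_pts = 700 * (10 ** (mel_pts / 2595) - 1)
    bins = np.floor((n_fft + 1) * hz_pts / sr).astype(int)

    fbank = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        l, c, r = bins[m - 1], bins[m], bins[m + 1]
        fbank[m - 1, l:c] = (np.arange(l, c) - l) / max(c - l, 1)
        fbank[m - 1, c:r] = (r - np.arange(c, r)) / max(r - c, 1)

    # Log mel energies, then DCT-II to decorrelate -> cepstral coefficients
    log_mel = np.log(fbank @ spectrum + 1e-10)
    n = np.arange(n_mels)
    dct = np.cos(np.pi * np.outer(np.arange(n_mfcc), 2 * n + 1) / (2 * n_mels))
    return dct @ log_mel

# Example: coefficients for one frame of a 440 Hz tone
tone = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)
coeffs = mfcc_sketch(tone)
print(coeffs.shape)  # (13,)
```

In the described pipeline, stacks of such per-frame coefficient vectors form the 2D input "images" that the convolutional classifier consumes.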
179 stars. No commits in the last 6 months.
Stars: 179
Forks: 46
Language: C
License: MIT
Category:
Last pushed: Feb 12, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/SuyashMore/MevonAI-Speech-Emotion-Recognition"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
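For callers who prefer Python over curl, a minimal stdlib-only client might look like the sketch below. The response schema is an assumption on my part; field names such as "stars" and "license" are illustrative, not documented here.

```python
# Hedged sketch of consuming the quality endpoint shown above.
# Field names in the mocked payload are assumptions, not a documented schema.
import json
import urllib.request

URL = ("https://pt-edge.onrender.com/api/v1/quality/voice-ai/"
       "SuyashMore/MevonAI-Speech-Emotion-Recognition")

def fetch_quality(url=URL, timeout=10):
    """Fetch and decode the JSON quality report for the repository."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)

# Offline demo against a mocked payload (no network call made here):
sample = '{"stars": 179, "forks": 46, "license": "MIT"}'
data = json.loads(sample)
print(data["stars"])  # 179
```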
Higher-rated alternatives
salute-developers/GigaAM
Foundational Model for Speech Recognition Tasks
AkishinoShiame/Chinese-Speech-Emotion-Datasets
Datasets for a deep convolutional neural network based virtual elderly companion agent.
habla-liaa/ser-with-w2v2
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec...
NotAbhinavGamerz/emotion-aware-automatic-speech-recognition
🎤 Enhance speech recognition by detecting emotions in spoken language, combining OpenAI's...
jsugg/ser
The AI-powered ser Python package is a tool for recognizing and analyzing emotions in speech....