Speech Emotion Recognition Leveraging OpenAI's Whisper Representations and Attentive Pooling Methods

By: Ali Shendabadi, Parnia Izadirad, Mostafa Salehi, Mahmoud Bijankhan

Published: 2026-02-06

View on arXiv →
#cs.AI

Abstract

This paper focuses on improving speech emotion recognition by utilizing representations from OpenAI's Whisper model combined with attentive pooling. This advancement has significant real-world applications in areas like human-computer interaction, customer service, and mental health monitoring.

FEEDBACK

Projects

No projects yet

Speech Emotion Recognition Leveraging OpenAI's Whisper Representations and Attentive Pooling Methods | ArXiv Intelligence