Speech Emotion Recognition Leveraging OpenAI's Whisper Representations and Attentive Pooling Methods
By: Ali Shendabadi, Parnia Izadirad, Mostafa Salehi, Mahmoud Bijankhan
Published: 2026-02-06
View on arXiv →#cs.AI
Abstract
This paper focuses on improving speech emotion recognition by utilizing representations from OpenAI's Whisper model combined with attentive pooling. This advancement has significant real-world applications in areas like human-computer interaction, customer service, and mental health monitoring.