Authors
Björn Schuller, Michel Valster, Florian Eyben, Roddy Cowie, Maja Pantic
Publication date
2012/10/22
Book
Proceedings of the 14th ACM international conference on Multimodal interaction
Pages
449-456
Description
We present the second Audio-Visual Emotion recognition Challenge and workshop (AVEC 2012), which aims to bring together researchers from the audio and video analysis communities around the topic of emotion recognition. The goal of the challenge is to recognise four continuously valued affective dimensions: arousal, expectancy, power, and valence. There are two sub-challenges: in the Fully Continuous Sub-Challenge participants have to predict the values of the four dimensions at every moment during the recordings, while for the Word-Level Sub-Challenge a single prediction has to be given per word uttered by the user. This paper presents the challenge guidelines, the common data used, and the performance of the baseline system on the two tasks.
Total citations
2011201220132014201520162017201820192020202120222023202411039455428242634202621178
Scholar articles
B Schuller, M Valster, F Eyben, R Cowie, M Pantic - Proceedings of the 14th ACM international conference …, 2012