Authors
Tanguy Albrici, Mandana Fasounaki, Saleh Bagher Salimi, Guillaume Vray, Behzad Bozorgtabar, Hazım Kemal Ekenel, Jean-Philippe Thiran
Publication date
2019/5/14
Conference
2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019)
Pages
1-6
Publisher
IEEE
Description
This paper addresses the problem of automatic facial expression recognition in videos, where the goal is to predict discrete emotion labels best describing the emotions expressed in short video clips. Building on a pre-trained convolutional neural network (CNN) model dedicated to analyzing the video frames and LSTM network designed to process the trajectories of the facial landmarks, this paper investigates several novel directions. First of all, improved face descriptors based on 2D CNNs and facial landmarks are proposed. Second, the paper investigates fusion methods of the features temporally, including a novel hierarchical recurrent neural network combining facial landmark trajectories over time. In addition, we propose a modification to state-of-the-art expression recognition architectures to adapt them to video processing in a simple way. In both ensemble approaches, the temporal information is integrated …
Total citations
20212022202320245421
Scholar articles
T Albrici, M Fasounaki, SB Salimi, G Vray… - 2019 14th IEEE International Conference on Automatic …, 2019