End-to-end convolutional neural network-based voice presentation attack detection

Authors

Hannah Muckenhirn, Mathew Magimai-Doss, Sébastien Marcel

Publication date

2017/10/1

Conference

2017 IEEE International Joint Conference on Biometrics (IJCB)

Pages

335-341

Publisher

IEEE

Description

Development of countermeasures to detect attacks performed on speaker verification systems through presentation of forged or altered speech samples is a challenging and open research problem. Typically, this problem is approached by extracting features through conventional short-term speech processing and feeding them to a binary classifier. In this article, we develop a convolutional neural network-based approach that learns in an end-to-end manner both the features and the binary classifier from the raw signal. Through investigations on two publicly available databases, namely, ASVspoof and AVspoof, we show that it yields systems comparable to or better than the state-of-the-art approaches for both physical access attacks and logical access attacks. Furthermore, the approach is shown to be complementary to a spectral statistics-based approach, which, similarly to the proposed approach, does not use …

Total citations

Cited by 73

Citations per year: 2018: 14, 2019: 15, 2020: 12, 2021: 12, 2022: 12, 2023: 7, 2024: 1
