Authors
Anders Boesen Lindbo Larsen, Søren Kaae Sønderby, Hugo Larochelle, Ole Winther
Publication date
2016/6/11
Conference
International conference on machine learning
Pages
1558-1566
Publisher
PMLR
Description
We present an autoencoder that leverages learned representations to better measure similarities in data space. By combining a variational autoencoder (VAE) with a generative adversarial network (GAN) we can use learned feature representations in the GAN discriminator as basis for the VAE reconstruction objective. Thereby, we replace element-wise errors with feature-wise errors to better capture the data distribution while offering invariance towards eg translation. We apply our method to images of faces and show that it outperforms VAEs with element-wise similarity measures in terms of visual fidelity. Moreover, we show that the method learns an embedding in which high-level abstract visual features (eg wearing glasses) can be modified using simple arithmetic.
Total citations
20162017201820192020202120222023202438150273328379384401400204
Scholar articles
ABL Larsen, SK Sønderby, H Larochelle, O Winther - International conference on machine learning, 2016