View article

[PDF] from arxiv.org

Sdae: Self-distillated masked autoencoder

Authors

Yabo Chen, Yuchen Liu, Dongsheng Jiang, Xiaopeng Zhang, Wenrui Dai, Hongkai Xiong, Qi Tian

Publication date

2022/10/23

Book

European conference on computer vision

Pages

108-124

Publisher

Springer Nature Switzerland

Description

With the development of generative-based self-supervised learning (SSL) approaches like BeiT and MAE, how to learn good representations by masking random patches of the input image and reconstructing the missing information has grown in concern. However, BeiT and PeCo need a “pre-pretraining” stage to produce discrete codebooks for masked patches representing. MAE does not require a pre-training codebook process, but setting pixels as reconstruction targets may introduce an optimization gap between pre-training and downstream tasks that good reconstruction quality may not always lead to the high descriptive capability for the model. Considering the above issues, in this paper, we propose a simple Self-distillated masked AutoEncoder network, namely SdAE. SdAE consists of a student branch using an encoder-decoder structure to reconstruct the missing information, and a teacher branch …

Total citations

Cited by 62

2022202320244 33 25

Scholar articles

Sdae: Self-distillated masked autoencoder

Y Chen, Y Liu, D Jiang, X Zhang, W Dai, H Xiong… - European conference on computer vision, 2022