Authors
Deng-Ping Fan, Zheng Lin, Zhao Zhang, Menglong Zhu, Ming-Ming Cheng*
Publication date
2021
Journal
IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
Volume
32
Issue
5
Pages
2075-2089
Description
The use of RGB-D information for salient object detection (SOD) has been extensively explored in recent years. However, relatively few efforts have been put toward modeling SOD in real-world human activity scenes with RGB-D. In this article, we fill the gap by making the following contributions to RGB-D SOD: 1) we carefully collect a new S al i ent P erson (SIP) data set that consists of ~1 K high-resolution images that cover diverse real-world scenes from various viewpoints, poses, occlusions, illuminations, and background s; 2) we conduct a large-scale (and, so far, the most comprehensive) benchmark comparing contemporary methods, which has long been missing in the field and can serve as a baseline for future research, and we systematically summarize 32 popular models and evaluate 18 parts of 32 models on seven data sets containing a total of about 97k images; and 3) we propose a simple general …
Total citations
201920202021202220232024105211314415097
Scholar articles
DP Fan, Z Lin, Z Zhang, M Zhu, MM Cheng - IEEE Transactions on neural networks and learning …, 2020