Authors
Manli Zhu, Edmond SL Ho, Shuang Chen, Longzhi Yang, Hubert PH Shum
Publication date
2024/7/16
Journal
IEEE Transactions on Instrumentation and Measurement
Publisher
IEEE
Description
Cameras are essential vision instruments to capture images for pattern detection and measurement. Human-object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes. Recently, Transformer-based models have become the dominant approach for HOI detection due to their advanced network architectures and thus promising results. However, most of them follow the one-stage design of vanilla Transformer, leaving rich geometric priors under-exploited and leading to compromised performance especially when occlusion occurs. Given that geometric features tend to outperform visual ones in occluded scenarios and offer information that complements visual cues, we propose a novel end-to-end Transformer-style HOI detection model, i.e., geometric features enhanced HOI detector (GeoHOI). One key part of the model is a new unified self …
Scholar articles
M Zhu, ESL Ho, S Chen, L Yang, HPH Shum - IEEE Transactions on Instrumentation and …, 2024