Authors
Eran Goldman, Roei Herzig, Aviv Eisenschtat, Jacob Goldberger, Tal Hassner
Publication date
2019
Conference
Computer Vision and Pattern Recognition (CVPR)
Description
Man-made scenes are often densely packed, containing numerous objects, often identical, positioned in close proximity. We show that precise object detection in such scenes remains a challenging frontier even for state-of-the-art object detectors. We propose a novel, deep-learning based method for precise object detection, designed for such challenging settings. Our contributions include:(1) A layer for estimating the Jaccard index as a detection quality score;(2) a novel EM merging unit, which uses our quality scores to resolve detection overlap ambiguities; finally,(3) an extensive, annotated data set, SKU-110K, representing packed retail environments, released for training and testing under such extreme settings. Detection tests on SKU-110K, and counting tests on the CARPK and PUCPR+, show our method to outperform existing state-of-the-art with substantial margins.
Total citations
20192020202120222023202493531486141
Scholar articles
E Goldman, R Herzig, A Eisenschtat, J Goldberger… - Proceedings of the IEEE/CVF conference on computer …, 2019
E Goldman, R Herzig, A Eisenschtat, J Goldberger… - CVF Conference on Computer Vision and Pattern …, 2019
E Goldman, R Herzig, A Eisenschtat, J Goldberger… - arXiv preprint arXiv:1904.00853, 1904