Authors
Jan Zahálka, Stevan Rudinac, Björn Þór Jónsson, Dennis C Koelma, Marcel Worring
Publication date
2017/9/22
Journal
IEEE Transactions on Multimedia
Volume
20
Issue
3
Pages
687-698
Publisher
IEEE
Description
This paper presents Blackthorn, an efficient interactive multimodal learning approach facilitating analysis of multimedia collections of up to 100 million items on a single high-end workstation. Blackthorn features efficient data compression, feature selection, and optimizations to the interactive learning process. The Ratio-64 data representation introduced in this paper only costs tens of bytes per item yet preserves most of the visual and textual semantic information with good accuracy. The optimized interactive learning model scores the Ratio-64-compressed data directly, greatly reducing the computational requirements. The experiments compare Blackthorn with two baselines: Conventional relevance feedback, and relevance feedback using product quantization to compress the features. The results show that Blackthorn is up to 77.5× faster than the conventional relevance feedback alternative, while outperforming …
Total citations
20182019202020212022202320243465342
Scholar articles
J Zahálka, S Rudinac, BÞ Jónsson, DC Koelma… - IEEE Transactions on Multimedia, 2017