Authors
Luciano Barbosa, Juliana Freire
Publication date
2010/5/27
Journal
Journal of Information and Data Management
Volume
1
Issue
1
Pages
133
Description
In this paper, we study the problem of automating the retrieval of data hidden behind simple search interfaces that accept keyword-based queries. Our goal is to automatically retrieve all available results (or, as many as possible). We propose a new approach to siphon hidden data that automatically generates a small set of representative keywords and builds queries which lead to high coverage. We evaluate our algorithms over several real Web sites. Preliminary results indicate our approach is effective: coverage of over 90% is obtained for most of the sites considered.
Total citations
200420052006200720082009201020112012201320142015201620172018201920202021202220232024148161317291818201716814983521
Scholar articles
L Barbosa, J Freire - Journal of Information and Data Management, 2010