Authors
Jochen Kranzdorf, Andrew Sellers, Giovanni Grasso, Christian Schallhart, Tim Furche
Publication date
2012
Conference
Proc. of the 21st World Wide Web Conference, WWW 2012 (Companion Volume)
Pages
369-372
Publisher
ACM
Description
Good examples are hard to find, particularly in wrapper induction: Picking even one wrong example can spell disaster by yielding overgeneralized or overspecialized wrappers. Such wrappers extract data with low precision or recall, unless adjusted by human experts at significant cost.
Visual OXPath is an open-source, visual wrapper induction system that requires minimal examples and eases wrapper refinement: Often it derives the intended wrapper from a single example through sophisticated heuristics that determine the best set of similar examples. To ease wrapper refinement, it offers a list of wrappers ranked by example similarity and robustness. Visual OXPath offers extensive visual feedback for this refinement which can be performed without any knowledge of the underlying wrapper language. Where further refinement by a human wrapper is needed, Visual OXPath profits from being based on OXPath, a …
Total citations
20122013201420152016201720181332212
Scholar articles
J Kranzdorf, A Sellers, G Grasso, C Schallhart… - Proceedings of the 21st International Conference on …, 2012