Authors
Rahul Kapoor, Mayank Kejriwal, Pedro Szekely
Publication date
2017/5/14
Book
Proceedings of the fourth international ACM workshop on managing and mining enriched geo-spatial data
Pages
1-6
Description
Extracting geographical tags from webpages is a well-motiva-ted application in many domains. In illicit domains with unusual language models, like human trafficking, extracting geotags with both high precision and recall is a challenging problem. In this paper, we describe a geotag extraction framework in which context, constraints and the openly available Geonames knowledge base work in tandem in an Integer Linear Programming (ILP) model to achieve good performance. In preliminary empirical investigations, the framework improves precision by 28.57% and F-measure by 36.9% on a difficult human trafficking geotagging task compared to a machine learning-based baseline. The method is already being integrated into an existing knowledge base construction system widely used by US law enforcement agencies to combat human trafficking.
Total citations
2017201820192020202120222023202445413453
Scholar articles
R Kapoor, M Kejriwal, P Szekely - Proceedings of the fourth international ACM workshop …, 2017