CIRG IRGDISCO at RepLab2013 Filtering Task: Use of Wikipedia's Graph Structure for Entity Name Disambiguation in Tweets.

Authors

Muhammad Atif Qureshi, Arjumand Younus, Daniel Abril, Colm O'Riordan, Gabriella Pasi

Publication date

2013

Conference

CLEF (Working Notes)

Description

Social media repositories are a significant source of evidence when extracting information about the reputation of a particular entity (e.g., a politician, singer, or company). Reputation management experts manually mine social media repositories, Twitter in particular, to monitor an entity's reputation. Recently, the online reputation management evaluation campaign known as RepLab at CLEF has turned attention to devising computational methods that assist reputation management experts. A significant research challenge in this area is disambiguating tweets with respect to entity names. Determining whether a tweet is relevant or irrelevant to a particular entity is an important task that has not yet been satisfactorily solved. To address it, in this paper we use “context phrases” in a tweet and the disambiguated Wikipedia articles for an entity in an SVM classifier whose features are extracted from the Wikipedia graph structure, i.e., links into Wikipedia articles and links from Wikipedia articles. We also use term-specificity and term-collocation features derived from the Wikipedia article of the entity under investigation. The experimental evaluations do not show a significant improvement over the baseline, and other systems outperform our approach; however, manual inspection of the feature sets and training data suggests that the proposed Wikipedia graph-based features may show promise when combined with sophisticated learning algorithms.
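
The description above does not spell out how the graph-based features are computed, so the following is only a minimal sketch of one plausible reading: a tweet's context phrases are compared against the titles of articles that link into, and are linked from, the entity's Wikipedia article. The function names, the n-gram definition of a context phrase, and the two-value feature vector are all assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Set


def context_phrases(tweet: str, max_len: int = 2) -> Set[str]:
    """Return lowercase word n-grams (n <= max_len) as stand-in context phrases.

    Assumption: the paper's own phrase extraction is not described here,
    so plain word n-grams are used as a placeholder.
    """
    words = [w.strip(".,!?:;\"'()#@").lower() for w in tweet.split()]
    words = [w for w in words if w]
    phrases: Set[str] = set()
    for n in range(1, max_len + 1):
        for i in range(len(words) - n + 1):
            phrases.add(" ".join(words[i:i + n]))
    return phrases


def graph_features(tweet: str, in_links: Set[str], out_links: Set[str]) -> List[float]:
    """Hypothetical feature vector: the fraction of context phrases that match
    in-link titles and the fraction that match out-link titles."""
    phrases = context_phrases(tweet)
    if not phrases:
        return [0.0, 0.0]
    in_hits = len(phrases & in_links)
    out_hits = len(phrases & out_links)
    return [in_hits / len(phrases), out_hits / len(phrases)]


# Toy link sets for an entity; real ones would come from a Wikipedia dump.
in_links = {"iphone", "steve jobs"}
out_links = {"cupertino", "iphone"}
features = graph_features("New iPhone from Apple", in_links, out_links)
```

Vectors of this kind, computed for labelled tweets, could then be passed to any SVM implementation together with the term-specificity and term-collocation features the description mentions.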

Total citations

Cited by 2

2014: 1, 2015: 1
