Authors
Maike Erdmann, Erik Ward, Kazushi Ikeda, Gen Hattori, Chihiro Ono, Yasuhiro Takishima
Publication date
2013/9/1
Conference
2013 International Conference on Social Computing
Pages
796-802
Publisher
IEEE
Description
Twitter is a popular medium for sharing opinions on TV programs, and the analysis of TV related tweets is attracting a lot of interest. However, when collecting all tweets containing a given TV program title, we obtain a large number of unrelated tweets, due to the fact that many of the TV program titles are ambiguous. Using supervised learning, TV related tweets can be collected with high accuracy. The goal of our proposed method is to automate the labeling process, in order to eliminate the cost required for data labeling without sacrificing classification accuracy. When creating the training data, we use only tweets of unambiguous TV program titles. In order to decide whether a TV program title is ambiguous, we automatically determine whether it can be used as a common expression or named entity. In two experiments, in which we collected tweets for 32 ambiguous TV program titles, we achieved the same (78.2 …
Total citations
20152016201720182019202020212022202312131
Scholar articles
M Erdmann, E Ward, K Ikeda, G Hattori, C Ono… - 2013 International Conference on Social Computing, 2013