Authors
Wenbo Wang, Lu Chen, Keke Chen, Krishnaprasad Thirunarayan, Amit P Sheth
Publication date
2017/8/23
Book
Proceedings of the International Conference on Web Intelligence
Pages
525-532
Description
This paper exploits a large number of self-labeled emotion tweets as the training data from the source domain to improve emotion identification in target domains (i.e., blogs and fairy tales), where there is a short supply of labeled data. Due to the noisy and ambiguous nature of self-labeled emotion training data, the existing domain adaptation methods that typically depend on high-quality labeled source-domain data do not work satisfactorily. This paper describes an adaptive source-domain training instance selection method to address the problem of noisy source-domain training data. The proposed approach can effectively identify the most informative training examples based on three carefully designed measures: consistency, diversity, and similarity. It uses an iterative method that consists of the following steps in each iteration: selecting informative samples from the source domain with the informativeness …
Total citations
201920202021202220232112
Scholar articles
W Wang, L Chen, K Chen, K Thirunarayan, AP Sheth - Proceedings of the International Conference on Web …, 2017