Authors
Fuzhen Zhuang, Ping Luo, Hui Xiong, Qing He, Yuhong Xiong, Zhongzhi Shi
Publication date
2011/2
Journal
Statistical Analysis and Data Mining: The ASA Data Science Journal
Volume
4
Issue
1
Pages
100-114
Publisher
Wiley Subscription Services, Inc., A Wiley Company
Description
Cross‐domain text categorization targets on adapting the knowledge learnt from a labeled source domain to an unlabeled target domain, where the documents from the source and target domains are drawn from different distributions. However, in spite of the different distributions in raw‐word features, the associations between word clusters (conceptual features) and document classes may remain stable across different domains. In this paper, we exploit these unchanged associations as the bridge of knowledge transformation from the source domain to the target domain by the non‐negative matrix tri‐factorization. Specifically, we formulate a joint optimization framework of the two matrix tri‐factorizations for the source‐ and target‐domain data, respectively, in which the associations between word clusters and document classes are shared between them. Then, we give an iterative algorithm for this optimization and …
Total citations
20102011201220132014201520162017201820192020202120222023202437611910148479410116
Scholar articles
F Zhuang, P Luo, H Xiong, Q He, Y Xiong, Z Shi - Statistical Analysis and Data Mining: The ASA Data …, 2011