Authors
Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen
Publication date
2006/12/18
Conference
Data Mining, 2006. ICDM'06. Sixth International Conference on
Pages
552-561
Publisher
IEEE
Description
The rapid growth of blog (also known as "weblog") data provides a rich resource for social community mining. In this paper, we put forward a novel research problem: mining the latent friends of bloggers based on the contents of their blog entries. Latent friends are defined in this paper as people who share a similar topic distribution in their blogs. These people may not actually know each other, but they have the interest and potential to find each other. Three approaches are designed for latent friend detection. The first, called the cosine similarity-based method, determines the similarity between bloggers by calculating the cosine similarity between the contents of their blogs. The second, known as the topic-based method, discovers latent topics using a latent topic model and then calculates the similarity at the topic level. The third one is two-level similarity-based, which is conducted …
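The first approach in the description, cosine similarity over blog contents, can be illustrated with a minimal sketch. This is not the authors' implementation: the whitespace tokenizer, the use of raw term counts as weights, the function names (`term_counts`, `cosine_similarity`, `rank_latent_friends`), and the sample blog texts are all assumptions made for illustration.

```python
import math
from collections import Counter


def term_counts(text):
    """Count lowercase whitespace-separated terms (a deliberately naive tokenizer, assumed here)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty blog is treated as similar to nothing
    return dot / (norm_a * norm_b)


def rank_latent_friends(blogs, target):
    """Rank every other blogger by content similarity to `target`, most similar first."""
    target_vec = term_counts(blogs[target])
    scores = [
        (other, cosine_similarity(target_vec, term_counts(text)))
        for other, text in blogs.items()
        if other != target
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical data: one concatenated string of blog entries per blogger.
blogs = {
    "alice": "hiking trails and mountain photography",
    "bob": "mountain hiking gear reviews",
    "carol": "sourdough baking recipes",
}
ranked = rank_latent_friends(blogs, "alice")
```

In this toy data, `bob` shares two terms with `alice` and so ranks ahead of `carol`, who shares none. The topic-based method described above would replace the term-count vectors with per-blogger topic distributions before comparing them.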
Total citations
[Chart: citations per year, 2007 to 2021]