Authors
Konstantin Avrachenkov, Alexey Mishenin, Paulo Gonçalves, Marina Sokol
Publication date
2012/4/26
Book
Proceedings of the 2012 SIAM International Conference on Data Mining
Pages
966-974
Publisher
Society for Industrial and Applied Mathematics
Description
We develop a generalized optimization framework for graph-based semi-supervised learning. The framework gives as particular cases the Standard Laplacian, Normalized Laplacian and PageRank based methods. We have also provided new probabilistic interpretation based on random walks and characterized the limiting behaviour of the methods. The random walk based interpretation allows us to explain differences between the performances of methods with different smoothing kernels. It appears that the PageRank based method is robust with respect to the choice of the regularization parameter and the labelled data. We illustrate our theoretical results with two realistic datasets, characterizing different challenges: Les Miserables characters social network and Wikipedia hyper-link graph. The graph-based semi-supervised learning classifies the Wikipedia articles with very good precision and perfect recall …
Total citations
2012201320142015201620172018201920202021202220232024127631031095731
Scholar articles
K Avrachenkov, A Mishenin, P Gonçalves, M Sokol - Proceedings of the 2012 SIAM International …, 2012