Authors
Sanjeev Arora, Rong Ge, Yonatan Halpern, David Mimno, Ankur Moitra, David Sontag, Yichen Wu, Michael Zhu
Publication date
2013/5/13
Conference
International conference on machine learning
Pages
280-288
Publisher
PMLR
Description
Topic models provide a useful method for dimensionality reduction and exploratory data analysis in large text corpora. Most approaches to topic model learning have been based on a maximum likelihood objective. Efficient algorithms exist that attempt to approximate this objective, but they have no provable guarantees. Recently, algorithms have been introduced that provide provable bounds, but these algorithms are not practical because they are inefficient and not robust to violations of model assumptions. In this paper we present an algorithm for learning topic models that is both provable and practical. The algorithm produces results comparable to the best MCMC implementations while running orders of magnitude faster.
Total citations
20122013201420152016201720182019202020212022202320243103544485360775556444816
Scholar articles
S Arora, R Ge, Y Halpern, D Mimno, A Moitra, D Sontag… - International conference on machine learning, 2013