Authors
Michela Bacchin, Nicola Ferro, Massimo Melucci
Publication date
2005
Journal
Information processing & management
Volume
41
Issue
1
Pages
121-137
Description
In this paper we will present a language-independent probabilistic model which can automatically generate stemmers. Stemmers can improve the retrieval effectiveness of information retrieval systems, however the designing and the implementation of stemmers requires a laborious amount of effort due to the fact that documents and queries are often written or spoken in several different languages. The probabilistic model proposed in this paper aims at the development of stemmers used for several languages. The proposed model describes the mutual reinforcement relationship between stems and derivations and then provides a probabilistic interpretation. A series of experiments shows that the stemmers generated by the probabilistic model are as effective as the ones based on linguistic knowledge.
Total citations
200420052006200720082009201020112012201320142015201620172018201920202021202220232024341954274625444513112
Scholar articles
M Bacchin, N Ferro, M Melucci - Information processing & management, 2005