Authors
Eiman Tamah Al-Shammari
Publication date
2010/11/29
Conference
2010 10th International Conference on Intelligent Systems Design and Applications
Pages
385-390
Publisher
IEEE
Description
Stemming is a fundamental step in processing textual data, preceding the tasks of text mining, information retrieval (IR), and natural language processing (NLP). The common goal of stemming is to standardize words by reducing each word to its base form (root or stem), so it can also be considered a feature reduction technique. This paper presents a new dictionary-free, content-based Arabic stemmer and adopts it as a feature reduction (selection) mechanism to study its contribution to improving Arabic text categorization. We employed three stemming mechanisms (root-based, light, and our stemming technique) and assessed their performance in text classification exercises on an Arabic corpus to compare and contrast the text mining effectiveness of these Arabic stemming algorithms. The experiments were conducted on a corpus consisting of 2,966 Arabic documents that fall into three categories: cultural, social …
Total citations
[Chart: citations per year, 2011-2023]
Scholar articles
ET Al-Shammari - 2010 10th International Conference on Intelligent …, 2010