Authors
Rehab M Duwairi
Publication date
2005/6
Conference
DMIN
Pages
187-192
Description
A distance-based classifier for Arabic text categorization was proposed. The classifier, in its learning phase, scans the set of training documents once to extract features of categories that capture inherent category-specific properties; while in its testing phase the classifier uses category-specific features to categorize unclassified documents. Stemming was used to reduce the dimensionality of feature vectors. The accuracy of the classifier was tested by carrying out several categorization tasks on an in-house collected Arabic corpus. The results show that the proposed classifier is very accurate and robust.
Total citations
20062007200820092010201120122013201420152016201720182019202020212022117664125431412