The Automated VSMs to Categorize Arabic Text Data Sets

Authors

  • Mamoun Suleiman Al Rababaa Al albayt University, Mafraq
  • Essam Said Hanandeh Zarqa University, Zarqa

DOI:

https://doi.org/10.24297/ijct.v13i1.2925

Keywords:

Arabic data sets, Data mining, Text categorisation, Term weighting, VSM.

Abstract

Text Categorization is one of the most important tasks in information retrieval and data mining. This paper aims at investigating different variations of vector space models (VSMs) using KNN algorithm. we used 242 Arabic abstract documents that were used by (Hmeidi & Kanaan, 1997). The bases of our comparison are the most popular text evaluation measures; we use Recall measure, Precision measure, and F1 measure. The Experimental results against the Saudi data sets reveal that Cosine outperformed over of the Dice and Jaccard coefficients.

Downloads

Download data is not yet available.

Downloads

Published

2014-03-31

Issue

Section

Research Articles

How to Cite

The Automated VSMs to Categorize Arabic Text Data Sets. (2014). INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 13(1), 4074-4081. https://doi.org/10.24297/ijct.v13i1.2925

Similar Articles

1-10 of 163

You may also start an advanced similarity search for this article.