We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.IR

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Information Retrieval

Title: Arabic Text Categorization Algorithm using Vector Evaluation Method

Abstract: Text categorization is the process of grouping documents into categories based on their contents. This process is important to make information retrieval easier, and it became more important due to the huge textual information available online. The main problem in text categorization is how to improve the classification accuracy. Although Arabic text categorization is a new promising field, there are a few researches in this field. This paper proposes a new method for Arabic text categorization using vector evaluation. The proposed method uses a categorized Arabic documents corpus, and then the weights of the tested document's words are calculated to determine the document keywords which will be compared with the keywords of the corpus categorizes to determine the tested document's best category.
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
Journal reference: International Journal of Computer Science & Information Technology (IJCSIT) Vol 6, No 6, December 2014
DOI: 10.5121/ijcsit.2014.6606
Cite as: arXiv:1501.01318 [cs.IR]
  (or arXiv:1501.01318v1 [cs.IR] for this version)

Submission history

From: Aymen Abu-Errub Ph.D. [view email]
[v1] Tue, 6 Jan 2015 21:10:26 GMT (138kb)

Link back to: arXiv, form interface, contact.