An Algorithm for Multi-Domain Website Classification

Mohammad Aman Ullah, Anika Tahrin, Sumaiya Marjan
Copyright: © 2020 | Volume: 15 | Issue: 4 | Pages: 9
ISSN: 1548-1093 | EISSN: 1548-1107 | EISBN13: 9781799803997 | DOI: 10.4018/IJWLTT.2020100104
Cite Article

MLA

Ullah, Mohammad Aman, et al. "An Algorithm for Multi-Domain Website Classification." IJWLTT vol.15, no.4 2020: pp.57-65. http://doi.org/10.4018/IJWLTT.2020100104

APA

Ullah, M. A., Tahrin, A., & Marjan, S. (2020). An Algorithm for Multi-Domain Website Classification. International Journal of Web-Based Learning and Teaching Technologies (IJWLTT), 15(4), 57-65. http://doi.org/10.4018/IJWLTT.2020100104

Chicago

Ullah, Mohammad Aman, Anika Tahrin, and Sumaiya Marjan. "An Algorithm for Multi-Domain Website Classification," International Journal of Web-Based Learning and Teaching Technologies (IJWLTT) 15, no. 4 (2020): 57-65. http://doi.org/10.4018/IJWLTT.2020100104

Abstract

The web is the largest worldwide communication system of computers. It hosts local, academic, commercial, and government sites. As the number and variety of websites grow, manual classification has become too costly and too inaccurate to meet increasing internet service demands, so automated classification has become important for better and more accurate search engine results. This research therefore proposes an algorithm that classifies websites automatically using textual data collected at random from their webpages. It also contributes ten dictionaries covering different domains, which serve as training data in the classification process. Classification was carried out with both the proposed algorithm and the Naïve Bayes algorithm, and the proposed algorithm outperformed Naïve Bayes in accuracy by 1.25%. The results suggest that the proposed algorithm could be applied to any number of domains, provided the related dictionaries are available.
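
The abstract does not spell out the steps of the proposed algorithm, so the following is only a minimal sketch of the general idea of dictionary-based classification, not the authors' method. It assumes that each domain is represented by a set of keywords and that a page is assigned to the domain whose keywords occur most often in its text. The three toy dictionaries, the `tokenize` helper, and the `classify` function are all hypothetical and stand in for the ten dictionaries contributed by the paper.

```python
import re
from collections import Counter

# Hypothetical toy dictionaries for illustration only; the paper's ten
# domain dictionaries are not reproduced here.
DOMAIN_DICTIONARIES = {
    "academic": {"university", "course", "research", "student", "faculty"},
    "commercial": {"price", "cart", "shop", "discount", "order"},
    "government": {"ministry", "policy", "citizen", "public", "regulation"},
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def classify(text, dictionaries=DOMAIN_DICTIONARIES):
    """Return (best_domain, scores) for a page's text.

    Each domain's score is the number of tokens in the text that appear
    in that domain's dictionary. If no dictionary word occurs at all,
    best_domain is None. This scoring rule is an assumption, not the
    rule described in the paper.
    """
    counts = Counter(tokenize(text))
    scores = {
        domain: sum(counts[word] for word in words)
        for domain, words in dictionaries.items()
    }
    best = max(scores, key=scores.get)
    return (best if scores[best] > 0 else None), scores


if __name__ == "__main__":
    page_text = "The university offers research courses for every student"
    domain, scores = classify(page_text)
    print(domain, scores)
```

Under this assumed scoring rule, supporting a new domain only requires adding another entry to the dictionary mapping, which matches the abstract's remark that the approach could extend to any number of domains for which dictionaries exist.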