Proximate Breast Cancer Factors Using Data Mining Classification Techniques

Proximate Breast Cancer Factors Using Data Mining Classification Techniques

Alice Constance Mensah, Isaac Ofori Asare
ISBN13: 9781668471364|ISBN10: 1668471361|EISBN13: 9781668471371
DOI: 10.4018/978-1-6684-7136-4.ch013
Cite Chapter Cite Chapter

MLA

Mensah, Alice Constance, and Isaac Ofori Asare. "Proximate Breast Cancer Factors Using Data Mining Classification Techniques." Research Anthology on Medical Informatics in Breast and Cervical Cancer, edited by Information Resources Management Association, IGI Global, 2023, pp. 236-246. https://doi.org/10.4018/978-1-6684-7136-4.ch013

APA

Mensah, A. C. & Asare, I. O. (2023). Proximate Breast Cancer Factors Using Data Mining Classification Techniques. In I. Management Association (Ed.), Research Anthology on Medical Informatics in Breast and Cervical Cancer (pp. 236-246). IGI Global. https://doi.org/10.4018/978-1-6684-7136-4.ch013

Chicago

Mensah, Alice Constance, and Isaac Ofori Asare. "Proximate Breast Cancer Factors Using Data Mining Classification Techniques." In Research Anthology on Medical Informatics in Breast and Cervical Cancer, edited by Information Resources Management Association, 236-246. Hershey, PA: IGI Global, 2023. https://doi.org/10.4018/978-1-6684-7136-4.ch013

Export Reference

Mendeley
Favorite

Abstract

Breast cancer is the most common of all cancers and is the leading cause of cancer deaths in women worldwide. The classification of breast cancer data can be useful to predict the outcome of some diseases or discover the genetic behavior of tumors. Data mining technology helps in classifying cancer patients and this technique helps to identify potential cancer patients by simply analyzing the data. This study examines the determinant factors of breast cancer and measures the breast cancer patient data to build a useful classification model using a data mining approach. In this study of 2397 women, 1022 (42.64%) were diagnosed with breast cancer. Among the four main learning techniques such as: Random Forest, Naive Bayes, Classification and Regression Model (CART), and Boosted Tree model were used for the study. The Random Forest technique had the better accuracy value of 0.9892(95%CI,0.9832 -0.9935) and a sensitivity value of about 92%. This means that the Random Forest learning model is the best model to classify and predict breast cancer based on associated factors.