Predicting Seminal Quality and its Dependence on Life Style Factors Through Ensemble Learning

Satya Ranjan Dash, Ratula Ray
Copyright: © 2020 | Volume: 11 | Issue: 2 | Pages: 18
ISSN: 1947-315X | EISSN: 1947-3168 | EISBN13: 9781799806905 | DOI: 10.4018/IJEHMC.2020040105
Cite Article

MLA

Dash, Satya Ranjan, and Ratula Ray. "Predicting Seminal Quality and its Dependence on Life Style Factors Through Ensemble Learning." IJEHMC vol.11, no.2 2020: pp.78-95. http://doi.org/10.4018/IJEHMC.2020040105

APA

Dash, S. R. & Ray, R. (2020). Predicting Seminal Quality and its Dependence on Life Style Factors Through Ensemble Learning. International Journal of E-Health and Medical Communications (IJEHMC), 11(2), 78-95. http://doi.org/10.4018/IJEHMC.2020040105

Chicago

Dash, Satya Ranjan, and Ratula Ray. "Predicting Seminal Quality and its Dependence on Life Style Factors Through Ensemble Learning," International Journal of E-Health and Medical Communications (IJEHMC) 11, no.2: 78-95. http://doi.org/10.4018/IJEHMC.2020040105

Abstract

Fertility awareness is of great importance given changing lifestyle habits. Semen analysis is a reliable confirmatory test of male fertility. The supervised base classifiers evaluated are Decision Tree, Logistic Regression, and Naive Bayes, of which Logistic Regression shows a promising accuracy of 88%. Applying the bagging ensemble method to the weakest classifier raises its accuracy from 78.80% to 90.02%. The authors have also attempted to design a novel voting classifier that votes over the ensemble learners, creating a more complex model with an accuracy of 89%. In addition, the authors have analyzed the receiver operating characteristic (ROC) curve for the Extra Tree classifier, which shows an area under the curve (AUC) of 66%. The validation procedure used is 5-fold cross-validation. The authors have further analyzed the lifestyle habits contributing to this problem using impurity-based feature selection and have identified 'Age' as the most crucial factor in declining seminal quality.
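
The workflow summarized in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: it assumes scikit-learn, uses synthetic stand-in data from `make_classification` rather than the study's dataset, and guesses at details the abstract does not state (which classifier is bagged, which ensemble learners the voting classifier combines, and all hyperparameters). The numbers it produces are therefore not comparable to the reported accuracies.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier, ExtraTreesClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): NOT the dataset used in the article.
X, y = make_classification(
    n_samples=200, n_features=9, n_informative=4,
    weights=[0.8, 0.2], random_state=0,
)

# The three base classifiers named in the abstract.
models = {
    "decision_tree": DecisionTreeClassifier(random_state=0),
    "logistic_regression": LogisticRegression(max_iter=1000),
    "naive_bayes": GaussianNB(),
}

# Bagging over one base learner. Assumption: a decision tree is bagged here;
# the abstract only says "the weakest classifier".
bagged = BaggingClassifier(
    DecisionTreeClassifier(random_state=0), n_estimators=50, random_state=0
)
extra_trees = ExtraTreesClassifier(n_estimators=100, random_state=0)

# Voting over ensemble learners. Assumption: the members and soft voting
# are illustrative choices, not taken from the article.
voter = VotingClassifier(
    estimators=[("bagging", bagged), ("extra_trees", extra_trees)],
    voting="soft",
)
models.update({"bagging": bagged, "extra_trees": extra_trees, "voting": voter})

# 5-fold cross-validated accuracy for every model.
accuracy = {
    name: cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()
    for name, model in models.items()
}

# Cross-validated ROC AUC for the Extra Trees classifier.
extra_trees_auc = cross_val_score(extra_trees, X, y, cv=5, scoring="roc_auc").mean()

# Impurity-based feature importances from the fitted Extra Trees model.
importances = extra_trees.fit(X, y).feature_importances_

for name, score in accuracy.items():
    print(f"{name}: accuracy={score:.3f}")
print(f"extra_trees: roc_auc={extra_trees_auc:.3f}")
print("feature ranking (most important first):", importances.argsort()[::-1])
```

With real data, the column whose importance ranks first would play the role that 'Age' plays in the article; on the synthetic data above the ranking carries no meaning.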