RESEARCH PAPER
PREDICTION OF TYPE 2 DIABETES MELLITUS USING FEATURE SELECTION-BASED
MACHINE LEARNING ALGORITHMS
More details
Hide details
1
Department of Computer Engineering, Beykent University, Istanbul, Turkey
Submission date: 2021-12-09
Final revision date: 2022-03-02
Acceptance date: 2022-03-14
Publication date: 2022-06-30
Health Prob Civil. 2022;16(2):128-139
KEYWORDS
TOPICS
ABSTRACT
Background:
The aim of this study is to develop and evaluate a machine learning model for the
early diagnosis of type 2 diabetes to allow for treatments to be applied in the early stages of the
disease.
Material and methods:
A proposed hybrid machine learning model was developed and applied
to the Early-stage diabetes risk prediction dataset from the UCI database. The prediction success
of the proposed model was compared with other machine learning models. Pearson’s correlation
and SelectKBest feature selection methods were employed to examine the relationships between
the dataset input parameters and the results.
Results:
Of the 520 patients included in the dataset, 320 were diagnosed with diabetes and
328 (63.08%) were males. The most commonly observed diabetes diagnosis criterion was
obesity (n=482, 83.08%). While the strongest feature detected with Pearson’s correlation was
polyuria, the strongest feature detected with SelectKBest was polydipsia. With Pearson’s feature
extraction, the most successful machine learning method was the proposed hybrid method, with
an accuracy of 97.28%. Using SelectKBest feature selection, the same model was able to predict
type 2 diabetes with accuracy of 95.16%.
Conclusions:
Early detection of type 2 diabetes will allow for a prompter and more effective
treatment of the patient. Thus, use of the proposed model may help to improve the quality of
patient care and lower the number of deaths caused by this disease.