Comparison of machine learning and logistic regression as predictive models for adverse maternal and neonatal outcomes of preeclampsia : A retrospective study
Zheng, D., Hao, X., Khan, M., Wang, L., Li, F., Xiang, N., Kang, F., Hamalainen, T., Cong, F., Song, K., & Qiao, C. (2022). Comparison of machine learning and logistic regression as predictive models for adverse maternal and neonatal outcomes of preeclampsia : A retrospective study. Frontiers in Cardiovascular Medicine, 9, Article 959649. https://doi.org/10.3389/fcvm.2022.959649
Published inFrontiers in Cardiovascular Medicine
Li, Fan |
DisciplineTietotekniikkaTekniikkaSecure Communications Engineering and Signal ProcessingMathematical Information TechnologyEngineeringSecure Communications Engineering and Signal Processing
© 2022 Zheng, Hao, Khan, Wang, Li, Xiang, Kang, Hamalainen, Cong, Song and Qiao.
Introduction: Preeclampsia, one of the leading causes of maternal and fetal morbidity and mortality, demands accurate predictive models for the lack of effective treatment. Predictive models based on machine learning algorithms demonstrate promising potential, while there is a controversial discussion about whether machine learning methods should be recommended preferably, compared to traditional statistical models. Methods: We employed both logistic regression and six machine learning methods as binary predictive models for a dataset containing 733 women diagnosed with preeclampsia. Participants were grouped by four different pregnancy outcomes. After the imputation of missing values, statistical description and comparison were conducted preliminarily to explore the characteristics of documented 73 variables. Sequentially, correlation analysis and feature selection were performed as preprocessing steps to ﬁlter contributing variables for developing models. The models were evaluated by multiple criteria. Results: We ﬁrst ﬁgured out that the inﬂuential variables screened by preprocessing steps did not overlap with those determined by statistical differences. Secondly, the most accurate imputation method is K-Nearest Neighbor, and the imputation process did not affect the performance of the developed models much. Finally, the performance of models was investigated. The random forest classiﬁer, multi-layer perceptron, and support vector machine demonstrated better discriminative power for prediction evaluated by the area under the receiver operating characteristic curve, while the decision tree classiﬁer, random forest, and logistic regression yielded better calibration ability veriﬁed, as by the calibration curve. Conclusion: Machine learning algorithms can accomplish prediction modeling and demonstrate superior discrimination, while Logistic Regression can be calibrated well. Statistical analysis and machine learning are two scientiﬁc domains sharing similar themes. The predictive abilities of such developed models vary according to the characteristics of datasets, which still need larger sample sizes and more inﬂuential predictors to accumulate evidence. ...
PublisherFrontiers Media SA
Publication in research information system
MetadataShow full item record
Showing items with similar title or keywords.
Comparing the forecasting performance of logistic regression and random forest models in criminal recidivism Aaltonen, Olli-Pekka (2016)Rikosseuraamusalalla on viime vuosina kehitetty uusintarikollisuutta ennustavia malleja (Tyni, 2015), jotka perustuvat tyypillisesti rekisteripohjaisiin mittareihin, jotka mittaavat mm. tuomitun sukupuolta, ikää, rikostaustaa ...
Comparison of three ordinal logistic regression methods for predicting person’s self-assessed health status with functional, haemodynamic covariates Markkanen, Merri-Lotta (2023)Lääketieteen parissa perinteiset kyselytutkimukset ovat yhä suosittuja, jonka myötä myös järjestysasteikollisten muuttujien analyysia suoritetaan paljon. Modernin teknologian kehittyminen näkyy kuitenkin myös tällä ...
Ivannikova, Elena (University of Jyväskylä, 2017)The subject of this thesis belongs to the topic of machine learning or, speciﬁcally, to the development of advanced methods for regression analysis, clustering, and anomaly detection. Industry is constantly seeking ...
Linja, Joakim; Hämäläinen, Joonas; Nieminen, Paavo; Kärkkäinen, Tommi (MDPI AG, 2020)Minimal Learning Machine (MLM) is a recently popularized supervised learning method, which is composed of distance-regression and multilateration steps. The computational complexity of MLM is dominated by the solution of ...
Needs and preferences of women with prior severe preeclampsia regarding app-based cardiovascular health promotion Kókai, Lili L.; van der Bijl, Marte F.; Hagger, Martin S.; Ó Ceallaigh, Diarmaid T.; Rohde, Kirsten I.M.; van Kippersluis, Hans; Burdorf, Alex; Duvekot, Johannes J.; Roeters van Lennep, Jeanine E.; Wijtzes, Anne I. (Biomed Central, 2022)Background Women with prior severe preeclampsia are at an increased risk for cardiovascular diseases later in life compared to women who had a normotensive pregnancy. The objective of this study was to assess their needs ...