Effect of variable selection strategy on the predictive models for adverse pregnancy outcomes of pre-eclampsia : A retrospective study
Zheng, D., Hao, X., Khan, M., Kang, F., Li, F., Hämäläinen, T., & Wang, L. (2024). Effect of variable selection strategy on the predictive models for adverse pregnancy outcomes of pre-eclampsia : A retrospective study. Placenta and Reproductive Medicine, 3, Article 1. https://doi.org/10.54844/prm.2024.0318
Published in
Placenta and Reproductive Medicine
Authors
Li, Fan
Date
2024
Copyright
© 2024 Placenta and Reproductive Medicine
Objectives: Improving the prediction of adverse pregnancy outcomes is essential for women with pre-eclampsia, and assembling a set of predictive indicators is the prerequisite. The traditional knowledge-based strategy for variable selection is challenged by datasets containing high-dimensional or unfamiliar data. In this study, we employed five automatic variable selection methods to screen for influential indicators and evaluated the performance of the resulting predictive models.
Methods: Seven hundred and thirty-three Han Chinese women were enrolled, and 56 clinical and laboratory variables were recorded. After grouping by binary pregnancy outcome, statistical description and analysis were performed. Then, with forward stepwise logistic regression (FSLR) as the reference method, four other variable selection strategies were each used to filter contributing variables into a predictive subset. Finally, logistic regression prediction models were constructed from the five subsets and evaluated with the receiver operating characteristic (ROC) curve.
Results: The variables that differed significantly between the adverse and satisfactory outcome groups did not overlap with the variables chosen by the selection strategies. “Platelet” and “Creatinine clearance rate” were the most influential indicators for predicting adverse maternal outcome, while “Birth weight of neonates” was the best indicator for predicting adverse neonatal outcome. On average, the predictive models for neonatal outcomes performed better than the models for maternal outcomes. “Mutual information” and “Recursive feature elimination” were the best strategies for the current dataset and study design.
Conclusions: Variable selection strategies may provide an alternative to picking influential indicators by statistical significance. Future work will focus on applying different variable selection methods to high-dimensional datasets that include novel or unfamiliar variables, with the aim of identifying the most appropriate set of predictors to enhance predictive ability and clinical decision-making.
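The abstract names mutual information as a selection strategy and the ROC curve as the evaluation tool without showing either. The following is a minimal Python sketch of both ideas, not the authors' implementation: it assumes discrete (here binary) variables, uses a plain plug-in estimate of mutual information, and computes the area under the ROC curve by pairwise comparison. The variable names (`low_platelets`, `unrelated_flag`) and all data values are hypothetical and made up for illustration. For brevity it scores a raw variable directly, whereas the study fits a logistic regression on each selected subset.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Plug-in mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    px = Counter(xs)            # marginal counts of the variable
    py = Counter(ys)            # marginal counts of the outcome
    pxy = Counter(zip(xs, ys))  # joint counts
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x, y) * log( p(x, y) / (p(x) * p(y)) ), written with counts
        mi += (c / n) * math.log(c * n / (px[x] * py[y]))
    return mi


def rank_by_mutual_information(features, outcome):
    """Return (name, score) pairs sorted from most to least informative."""
    scores = {name: mutual_information(col, outcome)
              for name, col in features.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


def roc_auc(scores, labels):
    """AUC as the probability that a random positive case outscores
    a random negative case (ties count as one half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0
               for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = adverse outcome, 0 = satisfactory outcome.
outcome = [1, 1, 1, 1, 0, 0, 0, 0]
features = {
    "low_platelets": [1, 1, 1, 0, 0, 0, 0, 0],   # made-up, related to outcome
    "unrelated_flag": [1, 0, 1, 0, 1, 0, 1, 0],  # made-up, independent of outcome
}

ranking = rank_by_mutual_information(features, outcome)
auc = roc_auc(features["low_platelets"], outcome)
```

In this toy setting the variable that tracks the outcome ranks first and the independent one scores zero. Continuous laboratory values, such as those in the study, would need a different estimator or prior discretization.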
...
Publisher
Scholar Media Publishing Company
ISSN
2790-0428
Keywords
pre-eclampsia; feature selection; variable selection; logistic regression; forward stepwise; mutual information; least absolute shrinkage and selection; recursive feature elimination; principal component analysis; machine learning; pregnancy; models (modelling); modelling; indicators; forecasts; regression analysis
Original source
https://www.hksmp.com/journals/prm/article/view/318
Publication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/207618493
Related items
Showing items with similar title or keywords.
- Comparison of machine learning and logistic regression as predictive models for adverse maternal and neonatal outcomes of preeclampsia : A retrospective study
  Zheng, Dongying; Hao, Xinyu; Khan, Muhanmmad; Wang, Lixia; Li, Fan; Xiang, Ning; Kang, Fuli; Hamalainen, Timo; Cong, Fengyu; Song, Kedong; Qiao, Chong (Frontiers Media SA, 2022)
  Introduction: Preeclampsia, one of the leading causes of maternal and fetal morbidity and mortality, demands accurate predictive models for the lack of effective treatment. Predictive models based on machine learning ...
- Comparison of three ordinal logistic regression methods for predicting person’s self-assessed health status with functional, haemodynamic covariates
  Markkanen, Merri-Lotta (2023)
  In medicine, traditional questionnaire surveys remain popular, and as a result ordinal variables are analysed frequently. However, the development of modern technology is also evident in this ...
- Predicting Children's Myopia Risk : A Monte Carlo Approach to Compare the Performance of Machine Learning Models
  Artiemjew, Piotr; Cybulski, Radosław; Emamian, Mohammad; Grzybowski, Andrzej; Jankowski, Andrzej; Lanca, Carla; Mehravaran, Shiva; Młyński, Marcin; Morawski, Cezary; Nordhausen, Klaus; Pärssinen, Olavi; Ropiak, Krzysztof (SCITEPRESS Science and Technology Publications, 2024)
  This study presents the initial results of the Myopia Risk Calculator (MRC) Consortium, introducing an innovative approach to predict myopia risk by using trustworthy machine-learning models. The dataset included approximately ...
- Eight Simple Guidelines for Improved Understanding of Transformations and Nonlinear Effects
  Rönkkö, Mikko; Aalto, Eero; Tenhunen, Henni; Aguirre-Urreta, Miguel I. (SAGE Publications, 2022)
  Transforming variables before analysis or applying a transformation as a part of a generalized linear model are common practices in organizational research. Several methodological articles addressing the topic, either ...
- Comparing the forecasting performance of logistic regression and random forest models in criminal recidivism
  Aaltonen, Olli-Pekka (2016)
  In recent years, the criminal sanctions field has developed models for predicting recidivism (Tyni, 2015), which are typically based on register-based measures of, among other things, the convicted person's sex, age, criminal history ...