Hangzhou Research Institute, Xidian University, Hangzhou 311231, China.
School of Computer Science and Technology, Xidian University, Xi'an 710071, China.
Int J Mol Sci. 2024 Oct 4;25(19):10684. doi: 10.3390/ijms251910684.
Preeclampsia is a pregnancy syndrome characterized by complex symptoms which cause maternal and fetal problems and deaths. The aim of this study is to achieve preeclampsia risk prediction and early risk prediction in Xinjiang, China, based on the placental growth factor measured using the SiMoA or Elecsys platform. A novel reliable calibration modeling method and missing data imputing method are proposed, in which different strategies are used to adapt to small samples, training data, test data, independent features, and dependent feature pairs. Multiple machine learning algorithms were applied to train models using various datasets, such as single-platform versus bi-platform data, early pregnancy versus early plus non-early pregnancy data, and real versus real plus augmented data. It was found that a combination of two types of mono-platform data could improve risk prediction performance, and non-early pregnancy data could enhance early risk prediction performance when limited early pregnancy data were available. Additionally, the inclusion of augmented data resulted in achieving a high but unstable performance. The models in this study significantly reduced the incidence of preeclampsia in the region from 7.2% to 2.0%, and the mortality rate was reduced to 0%.
子痫前期是一种妊娠综合征,其特征是复杂的症状,会导致母婴出现问题和死亡。本研究旨在基于使用 SiMoA 或 Elecsys 平台测量的胎盘生长因子,实现中国新疆子痫前期的风险预测和早期风险预测。提出了一种新的可靠校准建模方法和缺失数据插补方法,其中使用不同的策略来适应小样本、训练数据、测试数据、独立特征和相关特征对。应用多种机器学习算法,使用各种数据集(如单平台与双平台数据、早孕期与早孕期加非早孕期数据、真实数据与真实数据加扩充数据)训练模型。结果发现,两种类型的单平台数据的组合可以提高风险预测性能,并且在早期妊娠数据有限的情况下,非早孕期数据可以提高早期风险预测性能。此外,包含扩充数据会导致性能达到高但不稳定的水平。本研究中的模型显著降低了该地区子痫前期的发病率,从 7.2%降至 2.0%,死亡率降低至 0%。