International Peace Maternity and Child Health Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China.
Shanghai Key Laboratory of Embryo Original Diseases, Shanghai, China.
J Clin Endocrinol Metab. 2021 Mar 8;106(3):e1191-e1205. doi: 10.1210/clinem/dgaa899.
Accurate methods for early gestational diabetes mellitus (GDM) (during the first trimester of pregnancy) prediction in Chinese and other populations are lacking.
This work aimed to establish effective models to predict early GDM.
Pregnancy data for 73 variables during the first trimester were extracted from the electronic medical record system. Based on a machine learning (ML)-driven feature selection method, 17 variables were selected for early GDM prediction. To facilitate clinical application, 7 variables were selected from the 17-variable panel. Advanced ML approaches were then employed using the 7-variable data set and the 73-variable data set to build models predicting early GDM for different situations, respectively.
A total of 16 819 and 14 992 cases were included in the training and testing sets, respectively. Using 73 variables, the deep neural network model achieved high discriminative power, with area under the curve (AUC) values of 0.80. The 7-variable logistic regression (LR) model also achieved effective discriminate power (AUC = 0.77). Low body mass index (BMI) (≤ 17) was related to an increased risk of GDM, compared to a BMI in the range of 17 to 18 (minimum risk interval) (11.8% vs 8.7%, P = .09). Total 3,3,5'-triiodothyronine (T3) and total thyroxin (T4) were superior to free T3 and free T4 in predicting GDM. Lipoprotein(a) was demonstrated a promising predictive value (AUC = 0.66).
We employed ML models that achieved high accuracy in predicting GDM in early pregnancy. A clinically cost-effective 7-variable LR model was simultaneously developed. The relationship of GDM with thyroxine and BMI was investigated in the Chinese population.
缺乏针对中国人群和其他人群的准确方法来预测早发妊娠糖尿病(GDM)(在妊娠早期)。
本研究旨在建立预测早发 GDM 的有效模型。
从电子病历系统中提取了 73 个变量的妊娠早期数据。基于机器学习(ML)驱动的特征选择方法,选择了 17 个变量用于早期 GDM 预测。为便于临床应用,从 17 变量组中选择了 7 个变量。然后,使用这 7 个变量数据集和 73 个变量数据集,分别采用先进的 ML 方法,为不同情况构建预测早发 GDM 的模型。
训练集和测试集分别纳入了 16819 例和 14992 例患者。使用 73 个变量,深度神经网络模型具有较高的判别能力,曲线下面积(AUC)值为 0.80。7 变量逻辑回归(LR)模型也具有有效的判别能力(AUC=0.77)。与 BMI 范围在 17 到 18 之间(最小风险间隔)相比,低 BMI(≤17)与 GDM 风险增加相关(11.8%比 8.7%,P=0.09)。总 3,3,5'-三碘甲状腺原氨酸(T3)和总甲状腺素(T4)预测 GDM 的效果优于游离 T3 和游离 T4。脂蛋白(a)显示出有前途的预测价值(AUC=0.66)。
我们使用 ML 模型成功地预测了妊娠早期的 GDM,准确率较高。同时还开发了一种具有临床成本效益的 7 变量 LR 模型。本研究在中国人群中探讨了 GDM 与甲状腺素和 BMI 的关系。