Department of Obstetrics and Gynecology, Meir Medical Center, 59 Tchernichovsky St, Kfar Saba, Israel.
Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel.
Arch Gynecol Obstet. 2021 Sep;304(3):641-647. doi: 10.1007/s00404-021-05994-z. Epub 2021 Feb 19.
Applying machine-learning models to clinical and laboratory features of women with intrahepatic cholestasis of pregnancy (ICP) and creating algorithm to identify these patients without bile acid measurements.
This retrospective study included 336 pregnant women with a chief complaint of pruritis without rash during the second/third trimesters. Data extracted included: demographics, obstetric, clinical and laboratory features. The primary outcome was an elevated bile acid measurement ≥ 10 µmol/L, regardless of liver enzyme levels. We used different machine-learning models and statistical regression to predict elevated bile acid levels.
Among 336 women who complained about pruritis, 167 had bile acids ≥ 10 µmol/L and 169 had normal levels. Women with elevated bile acids were older than those with normal levels (p = 0.001), higher parity (p = 0.001), and higher glutamic oxaloacetic transaminase ( GOT) (p = 0.001) and glutamic-pyruvic transaminase (GPT) levels (p = 0.001). Using machine-learning models, the XGB Classifier model was the most accurate (area under the curve (AUC), 0.9) followed by the K-neighbors model (AUC, 0.86); and then the Support Vector Classification (SVC) model (AUC, 0.82). The model with the lowest predicative ability was the logistic regression (AUC, 0.72). The maximum sensitivity of the XGB model was 86% and specificity 75%. The best predictive parameters of the XGB model were elevated GOT (Importance 0.17), elevated GPT (Importance 0.16), family history of bile disease (0.16) and previous pregnancy with ICP (0.13).
Machine-learning models using clinical data may predict ICP more accurately than logistic regression does. Using detection algorithms derived from these techniques may improve identification of ICP, especially when bile acid testing is not available.
应用机器学习模型分析妊娠肝内胆汁淤积症(ICP)患者的临床和实验室特征,并建立一种无需检测胆汁酸即可识别此类患者的算法。
本回顾性研究纳入了 336 名主诉在妊娠中晚期出现无皮疹瘙痒的孕妇。提取的数据包括:人口统计学、产科、临床和实验室特征。主要结局是胆汁酸水平升高(≥10 μmol/L),无论肝酶水平如何。我们使用不同的机器学习模型和统计回归来预测胆汁酸水平升高。
在 336 名主诉瘙痒的孕妇中,有 167 名胆汁酸水平升高(≥10 μmol/L),169 名胆汁酸水平正常。胆汁酸升高组的孕妇年龄大于胆汁酸正常组(p = 0.001),产次较高(p = 0.001),谷草转氨酶(GOT)(p = 0.001)和谷丙转氨酶(GPT)水平较高(p = 0.001)。使用机器学习模型,XGB 分类器模型的准确性最高(曲线下面积(AUC),0.9),其次是 K-近邻模型(AUC,0.86);然后是支持向量分类(SVC)模型(AUC,0.82)。预测能力最低的模型是逻辑回归(AUC,0.72)。XGB 模型的最高敏感性为 86%,特异性为 75%。XGB 模型的最佳预测参数是 GOT 升高(重要性 0.17)、GPT 升高(重要性 0.16)、胆汁疾病家族史(0.16)和既往妊娠 ICP(0.13)。
使用临床数据的机器学习模型可能比逻辑回归更准确地预测 ICP。使用从这些技术中衍生出的检测算法可能会提高 ICP 的识别能力,尤其是在无法检测胆汁酸时。