Tong Feng, Shahid Muhammad, Jin Peng, Jung Sungyong, Kim Won Hwa, Kim Jayoung
Department of Computer Science, University of Texas at Arlington, Arlington TX 76019, USA.
Departments of Surgery and Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA.
Bladder (San Franc). 2020 Jun 2;7(2):e43. doi: 10.14440/bladder.2020.815. eCollection 2020.
With the advent of artificial intelligence (AI) in biostatistical analysis and modeling, machine learning can potentially be applied into developing diagnostic models for interstitial cystitis (IC). In the current clinical setting, urologists are dependent on cystoscopy and questionnaire-based decisions to diagnose IC. This is a result of a lack of objective diagnostic molecular biomarkers. The purpose of this study was to develop a machine learning-based method for diagnosing IC and assess its performance using metabolomics profiles obtained from a prior study. To develop the machine learning algorithm, two classification methods, support vector machine (SVM) and logistic regression (LR), set at various parameters, were applied to 43 IC patients and 16 healthy controls. There were 3 measures used in this study, accuracy, precision (positive predictive value), and recall (sensitivity). Individual precision and recall (PR) curves were drafted. Since the sample size was relatively small, complicated deep learning could not be done. We achieved a 76%-86% accuracy with leave-one-out cross validation depending on the method and parameters set. The highest accuracy achieved was 86.4% using SVM with a polynomial kernel degree set to 5, but a larger area under the curve (AUC) from the PR curve was achieved using LR with a -norm regularizer. The AUC was greater than 0.9 in its ability to discriminate IC patients from controls, suggesting that the algorithm works well in identifying IC, even when there is a class distribution imbalance between the IC and control samples. This finding provides further insight into utilizing previously identified urinary metabolic biomarkers in developing machine learning algorithms that can be applied in the clinical setting.
随着人工智能(AI)在生物统计分析和建模中的出现,机器学习有可能应用于开发间质性膀胱炎(IC)的诊断模型。在当前的临床环境中,泌尿科医生依靠膀胱镜检查和基于问卷的决策来诊断IC。这是由于缺乏客观的诊断分子生物标志物所致。本研究的目的是开发一种基于机器学习的IC诊断方法,并使用先前研究获得的代谢组学谱评估其性能。为了开发机器学习算法,将设置了各种参数的两种分类方法,即支持向量机(SVM)和逻辑回归(LR),应用于43例IC患者和16例健康对照。本研究使用了三种测量方法,准确性、精确率(阳性预测值)和召回率(敏感性)。绘制了个体精确率和召回率(PR)曲线。由于样本量相对较小,无法进行复杂的深度学习。根据设置的方法和参数,通过留一法交叉验证,我们实现了76%-86%的准确率。使用多项式核度设置为5的SVM实现的最高准确率为86.4%,但使用具有 -范数正则化器的LR从PR曲线获得的曲线下面积(AUC)更大。AUC在区分IC患者和对照方面的能力大于0.9,这表明该算法在识别IC方面表现良好,即使IC和对照样本之间存在类分布不平衡。这一发现为利用先前确定的尿液代谢生物标志物开发可应用于临床环境的机器学习算法提供了进一步的见解。