Axia Women's Health, Cincinnati, OH.
Aspira Women's Health, Trumbull, CT.
JCO Clin Cancer Inform. 2022 Jun;6:e2100192. doi: 10.1200/CCI.21.00192.
Early detection of ovarian cancer, the deadliest gynecologic cancer, is crucial for reducing mortality. Current noninvasive risk assessment measures include protein biomarkers in combination with other clinical factors, which vary in their accuracy. Machine learning can be applied to optimizing the combination of these features, leading to more accurate assessment of malignancy. However, the low prevalence of the disease can make rigorous validation of these tests challenging and can result in unbalanced performance.
MIA3G is a deep feedforward neural network for ovarian cancer risk assessment, using seven protein biomarkers along with age and menopausal status as input features. The algorithm was developed on a heterogenous data set of 1,067 serum specimens from women with adnexal masses (prevalence = 31.8%). It was subsequently validated on a cohort almost twice that size (N = 2,000).
In the analytical validation data set (prevalence = 4.9%), MIA3G demonstrated a sensitivity of 89.8% and a specificity of 84.02%. The positive predictive value was 22.45%, and the negative predictive value was 99.38%. When stratified by cancer type and stage, MIA3G achieved sensitivities of 94.94% for epithelial ovarian cancer, 76.92% for early-stage cancer, and 98.04% for late-stage cancer.
The balanced performance of MIA3G leads to a high sensitivity and high specificity, a combination that may be clinically useful for providers in evaluating the appropriate management strategy for their patients. Limitations of this work include the largely retrospective nature of the data set and the unequal, albeit random, assignment of histologic subtypes between the training and validation data sets. Future directions may include the addition of new biomarkers or other modalities to strengthen the performance of the algorithm.
卵巢癌是最致命的妇科癌症,早期发现对于降低死亡率至关重要。目前的非侵入性风险评估措施包括蛋白质生物标志物与其他临床因素相结合,其准确性各不相同。机器学习可用于优化这些特征的组合,从而更准确地评估恶性程度。然而,由于疾病的低患病率,这些测试的严格验证具有挑战性,并且可能导致性能不平衡。
MIA3G 是一种用于卵巢癌风险评估的深度前馈神经网络,使用 7 种蛋白质生物标志物以及年龄和绝经状态作为输入特征。该算法是在一个包含 1067 例附件肿块妇女血清标本的异质数据集(患病率=31.8%)上开发的。随后在几乎是其两倍大小的队列(N=2000)上进行验证。
在分析验证数据集(患病率=4.9%)中,MIA3G 显示出 89.8%的敏感性和 84.02%的特异性。阳性预测值为 22.45%,阴性预测值为 99.38%。按癌症类型和分期分层,MIA3G 对上皮性卵巢癌的敏感性为 94.94%,早期癌症为 76.92%,晚期癌症为 98.04%。
MIA3G 的平衡性能导致高敏感性和高特异性,这种组合可能对提供者在评估其患者的适当管理策略方面具有临床意义。这项工作的局限性包括数据集主要是回顾性的,以及训练和验证数据集中组织学亚型的分配不均,尽管是随机的。未来的方向可能包括添加新的生物标志物或其他模式来增强算法的性能。