Methods for a similarity measure for clinical attributes based on survival data analysis.

Affiliations

Heidelberg University Hospital, Institute of Medical Biometry and Informatics, Im Neuenheimer Feld 130.3, 69120, Heidelberg, Germany.

Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany.

Publication information

BMC Med Inform Decis Mak. 2019 Oct 21;19(1):195. doi: 10.1186/s12911-019-0917-6.

DOI:10.1186/s12911-019-0917-6

PMID:31638963

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6805472/

Abstract

BACKGROUND

Case-based reasoning is a proven method that draws on previously learned cases to support decisions about a new case. The accuracy of such a system depends on the applied similarity measure, which quantifies the similarity between two cases. This work proposes a collection of methods for similarity measures, intended especially for comparing clinical cases on the basis of survival data such as those available from clinical trials.

METHODS

Our approach is intended for scenarios in which longitudinal data, such as survival data, are to be used for case-based reasoning. This may be especially important where the ideal therapy decision is uncertain. The collection of methods consists of definitions of the local similarity of nominal as well as numeric attributes, a calculation of attribute weights, a feature selection method, and finally a global similarity measure. All of them use survival time (consisting of survival status and overall survival) as the reference for similarity. As a baseline, we calculate a survival function for each value of any given clinical attribute.

RESULTS

We define the similarity between values of the same attribute by relating the estimated survival functions to each other, and we quantify it by determining the area between the corresponding survival curves. The proposed global similarity measure is designed especially for cases from randomized clinical trials or other collections of clinical data with survival information. Overall survival can be considered an eligible alternative solution for similarity calculations. It is especially useful when similarity measures that depend on the classic solution-describing attribute "applied therapy" are not applicable, which is often the case for data from clinical trials containing randomized arms.
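The idea of comparing attribute values through the area between their survival curves can be illustrated with a minimal sketch. It assumes a Kaplan-Meier estimator for the survival function and a normalization of the area by the follow-up horizon so that the result lies in [0, 1]; the abstract specifies neither, and the function names (`kaplan_meier`, `area_between`, `local_similarity`) are hypothetical rather than taken from the paper.

```python
from typing import List, Sequence, Tuple

Curve = List[Tuple[float, float]]  # (time, survival probability) steps


def kaplan_meier(times: Sequence[float], events: Sequence[int]) -> Curve:
    """Kaplan-Meier step curve; events[i] is 1 for death, 0 for censoring."""
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv = 1.0
    curve: Curve = [(0.0, 1.0)]
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = removed = 0
        # Group all subjects sharing the same observed time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            surv *= 1.0 - deaths / at_risk
            curve.append((t, surv))
        at_risk -= removed
    return curve


def survival_at(curve: Curve, t: float) -> float:
    """Value of the right-continuous step function at time t."""
    value = 1.0
    for step_time, step_value in curve:
        if step_time > t:
            break
        value = step_value
    return value


def area_between(a: Curve, b: Curve, horizon: float) -> float:
    """Integral of |S_a(t) - S_b(t)| over [0, horizon]."""
    points = sorted({0.0, horizon} | {t for t, _ in a + b if t < horizon})
    return sum(
        abs(survival_at(a, left) - survival_at(b, left)) * (right - left)
        for left, right in zip(points, points[1:])
    )


def local_similarity(a: Curve, b: Curve, horizon: float) -> float:
    """Assumed normalization: 1 minus the area divided by the horizon."""
    return 1.0 - area_between(a, b, horizon) / horizon


# Hypothetical survival times (months) for two values of one attribute.
curve_x = kaplan_meier([2, 4, 6], [1, 1, 1])
curve_y = kaplan_meier([1, 2, 3], [1, 1, 1])
similarity = local_similarity(curve_x, curve_y, horizon=6.0)
```

Because both curves are step functions, the integral reduces to a sum over the intervals between consecutive event times, so no numerical quadrature is needed.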

CONCLUSIONS

In silico evaluation scenarios showed that the mean accuracy of biomarker detection in the k = 10 most similar cases is higher (0.909-0.998) than for competing similarity measures, such as the Heterogeneous Euclidean-Overlap Metric (0.657-0.831) and the Discretized Value Difference Metric (0.535-0.671). The weight calculation method yielded weights for biomarker attributes that were more than six times (6.59-6.95) higher than those for non-biomarker attributes. These results suggest that the similarity measure described here is suitable for applications based on survival data.
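For orientation, the Heterogeneous Euclidean-Overlap Metric used as a comparison baseline can be sketched as follows. This follows the commonly cited definition (overlap distance for nominal attributes, range-normalized absolute difference for numeric attributes, distance 1 for missing values) and is not the authors' implementation; the attribute names and the `numeric_ranges` argument are hypothetical.

```python
import math
from typing import Dict, Optional, Tuple, Union

Value = Optional[Union[float, str]]  # None marks a missing value


def heom(
    x: Dict[str, Value],
    y: Dict[str, Value],
    numeric_ranges: Dict[str, Tuple[float, float]],
) -> float:
    """HEOM distance between two cases sharing the same attribute keys."""
    total = 0.0
    for attr in x:
        a, b = x[attr], y[attr]
        if a is None or b is None:
            d = 1.0  # missing values count as maximally distant
        elif attr in numeric_ranges:
            low, high = numeric_ranges[attr]
            d = abs(a - b) / (high - low) if high > low else 0.0
        else:
            d = 0.0 if a == b else 1.0  # overlap distance for nominal values
        total += d * d
    return math.sqrt(total)


# Hypothetical cases with one numeric and two nominal attributes.
case_a = {"age": 60, "stage": "II", "sex": "f"}
case_b = {"age": 40, "stage": "III", "sex": None}
distance = heom(case_a, case_b, numeric_ranges={"age": (20, 100)})
```

Unlike the survival-based measure, this distance uses only the attribute values themselves and ignores outcome information.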

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e685/6805472/193ec11c93b8/12911_2019_917_Fig1_HTML.jpg
