Suppr超能文献

一种用于结合具有检测限的高维标记物的重采样替换套索方法。

A resample-replace lasso procedure for combining high-dimensional markers with limit of detection.

作者信息

Wang Jinjuan, Zhao Yunpeng, Tang Larry L, Mueller Claudius, Li Qizhai

机构信息

School of Mathematics and Statistics, Beijing Institute of Technology, Beijing, People's Republic of China.

School of Mathematical and Natural Sciences, Arizona State University, Tempe, AZ, USA.

出版信息

J Appl Stat. 2021 Sep 22;49(16):4278-4293. doi: 10.1080/02664763.2021.1977785. eCollection 2022.

Abstract

In disease screening, a biomarker combination developed by combining multiple markers tends to have a higher sensitivity than an individual marker. Parametric methods for marker combination rely on the inverse of covariance matrices, which is often a non-trivial problem for high-dimensional data generated by modern high-throughput technologies. Additionally, another common problem in disease diagnosis is the existence of limit of detection (LOD) for an instrument - that is, when a biomarker's value falls below the limit, it cannot be observed and is assigned an NA value. To handle these two challenges in combining high-dimensional biomarkers with the presence of LOD, we propose a resample-replace lasso procedure. We first impute the values below LOD and then use the graphical lasso method to estimate the means and precision matrices for the high-dimensional biomarkers. The simulation results show that our method outperforms alternative methods such as either substitute NA values with LOD values or remove observations that have NA values. A real case analysis on a protein profiling study of glioblastoma patients on their survival status indicates that the biomarker combination obtained through the proposed method is more accurate in distinguishing between two groups.

摘要

在疾病筛查中,通过组合多个标志物开发的生物标志物组合往往比单个标志物具有更高的灵敏度。用于标志物组合的参数方法依赖于协方差矩阵的逆,这对于现代高通量技术生成的高维数据来说通常是一个棘手的问题。此外,疾病诊断中的另一个常见问题是仪器存在检测限(LOD)——也就是说,当生物标志物的值低于该限时,就无法观察到,并且会被赋值为NA值。为了应对在存在LOD的情况下组合高维生物标志物时的这两个挑战,我们提出了一种重采样替换套索程序。我们首先对低于LOD的值进行插补,然后使用图形套索方法来估计高维生物标志物的均值和精度矩阵。模拟结果表明,我们的方法优于其他方法,如用LOD值替代NA值或删除具有NA值的观测值。对胶质母细胞瘤患者生存状态的蛋白质谱研究进行的实际案例分析表明,通过所提出的方法获得的生物标志物组合在区分两组时更准确。

相似文献

3
Combining markers with and without the limit of detection.结合有和无检测限的标志物。
Stat Med. 2014 Apr 15;33(8):1307-20. doi: 10.1002/sim.6027. Epub 2013 Oct 17.
10
New insights into modeling exposure measurements below the limit of detection.低于检测限的暴露测量建模新见解。
Environ Epidemiol. 2020 Dec 16;5(1):e116. doi: 10.1097/EE9.0000000000000116. eCollection 2021 Feb.

本文引用的文献

1
ℓ 1-Penalized censored Gaussian graphical model.L1 惩罚截断高斯图形模型。
Biostatistics. 2020 Apr 1;21(2):e1-e16. doi: 10.1093/biostatistics/kxy043.
2
Sufficient dimension reduction for censored predictors.删失预测变量的充分降维
Biometrics. 2017 Mar;73(1):220-231. doi: 10.1111/biom.12556. Epub 2016 Aug 9.
4
Combining markers with and without the limit of detection.结合有和无检测限的标志物。
Stat Med. 2014 Apr 15;33(8):1307-20. doi: 10.1002/sim.6027. Epub 2013 Oct 17.
5
Penalized classification using Fisher's linear discriminant.使用费舍尔线性判别法的惩罚分类
J R Stat Soc Series B Stat Methodol. 2011 Nov;73(5):753-772. doi: 10.1111/j.1467-9868.2011.00783.x.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验