Regression for skewed biomarker outcomes subject to pooling.

Author information

Mitchell Emily M, Lyles Robert H, Manatunga Amita K, Danaher Michelle, Perkins Neil J, Schisterman Enrique F

Affiliation

Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, Georgia 30322, U.S.A.

Publication information

Biometrics. 2014 Mar;70(1):202-11. doi: 10.1111/biom.12134. Epub 2014 Feb 12.

Abstract

Epidemiological studies involving biomarkers are often hindered by prohibitively expensive laboratory tests. Strategically pooling specimens prior to performing these lab assays has been shown to effectively reduce cost with minimal information loss in a logistic regression setting. When the goal is to perform regression with a continuous biomarker as the outcome, regression analysis of pooled specimens may not be straightforward, particularly if the outcome is right-skewed. In such cases, we demonstrate that a slight modification of a standard multiple linear regression model for poolwise data can provide valid and precise coefficient estimates when pools are formed by combining biospecimens from subjects with identical covariate values. When these x-homogeneous pools cannot be formed, we propose a Monte Carlo expectation maximization (MCEM) algorithm to compute maximum likelihood estimates (MLEs). Simulation studies demonstrate that these analytical methods provide essentially unbiased estimates of coefficient parameters as well as their standard errors when appropriate assumptions are met. Furthermore, we show how one can utilize the fully observed covariate data to inform the pooling strategy, yielding a high level of statistical efficiency at a fraction of the total lab cost.
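The idea of x-homogeneous pooling can be illustrated with a small simulation. The sketch below is not the authors' model or code. It assumes a log-normal outcome, a single binary covariate, and equal pool sizes, and it fits ordinary least squares to the log of each pooled (averaged) measurement. Under those assumptions the slope is recovered, while the intercept is shifted upward by a constant that depends on the pool size and error variance; that shift is the kind of bias a modified poolwise model would need to address. The function name and all parameter values are hypothetical.

```python
import numpy as np


def simulate_pooled_slope(n_subjects=6000, pool_size=3,
                          beta0=0.5, beta1=0.8, sigma=0.6, seed=0):
    """Fit OLS to log pooled means from x-homogeneous pools (illustrative)."""
    rng = np.random.default_rng(seed)

    # Fully observed binary covariate and a right-skewed (log-normal) outcome.
    x = rng.integers(0, 2, size=n_subjects)
    y = np.exp(beta0 + beta1 * x + rng.normal(0.0, sigma, size=n_subjects))

    pooled_x, pooled_y = [], []
    for value in (0, 1):
        # Pool only specimens that share the same covariate value.
        y_group = y[x == value]
        n_pools = len(y_group) // pool_size
        pools = y_group[: n_pools * pool_size].reshape(n_pools, pool_size)
        # The lab assay on a pool is assumed to return the arithmetic mean.
        pooled_y.append(pools.mean(axis=1))
        pooled_x.append(np.full(n_pools, value, dtype=float))

    px = np.concatenate(pooled_x)
    py = np.concatenate(pooled_y)

    # Regress the log of the pooled measurement on the pool's covariate value.
    design = np.column_stack([np.ones(len(px)), px])
    coef, *_ = np.linalg.lstsq(design, np.log(py), rcond=None)
    return coef  # [intercept, slope]


if __name__ == "__main__":
    intercept, slope = simulate_pooled_slope()
    print(f"intercept estimate: {intercept:.3f} (individual-level beta0 = 0.5)")
    print(f"slope estimate:     {slope:.3f} (individual-level beta1 = 0.8)")
```

With equal pool sizes, the log of a pool mean equals the linear predictor plus a term that does not depend on the covariate, which is why the slope survives pooling here. With unequal pool sizes or pools that mix covariate values, this simple fit would no longer be adequate.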

