• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用符号计算辅助有效得分的计算。

Facilitating the Calculation of the Efficient Score Using Symbolic Computing.

作者信息

Sibley Alexander, Li Zhiguo, Jiang Yu, Li Yi-Ju, Chan Cliburn, Allen Andrew, Owzar Kouros

机构信息

Duke Cancer Institute, Duke University Medical Center.

Biostatistics and Bioinformatics, Duke University School of Medicine.

出版信息

Am Stat. 2018;72(2):199-205. doi: 10.1080/00031305.2017.1392361. Epub 2017 Oct 30.

DOI:10.1080/00031305.2017.1392361
PMID:30122786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6092959/
Abstract

The score statistic continues to be a fundamental tool for statistical inference. In the analysis of data from high-throughput genomic assays, inference on the basis of the score usually enjoys greater stability, considerably higher computational efficiency, and lends itself more readily to the use of resampling methods than the asymptotically equivalent Wald or likelihood ratio tests. The score function often depends on a set of unknown nuisance parameters which have to be replaced by estimators, but can be improved by calculating the efficient score, which accounts for the variability induced by estimating these parameters. Manual derivation of the efficient score is tedious and error-prone, so we illustrate using computer algebra to facilitate this derivation. We demonstrate this process within the context of a standard example from genetic association analyses, though the techniques shown here could be applied to any derivation, and have a place in the toolbox of any modern statistician. We further show how the resulting symbolic expressions can be readily ported to compiled languages, to develop fast numerical algorithms for high-throughput genomic analysis. We conclude by considering extensions of this approach. The code featured in this report is available online as part of the supplementary material.

摘要

得分统计量仍然是统计推断的一个基本工具。在高通量基因组检测数据分析中,基于得分的推断通常具有更高的稳定性、显著更高的计算效率,并且与渐近等价的 Wald 检验或似然比检验相比,更易于采用重采样方法。得分函数通常依赖于一组未知的干扰参数,这些参数必须用估计量来代替,但通过计算有效得分可以改进,有效得分考虑了估计这些参数所引起的变异性。手动推导有效得分既繁琐又容易出错,因此我们举例说明使用计算机代数来促进这一推导过程。我们在遗传关联分析的一个标准示例的背景下演示这个过程,尽管这里展示的技术可以应用于任何推导,并且在任何现代统计学家的工具库中都有一席之地。我们进一步展示如何将所得的符号表达式轻松移植到编译语言中,以开发用于高通量基因组分析的快速数值算法。我们通过考虑这种方法的扩展来得出结论。本报告中的代码作为补充材料的一部分可在网上获取。

相似文献

1
Facilitating the Calculation of the Efficient Score Using Symbolic Computing.利用符号计算辅助有效得分的计算。
Am Stat. 2018;72(2):199-205. doi: 10.1080/00031305.2017.1392361. Epub 2017 Oct 30.
2
The MVGC multivariate Granger causality toolbox: a new approach to Granger-causal inference.MVGC 多元 Granger 因果关系工具箱:Granger 因果推断的新方法。
J Neurosci Methods. 2014 Feb 15;223:50-68. doi: 10.1016/j.jneumeth.2013.10.018. Epub 2013 Nov 5.
3
Fast score test with global null estimation regardless of missing genotypes.无论基因型缺失如何,都可以进行快速得分检验和全局零假设估计。
PLoS One. 2018 Jul 5;13(7):e0199692. doi: 10.1371/journal.pone.0199692. eCollection 2018.
4
Targeted estimation of nuisance parameters to obtain valid statistical inference.对干扰参数进行有针对性的估计以获得有效的统计推断。
Int J Biostat. 2014;10(1):29-57. doi: 10.1515/ijb-2012-0038.
5
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
6
Semiparametric estimation in generalized linear mixed models with auxiliary covariates: a pairwise likelihood approach.具有辅助协变量的广义线性混合模型中的半参数估计:一种成对似然方法。
Biometrics. 2014 Dec;70(4):910-9. doi: 10.1111/biom.12208. Epub 2014 Sep 23.
7
SD-CAS: Spin Dynamics by Computer Algebra System.SD-CAS:计算机代数系统中的自旋动力学。
J Magn Reson. 2010 Nov;207(1):95-113. doi: 10.1016/j.jmr.2010.08.014. Epub 2010 Aug 24.
8
Multivariate correlation estimator for inferring functional relationships from replicated genome-wide data.用于从复制的全基因组数据推断功能关系的多变量相关估计器。
Bioinformatics. 2007 Sep 1;23(17):2298-305. doi: 10.1093/bioinformatics/btm328. Epub 2007 Jun 22.
9
Doubly robust inference for targeted minimum loss-based estimation in randomized trials with missing outcome data.在存在结局数据缺失的随机试验中,基于目标最小损失估计的双重稳健推断。
Stat Med. 2017 Oct 30;36(24):3807-3819. doi: 10.1002/sim.7389. Epub 2017 Jul 25.
10
Efficient score statistics for mapping quantitative trait loci with extended pedigrees.利用扩展系谱定位数量性状基因座的高效得分统计量。
Hum Hered. 2002;54(2):57-68. doi: 10.1159/000067663.

本文引用的文献

1
Leveraging population information in family-based rare variant association analyses of quantitative traits.在基于家系的数量性状罕见变异关联分析中利用群体信息。
Genet Epidemiol. 2017 Feb;41(2):98-107. doi: 10.1002/gepi.22022. Epub 2016 Dec 5.
2
Random Effects Model for Multiple Pathway Analysis with Applications to Type II Diabetes Microarray Data.用于多通路分析的随机效应模型及其在II型糖尿病微阵列数据中的应用
Stat Biosci. 2015 Oct 1;7(2):167-186. doi: 10.1007/s12561-014-9109-1. Epub 2014 Jan 30.
3
Genomic profiling in locally advanced and inflammatory breast cancer and its link to DCE-MRI and overall survival.局部晚期和炎性乳腺癌的基因组分析及其与动态对比增强磁共振成像和总生存期的关联
Int J Hyperthermia. 2015 Jun;31(4):386-95. doi: 10.3109/02656736.2015.1016557. Epub 2015 Mar 26.
4
Genetic associations with expression for genes implicated in GWAS studies for atherosclerotic cardiovascular disease and blood phenotypes.与 GWAS 研究中涉及动脉粥样硬化性心血管疾病和血液表型的基因表达相关的遗传关联。
Hum Mol Genet. 2014 Feb 1;23(3):782-95. doi: 10.1093/hmg/ddt461. Epub 2013 Sep 20.
5
Common genetic variants and modification of penetrance of BRCA2-associated breast cancer.常见遗传变异与 BRCA2 相关乳腺癌外显率的修饰。
PLoS Genet. 2010 Oct 28;6(10):e1001183. doi: 10.1371/journal.pgen.1001183.
6
Polymorphisms in estrogen- and androgen-metabolizing genes and the risk of gastric cancer.雌激素和雄激素代谢基因多态性与胃癌风险
Carcinogenesis. 2009 Jan;30(1):71-7. doi: 10.1093/carcin/bgn258. Epub 2008 Nov 17.
7
Tests of significance in multivariate analysis.多元分析中的显著性检验。
Biometrika. 1948 May;35(Pts 1-2):58-79.
8
So many correlated tests, so little time! Rapid adjustment of P values for multiple correlated tests.这么多相关的测试,时间却这么少!多个相关测试的 P 值的快速调整。
Am J Hum Genet. 2007 Dec;81(6):1158-68. doi: 10.1086/522036.
9
Variation in DNA repair genes ERCC2, XRCC1, and XRCC3 and risk of follicular lymphoma.DNA修复基因ERCC2、XRCC1和XRCC3的变异与滤泡性淋巴瘤风险
Cancer Epidemiol Biomarkers Prev. 2006 Feb;15(2):258-65. doi: 10.1158/1055-9965.EPI-05-0583.
10
Calibrating a coalescent simulation of human genome sequence variation.校准人类基因组序列变异的合并模拟。
Genome Res. 2005 Nov;15(11):1576-83. doi: 10.1101/gr.3709305.