Suppr超能文献

利用符号计算辅助有效得分的计算。

Facilitating the Calculation of the Efficient Score Using Symbolic Computing.

作者信息

Sibley Alexander, Li Zhiguo, Jiang Yu, Li Yi-Ju, Chan Cliburn, Allen Andrew, Owzar Kouros

机构信息

Duke Cancer Institute, Duke University Medical Center.

Biostatistics and Bioinformatics, Duke University School of Medicine.

出版信息

Am Stat. 2018;72(2):199-205. doi: 10.1080/00031305.2017.1392361. Epub 2017 Oct 30.

Abstract

The score statistic continues to be a fundamental tool for statistical inference. In the analysis of data from high-throughput genomic assays, inference on the basis of the score usually enjoys greater stability, considerably higher computational efficiency, and lends itself more readily to the use of resampling methods than the asymptotically equivalent Wald or likelihood ratio tests. The score function often depends on a set of unknown nuisance parameters which have to be replaced by estimators, but can be improved by calculating the efficient score, which accounts for the variability induced by estimating these parameters. Manual derivation of the efficient score is tedious and error-prone, so we illustrate using computer algebra to facilitate this derivation. We demonstrate this process within the context of a standard example from genetic association analyses, though the techniques shown here could be applied to any derivation, and have a place in the toolbox of any modern statistician. We further show how the resulting symbolic expressions can be readily ported to compiled languages, to develop fast numerical algorithms for high-throughput genomic analysis. We conclude by considering extensions of this approach. The code featured in this report is available online as part of the supplementary material.

摘要

得分统计量仍然是统计推断的一个基本工具。在高通量基因组检测数据分析中,基于得分的推断通常具有更高的稳定性、显著更高的计算效率,并且与渐近等价的 Wald 检验或似然比检验相比,更易于采用重采样方法。得分函数通常依赖于一组未知的干扰参数,这些参数必须用估计量来代替,但通过计算有效得分可以改进,有效得分考虑了估计这些参数所引起的变异性。手动推导有效得分既繁琐又容易出错,因此我们举例说明使用计算机代数来促进这一推导过程。我们在遗传关联分析的一个标准示例的背景下演示这个过程,尽管这里展示的技术可以应用于任何推导,并且在任何现代统计学家的工具库中都有一席之地。我们进一步展示如何将所得的符号表达式轻松移植到编译语言中,以开发用于高通量基因组分析的快速数值算法。我们通过考虑这种方法的扩展来得出结论。本报告中的代码作为补充材料的一部分可在网上获取。

相似文献

7
SD-CAS: Spin Dynamics by Computer Algebra System.SD-CAS:计算机代数系统中的自旋动力学。
J Magn Reson. 2010 Nov;207(1):95-113. doi: 10.1016/j.jmr.2010.08.014. Epub 2010 Aug 24.

本文引用的文献

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验