Suppr超能文献

分子生物学的非参数方法。

Nonparametric methods for molecular biology.

作者信息

Wittkowski Knut M, Song Tingting

机构信息

Center for Clinical and Translational Science, The Rockefeller University, New York, NY, USA.

出版信息

Methods Mol Biol. 2010;620:105-53. doi: 10.1007/978-1-60761-580-4_2.

Abstract

In 2003, the completion of the Human Genome Project (1) together with advances in computational resources (2) were expected to launch an era where the genetic and genomic contributions to many common diseases would be found. In the years following, however, researchers became increasingly frustrated as most reported 'findings' could not be replicated in independent studies (3). To improve the signal/noise ratio, it was suggested to increase the number of cases to be included to tens of thousands (4), a requirement that would dramatically restrict the scope of personalized medicine. Similarly, there was little success in elucidating the gene-gene interactions involved in complex diseases or even in developing criteria for assessing their phenotypes. As a partial solution to these enigmata, we here introduce a class of statistical methods as the 'missing link' between advances in genetics and informatics. As a first step, we provide a unifying view of a plethora of nonparametric tests developed mainly in the 1940s, all of which can be expressed as u-statistics. Then, we will extend this approach to reflect categorical and ordinal relationships between variables, resulting in a flexible and powerful approach to deal with the impact of (1) multiallelic genetic loci, (2) poly-locus genetic regions, and (3) oligo-genetic and oligo-genomic collaborative interactions on complex phenotypes.

摘要

2003年,人类基因组计划的完成(1)以及计算资源的进步(2),有望开启一个能够发现遗传和基因组对许多常见疾病影响的时代。然而,在随后的几年里,研究人员越来越沮丧,因为大多数报告的“发现”无法在独立研究中得到重复验证(3)。为了提高信号/噪声比,有人建议将纳入的病例数量增加到数万例(4),这一要求将极大地限制个性化医疗的范围。同样,在阐明复杂疾病中涉及的基因-基因相互作用,甚至在制定评估其表型的标准方面,也几乎没有取得成功。作为这些谜团的部分解决方案,我们在此引入一类统计方法,作为遗传学和信息学进展之间的“缺失环节”。作为第一步,我们对主要在20世纪40年代开发的大量非参数检验提供了一个统一的观点,所有这些检验都可以表示为u统计量。然后,我们将扩展这种方法,以反映变量之间的分类和有序关系,从而形成一种灵活而强大的方法,来处理(1)多等位基因遗传位点、(2)多基因座遗传区域以及(3)寡基因和寡基因组协同相互作用对复杂表型的影响。

相似文献

1
Nonparametric methods for molecular biology.
Methods Mol Biol. 2010;620:105-53. doi: 10.1007/978-1-60761-580-4_2.
2
5
Boosting the power of schizophrenia genetics by leveraging new statistical tools.
Schizophr Bull. 2014 Jan;40(1):13-7. doi: 10.1093/schbul/sbt168. Epub 2013 Dec 6.
8
U-statistics in genetic association studies.
Hum Genet. 2012 Sep;131(9):1395-401. doi: 10.1007/s00439-012-1178-y. Epub 2012 May 20.

引用本文的文献

1
TIGA: target illumination GWAS analytics.
Bioinformatics. 2021 Nov 5;37(21):3865-3873. doi: 10.1093/bioinformatics/btab427.
2
CrAssphage as a Novel Tool to Detect Human Fecal Contamination on Environmental Surfaces and Hands.
Emerg Infect Dis. 2020 Aug;26(8):1731-1739. doi: 10.3201/eid2608.200346. Epub 2020 Jun 8.
3
Complex polymorphisms in endocytosis genes suggest alpha-cyclodextrin as a treatment for breast cancer.
PLoS One. 2018 Jul 2;13(7):e0199012. doi: 10.1371/journal.pone.0199012. eCollection 2018.
5
Conditional long-term survival following minimally invasive robotic mitral valve repair: a health services perspective.
Ann Cardiothorac Surg. 2015 Sep;4(5):433-42. doi: 10.3978/j.issn.2225-319X.2015.08.08.
9
The evolutionary rate of antibacterial drug targets.
BMC Bioinformatics. 2013 Feb 1;14:36. doi: 10.1186/1471-2105-14-36.

本文引用的文献

1
Answering Ordinal Questions with Ordinal Data Using Ordinal Statistics.
Multivariate Behav Res. 1996 Jul 1;31(3):331-50. doi: 10.1207/s15327906mbr3103_4.
2
U-Scores for Multivariate Data in Sports.
J Quant Anal Sports. 2008 Jul 18;4(3). doi: 10.2202/1559-0410.1129,.
3
STUDYING TRAVEL-RELATED INDIVIDUAL ASSESSMENTS AND DESIRES BY COMBINING HIERARCHICALLY STRUCTURED ORDINAL VARIABLES.
Transportation (Amst). 2009 Mar 1;36(2):187-206. doi: 10.1007/s11116-009-9186-z.
4
The statistical sign test.
J Am Stat Assoc. 1946 Dec;41(236):557-66. doi: 10.1080/01621459.1946.10501898.
5
Note on the sampling error of the difference between correlated proportions or percentages.
Psychometrika. 1947 Jun;12(2):153-7. doi: 10.1007/BF02295996.
6
7
Genetic risk prediction--are we there yet?
N Engl J Med. 2009 Apr 23;360(17):1701-3. doi: 10.1056/NEJMp0810107. Epub 2009 Apr 15.
8
Genomewide association studies: history, rationale, and prospects for psychiatric disorders.
Am J Psychiatry. 2009 May;166(5):540-56. doi: 10.1176/appi.ajp.2008.08091354. Epub 2009 Apr 1.
9
Assessment of multiple ordinal endpoints.
Biom J. 2009 Feb;51(1):217-26. doi: 10.1002/bimj.200810502.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验