• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组扫描的序贯分析方法。

Sequential methods of analysis for genome scans.

作者信息

Province M A

机构信息

Division of Biostatistics, Washington University School of Medicine, St. Louis, Missouri 63110, USA.

出版信息

Adv Genet. 2001;42:499-514. doi: 10.1016/s0065-2660(01)42039-6.

DOI:10.1016/s0065-2660(01)42039-6
PMID:11037338
Abstract

As the preceding chapters illustrate, now that whole-genome scan analyses are becoming more common, there is considerable disagreement about the best way to balance between false positives and false negatives (traditionally called type I and type II errors in the statistical parlance). Type I and type II errors can be simultaneously controlled, if we are willing to let the sample size of analysis vary. This is the secret that Wald (1947) discovered in the 1940s that led to the theory of sequential sampling and was the inspiration for Newton Morton in developing the lod score method. We can exploit this idea further and capitalize on an old, but nearly forgotten theory: sequential multiple decision procedures (SMDP) (Bechhoffer, et al., 1968), which generalizes the standard "two-hypotheses" tests to consider multiple alternative hypotheses. Using this theory, we can develop a single, genome-wide test that simultaneously partitions all markers into "signal" and "noise" groups, with tight control over both type I and type II errors (Province, 2000). Conceiving this approach as an analysis tool for fixed sample designs (instead of a true sequential sampling scheme), we can let the data decide at which point we should move from the hypothesis generation phase of a genome scan (where multiple comparisons make the interpretation of p values and significance levels difficult and controversial), to a true hypothesis-testing phase (where the problem of multiple comparisons has been all but eliminated so that p values may be accepted at face value).

摘要

如前几章所述,鉴于全基因组扫描分析正变得越来越普遍,在如何最佳平衡假阳性和假阴性(在统计术语中传统上称为I型和II型错误)方面存在相当大的分歧。如果我们愿意让分析的样本量有所变化,那么I型和II型错误可以同时得到控制。这就是沃尔德(1947年)在20世纪40年代发现的秘密,它催生了序贯抽样理论,也是牛顿·莫顿开发对数优势计分法的灵感来源。我们可以进一步利用这一理念,并借助一个古老但几乎被遗忘的理论:序贯多重决策程序(SMDP)(贝乔弗等人,1968年),该理论将标准的“双假设”检验进行了推广,以考虑多个备择假设。利用这一理论,我们可以开发一种单一的全基因组检验方法,将所有标记同时划分为“信号”和“噪声”组,同时严格控制I型和II型错误(普罗文斯,2000年)。将这种方法视为固定样本设计的分析工具(而非真正的序贯抽样方案),我们可以让数据决定从基因组扫描的假设生成阶段(此时多重比较使得p值和显著性水平的解释既困难又有争议)过渡到真正的假设检验阶段(此时多重比较问题几乎已被消除,因此p值可以直接接受)的时机。

相似文献

1
Sequential methods of analysis for genome scans.基因组扫描的序贯分析方法。
Adv Genet. 2001;42:499-514. doi: 10.1016/s0065-2660(01)42039-6.
2
A single, sequential, genome-wide test to identify simultaneously all promising areas in a linkage scan.一种单一的、连续的全基因组检测方法,用于在连锁扫描中同时识别所有有前景的区域。
Genet Epidemiol. 2000 Dec;19(4):301-22. doi: 10.1002/1098-2272(200012)19:4<301::AID-GEPI3>3.0.CO;2-G.
3
False positives and false negatives in genome scans.基因组扫描中的假阳性和假阴性。
Adv Genet. 2001;42:487-98. doi: 10.1016/s0065-2660(01)42038-4.
4
MVQTLCIM: composite interval mapping of multivariate traits in a hybrid F population of outbred species.MVQTLCIM:远交物种杂交F群体中多变量性状的复合区间作图
BMC Bioinformatics. 2017 Nov 23;18(1):515. doi: 10.1186/s12859-017-1908-1.
5
Stratified false discovery control for large-scale hypothesis testing with application to genome-wide association studies.用于大规模假设检验的分层错误发现控制及其在全基因组关联研究中的应用。
Genet Epidemiol. 2006 Sep;30(6):519-30. doi: 10.1002/gepi.20164.
6
Independent genome-wide scans identify a chromosome 18 quantitative-trait locus influencing dyslexia.全基因组独立扫描确定了一个影响诵读困难的18号染色体数量性状位点。
Nat Genet. 2002 Jan;30(1):86-91. doi: 10.1038/ng792. Epub 2001 Dec 17.
7
Mapping the genetic architecture of complex traits in experimental populations.绘制实验群体中复杂性状的遗传结构图谱。
Bioinformatics. 2007 Jun 15;23(12):1527-36. doi: 10.1093/bioinformatics/btm143. Epub 2007 Apr 25.
8
Reducing sample sizes in genome scans: group sequential study designs with futility stops.减少基因组扫描中的样本量:带有无效性终止的序贯组研究设计
Genet Epidemiol. 2003 Dec;25(4):339-49. doi: 10.1002/gepi.10265.
9
A simple correction for multiple comparisons in interval mapping genome scans.区间作图基因组扫描中多重比较的一种简单校正方法。
Heredity (Edinb). 2001 Jul;87(Pt 1):52-8. doi: 10.1046/j.1365-2540.2001.00901.x.
10
HEGESMA: genome search meta-analysis and heterogeneity testing.HEGESMA:基因组搜索荟萃分析与异质性检验。
Bioinformatics. 2005 Sep 15;21(18):3672-3. doi: 10.1093/bioinformatics/bti536. Epub 2005 Jun 14.

引用本文的文献

1
The use of weighted multiple linear regression to estimate QTL × QTL × QTL interaction effects of winter wheat (Triticum aestivum L.) doubled-haploid lines.利用加权多重线性回归估计冬小麦(Triticum aestivum L.)加倍单倍体系的 QTL×QTL×QTL 互作效应。
J Appl Genet. 2023 Dec;64(4):679-693. doi: 10.1007/s13353-023-00795-3. Epub 2023 Oct 25.
2
A Comparison of Methods to Estimate Additive-by-Additive-by-Additive of QTL×QTL×QTL Interaction Effects by Monte Carlo Simulation Studies.基于蒙特卡罗模拟研究的加性-加性-加性 QTL×QTL×QTL 互作效应估计方法比较。
Int J Mol Sci. 2023 Jun 12;24(12):10043. doi: 10.3390/ijms241210043.
3
Epistasis interaction of QTL effects as a genetic parameter influencing estimation of the genetic additive effect.
数量性状基因座(QTL)效应的上位性互作作为影响遗传加性效应估计的遗传参数。
Genet Mol Biol. 2013 Mar;36(1):93-100. doi: 10.1590/S1415-47572013000100013. Epub 2013 Mar 4.
4
Some capabilities for model-based and model-free linkage analysis using the program package S.A.G.E. (Statistical Analysis for Genetic Epidemiology).使用程序包S.A.G.E.(遗传流行病学统计分析)进行基于模型和无模型连锁分析的一些功能。
Hum Hered. 2011;72(4):237-46. doi: 10.1159/000331672. Epub 2011 Dec 23.
5
Linkage studies of catechol-O-methyltransferase (COMT) and dopamine-beta-hydroxylase (DBH) cDNA expression levels.儿茶酚-O-甲基转移酶(COMT)和多巴胺-β-羟化酶(DBH)cDNA表达水平的连锁研究。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S95. doi: 10.1186/1753-6561-1-s1-s95. Epub 2007 Dec 18.
6
Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels).使用贝叶斯网络挖掘遗传流行病学数据I:贝叶斯网络及示例应用(血浆载脂蛋白E水平)
Bioinformatics. 2005 Aug 1;21(15):3273-8. doi: 10.1093/bioinformatics/bti505. Epub 2005 May 24.