• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于标记集和测序数据分析的多标记信息聚合:基因型合并与相似性合并

On the Aggregation of Multimarker Information for Marker-Set and Sequencing Data Analysis: Genotype Collapsing vs. Similarity Collapsing.

作者信息

Pongpanich Monnat, Neely Megan L, Tzeng Jung-Ying

机构信息

Bioinformatics Research Center, North Carolina State University Raleigh, NC, USA.

出版信息

Front Genet. 2012 Jan 9;2:110. doi: 10.3389/fgene.2011.00110. eCollection 2011.

DOI:10.3389/fgene.2011.00110
PMID:22303404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3266618/
Abstract

Methods that collapse information across genetic markers when searching for association signals are gaining momentum in the literature. Although originally developed to achieve a better balance between retaining information and controlling degrees of freedom when performing multimarker association analysis, these methods have recently been proven to be a powerful tool for identifying rare variants that contribute to complex phenotypes. The information among markers can be collapsed at the genotype level, which focuses on the mean of genetic information, or the similarity level, which focuses on the variance of genetic information. The aim of this work is to understand the strengths and weaknesses of these two collapsing strategies. Our results show that neither collapsing strategy outperforms the other across all simulated scenarios. Two factors that dominate the performance of these strategies are the signal-to-noise ratio and the underlying genetic architecture of the causal variants. Genotype collapsing is more sensitive to the marker set being contaminated by noise loci than similarity collapsing. In addition, genotype collapsing performs best when the genetic architecture of the causal variants is not complex (e.g., causal loci with similar effects and similar frequencies). Similarity collapsing is more robust as the complexity of the genetic architecture increases and outperforms genotype collapsing when the genetic architecture of the marker set becomes more sophisticated (e.g., causal loci with various effect sizes or frequencies and potential non-linear or interactive effects). Because the underlying genetic architecture is not known a priori, we also considered a two-stage analysis that combines the two top-performing methods from different collapsing strategies. We find that it is reasonably robust across all simulated scenarios.

摘要

在寻找关联信号时,通过合并遗传标记信息的方法在文献中越来越受到关注。尽管这些方法最初是为了在进行多标记关联分析时,在保留信息和控制自由度之间取得更好的平衡而开发的,但最近已被证明是识别导致复杂表型的罕见变异的有力工具。标记之间的信息可以在基因型水平上合并,该水平关注遗传信息的均值;也可以在相似性水平上合并,该水平关注遗传信息的方差。这项工作的目的是了解这两种合并策略的优缺点。我们的结果表明,在所有模拟场景中,没有一种合并策略比另一种更优。主导这些策略性能的两个因素是信噪比和因果变异的潜在遗传结构。与相似性合并相比,基因型合并对被噪声位点污染的标记集更敏感。此外,当因果变异的遗传结构不复杂时(例如,具有相似效应和相似频率的因果位点),基因型合并表现最佳。随着遗传结构复杂性的增加,相似性合并更稳健,当标记集的遗传结构变得更复杂时(例如,具有各种效应大小或频率以及潜在非线性或交互效应的因果位点),相似性合并优于基因型合并。由于潜在的遗传结构事先未知,我们还考虑了一种两阶段分析,该分析结合了来自不同合并策略的两种表现最佳的方法。我们发现它在所有模拟场景中都相当稳健。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/c38a44c1b9de/fgene-02-00110-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/bd412f896671/fgene-02-00110-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/452a0cb6832a/fgene-02-00110-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/e3d16ff6b003/fgene-02-00110-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/63d5adc834c9/fgene-02-00110-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/8e2c30654753/fgene-02-00110-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/5e35431e598d/fgene-02-00110-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/bf63e4065dc2/fgene-02-00110-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/c38a44c1b9de/fgene-02-00110-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/bd412f896671/fgene-02-00110-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/452a0cb6832a/fgene-02-00110-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/e3d16ff6b003/fgene-02-00110-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/63d5adc834c9/fgene-02-00110-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/8e2c30654753/fgene-02-00110-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/5e35431e598d/fgene-02-00110-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/bf63e4065dc2/fgene-02-00110-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4f0/3266618/c38a44c1b9de/fgene-02-00110-g008.jpg

相似文献

1
On the Aggregation of Multimarker Information for Marker-Set and Sequencing Data Analysis: Genotype Collapsing vs. Similarity Collapsing.用于标记集和测序数据分析的多标记信息聚合:基因型合并与相似性合并
Front Genet. 2012 Jan 9;2:110. doi: 10.3389/fgene.2011.00110. eCollection 2011.
2
Enrichment analysis of genetic association in genes and pathways by aggregating signals from both rare and common variants.通过整合罕见和常见变异的信号对基因和通路中的遗传关联进行富集分析。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S52. doi: 10.1186/1753-6561-5-S9-S52.
3
Whole genome sequence analysis of the simulated systolic blood pressure in Genetic Analysis Workshop 18 family data: long-term average and collapsing methods.遗传分析研讨会18家庭数据中模拟收缩压的全基因组序列分析:长期平均法和压缩法
BMC Proc. 2014 Jun 17;8(Suppl 1 Genetic Analysis Workshop 18Vanessa Olmo):S12. doi: 10.1186/1753-6561-8-S1-S12. eCollection 2014.
4
Comparison of similarity-based tests and pooling strategies for rare variants.基于相似性的检验和稀有变异体的池化策略比较。
BMC Genomics. 2013 Jan 24;14:50. doi: 10.1186/1471-2164-14-50.
5
Application of collapsing methods for continuous traits to the Genetic Analysis Workshop 17 exome sequence data.将连续性状的压缩方法应用于遗传分析研讨会17外显子组序列数据。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S121. doi: 10.1186/1753-6561-5-S9-S121.
6
Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data.应用于遗传分析研讨会17小外显子数据的基于塌缩法和基于核函数法的单基因分析。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S117. doi: 10.1186/1753-6561-5-S9-S117. eCollection 2011.
7
Weighted selective collapsing strategy for detecting rare and common variants in genetic association study.加权选择性合并策略在遗传关联研究中检测罕见和常见变异。
BMC Genet. 2012 Feb 6;13:7. doi: 10.1186/1471-2156-13-7.
8
Detecting disease association with rare variants in case-parents studies.在病例-父母研究中检测疾病与罕见变异的关联。
J Hum Genet. 2017 Apr;62(5):549-552. doi: 10.1038/jhg.2017.1. Epub 2017 Feb 2.
9
Detecting functional rare variants by collapsing and incorporating functional annotation in Genetic Analysis Workshop 17 mini-exome data.在遗传分析研讨会17小外显子组数据中,通过合并和纳入功能注释来检测功能性罕见变异。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S27. doi: 10.1186/1753-6561-5-S9-S27.
10
A comparison of two collapsing methods in different approaches.不同方法中两种折叠方法的比较。
BMC Proc. 2014 Jun 17;8(Suppl 1 Genetic Analysis Workshop 18Vanessa Olmo):S8. doi: 10.1186/1753-6561-8-S1-S8. eCollection 2014.

引用本文的文献

1
A New Method for Conditional Gene-Based Analysis Effectively Accounts for the Regional Polygenic Background.一种新的基于条件基因的分析方法能够有效地解释区域多基因背景。
Genes (Basel). 2024 Sep 7;15(9):1174. doi: 10.3390/genes15091174.
2
How to make DNA methylome wide association studies more powerful.如何使全基因组DNA甲基化关联研究更具效力。
Epigenomics. 2016 Aug;8(8):1117-29. doi: 10.2217/epi-2016-0017. Epub 2016 Apr 7.
3
A New Method for Detecting Associations with Rare Copy-Number Variants.一种检测与罕见拷贝数变异关联的新方法。

本文引用的文献

1
A general framework for detecting disease associations with rare variants in sequencing studies.一种用于在测序研究中检测罕见变异与疾病关联的通用框架。
Am J Hum Genet. 2011 Sep 9;89(3):354-67. doi: 10.1016/j.ajhg.2011.07.015. Epub 2011 Sep 1.
2
Studying gene and gene-environment effects of uncommon and common variants on continuous traits: a marker-set approach using gene-trait similarity regression.研究罕见和常见变异对连续性状的基因和基因-环境效应:一种使用基因-性状相似性回归的标记集方法。
Am J Hum Genet. 2011 Aug 12;89(2):277-88. doi: 10.1016/j.ajhg.2011.07.007.
3
Comparison of statistical tests for disease association with rare variants.
PLoS Genet. 2015 Oct 2;11(10):e1005403. doi: 10.1371/journal.pgen.1005403. eCollection 2015 Oct.
4
A Fast Multiple-Kernel Method With Applications to Detect Gene-Environment Interaction.一种应用于检测基因-环境相互作用的快速多核方法。
Genet Epidemiol. 2015 Sep;39(6):456-68. doi: 10.1002/gepi.21909. Epub 2015 Jul 3.
5
Assessing gene-environment interactions for common and rare variants with binary traits using gene-trait similarity regression.使用基因-性状相似性回归评估具有二元性状的常见和罕见变异的基因-环境相互作用。
Genetics. 2015 Mar;199(3):695-710. doi: 10.1534/genetics.114.171686. Epub 2015 Jan 12.
6
Complete effect-profile assessment in association studies with multiple genetic and multiple environmental factors.在涉及多个遗传因素和多个环境因素的关联研究中进行完整的效应概况评估。
Genet Epidemiol. 2015 Feb;39(2):122-33. doi: 10.1002/gepi.21877. Epub 2014 Dec 23.
7
GENE-LEVEL PHARMACOGENETIC ANALYSIS ON SURVIVAL OUTCOMES USING GENE-TRAIT SIMILARITY REGRESSION.使用基因-性状相似性回归对生存结局进行基因水平的药物遗传学分析。
Ann Appl Stat. 2014;8(2):1232-1255. doi: 10.1214/14-aoas735.
8
Analysis of gene-gene interactions using gene-trait similarity regression.使用基因-性状相似性回归分析基因-基因相互作用。
Hum Hered. 2014;78(1):17-26. doi: 10.1159/000360161. Epub 2014 Jun 21.
比较罕见变异与疾病关联的统计学检验方法。
Genet Epidemiol. 2011 Nov;35(7):606-19. doi: 10.1002/gepi.20609. Epub 2011 Jul 18.
4
Rare-variant association testing for sequencing data with the sequence kernel association test.基于序列核关联检验的测序数据罕见变异关联分析
Am J Hum Genet. 2011 Jul 15;89(1):82-93. doi: 10.1016/j.ajhg.2011.05.029. Epub 2011 Jul 7.
5
A framework for variation discovery and genotyping using next-generation DNA sequencing data.利用下一代 DNA 测序数据进行变异发现和基因分型的框架。
Nat Genet. 2011 May;43(5):491-8. doi: 10.1038/ng.806. Epub 2011 Apr 10.
6
Testing for an unusual distribution of rare variants.检测罕见变异的异常分布。
PLoS Genet. 2011 Mar;7(3):e1001322. doi: 10.1371/journal.pgen.1001322. Epub 2011 Mar 3.
7
Comparison of methods and sampling designs to test for association between rare variants and quantitative traits.用于检测罕见变异与数量性状之间关联的方法和抽样设计的比较。
Genet Epidemiol. 2011 May;35(4):226-35. doi: 10.1002/gepi.20570.
8
A new testing strategy to identify rare variants with either risk or protective effect on disease.一种新的检测策略,用于识别对疾病具有风险或保护作用的罕见变异。
PLoS Genet. 2011 Feb 3;7(2):e1001289. doi: 10.1371/journal.pgen.1001289.
9
Efficient utilization of rare variants for detection of disease-related genomic regions.高效利用罕见变异来检测与疾病相关的基因组区域。
PLoS One. 2010 Dec 10;5(12):e14288. doi: 10.1371/journal.pone.0014288.
10
Comprehensive approach to analyzing rare genetic variants.综合分析罕见遗传变异。
PLoS One. 2010 Nov 3;5(11):e13584. doi: 10.1371/journal.pone.0013584.