• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CollapsABEL:一个用于在全基因组关联研究中检测复合杂合子等位基因的R语言库。

CollapsABEL: an R library for detecting compound heterozygote alleles in genome-wide association studies.

作者信息

Zhong Kaiyin, Karssen Lennart C, Kayser Manfred, Liu Fan

机构信息

Department of Genetic Identification, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands.

PolyOmica, Groningen, The Netherlands.

出版信息

BMC Bioinformatics. 2016 Apr 8;17:156. doi: 10.1186/s12859-016-1006-9.

DOI:10.1186/s12859-016-1006-9
PMID:27059780
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4826552/
Abstract

BACKGROUND

Compound Heterozygosity (CH) in classical genetics is the presence of two different recessive mutations at a particular gene locus. A relaxed form of CH alleles may account for an essential proportion of the missing heritability, i.e. heritability of phenotypes so far not accounted for by single genetic variants. Methods to detect CH-like effects in genome-wide association studies (GWAS) may facilitate explaining the missing heritability, but to our knowledge no viable software tools for this purpose are currently available.

RESULTS

In this work we present the Generalized Compound Double Heterozygosity (GCDH) test and its implementation in the R package CollapsABEL. Time-consuming procedures are optimized for computational efficiency using Java or C++. Intermediate results are stored either in an SQL database or in a so-called big.matrix file to achieve reasonable memory footprint. Our large scale simulation studies show that GCDH is capable of discovering genetic associations due to CH-like interactions with much higher power than a conventional single-SNP approach under various settings, whether the causal genetic variations are available or not. CollapsABEL provides a user-friendly pipeline for genotype collapsing, statistical testing, power estimation, type I error control and graphics generation in the R language.

CONCLUSIONS

CollapsABEL provides a computationally efficient solution for screening general forms of CH alleles in densely imputed microarray or whole genome sequencing datasets. The GCDH test provides an improved power over single-SNP based methods in detecting the prevalence of CH in human complex phenotypes, offering an opportunity for tackling the missing heritability problem. Binary and source packages of CollapsABEL are available on CRAN ( https://cran.r-project.org/web/packages/CollapsABEL ) and the website of the GenABEL project ( http://www.genabel.org/packages ).

摘要

背景

在经典遗传学中,复合杂合性(CH)是指在特定基因座上存在两个不同的隐性突变。CH等位基因的一种宽松形式可能占缺失遗传力的很大一部分,即迄今为止尚未由单个基因变异解释的表型遗传力。在全基因组关联研究(GWAS)中检测类似CH效应的方法可能有助于解释缺失的遗传力,但据我们所知,目前尚无适用于此目的的可行软件工具。

结果

在这项工作中,我们提出了广义复合双杂合性(GCDH)检验及其在R包CollapsABEL中的实现。使用Java或C++对耗时的程序进行了优化以提高计算效率。中间结果存储在SQL数据库或所谓的big.matrix文件中,以实现合理的内存占用。我们的大规模模拟研究表明,无论因果基因变异是否可用,在各种设置下,GCDH都能够以比传统单核苷酸多态性(SNP)方法更高的功效发现由于类似CH相互作用而产生的基因关联。CollapsABEL提供了一个用户友好的管道,用于在R语言中进行基因型压缩、统计检验、功效估计、I型错误控制和图形生成。

结论

CollapsABEL为在密集填充的微阵列或全基因组测序数据集中筛选CH等位基因的一般形式提供了一种计算高效的解决方案。GCDH检验在检测人类复杂表型中CH的普遍性方面比基于单SNP的方法具有更高的功效,为解决缺失遗传力问题提供了一个机会。CollapsABEL的二进制和源包可在CRAN(https://cran.r-project.org/web/packages/CollapsABEL)和GenABEL项目网站(http://www.genabel.org/packages)上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/8add840f7147/12859_2016_1006_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/ce7be463389e/12859_2016_1006_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/5e96dccce76d/12859_2016_1006_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/ab8bbe8bd320/12859_2016_1006_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/8add840f7147/12859_2016_1006_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/ce7be463389e/12859_2016_1006_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/5e96dccce76d/12859_2016_1006_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/ab8bbe8bd320/12859_2016_1006_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecc/4826552/8add840f7147/12859_2016_1006_Fig4_HTML.jpg

相似文献

1
CollapsABEL: an R library for detecting compound heterozygote alleles in genome-wide association studies.CollapsABEL:一个用于在全基因组关联研究中检测复合杂合子等位基因的R语言库。
BMC Bioinformatics. 2016 Apr 8;17:156. doi: 10.1186/s12859-016-1006-9.
2
Detecting low frequent loss-of-function alleles in genome wide association studies with red hair color as example.以红发颜色为例,在全基因组关联研究中检测低频功能丧失等位基因。
PLoS One. 2011;6(11):e28145. doi: 10.1371/journal.pone.0028145. Epub 2011 Nov 29.
3
RAINBOW: Haplotype-based genome-wide association study using a novel SNP-set method.基于单倍型的全基因组关联研究,使用一种新的 SNP 集方法。
PLoS Comput Biol. 2020 Feb 14;16(2):e1007663. doi: 10.1371/journal.pcbi.1007663. eCollection 2020 Feb.
4
Genome-wide compound heterozygote analysis highlights alleles associated with adult height in Europeans.全基因组复合杂合子分析揭示了与欧洲成年人身高相关的等位基因。
Hum Genet. 2017 Nov;136(11-12):1407-1417. doi: 10.1007/s00439-017-1842-3. Epub 2017 Sep 18.
5
Genome-wide compound heterozygosity analysis highlighted 4 novel susceptibility loci for congenital heart disease in Chinese population.全基因组复合杂合性分析突出了中国人群先天性心脏病的 4 个新的易感基因座。
Clin Genet. 2018 Oct;94(3-4):296-302. doi: 10.1111/cge.13384. Epub 2018 Jun 29.
6
GenABEL: an R library for genome-wide association analysis.GenABEL:一个用于全基因组关联分析的R语言库。
Bioinformatics. 2007 May 15;23(10):1294-6. doi: 10.1093/bioinformatics/btm108. Epub 2007 Mar 23.
7
SEQMINER: An R-Package to Facilitate the Functional Interpretation of Sequence-Based Associations.SEQMINER:一个用于促进基于序列关联的功能解释的R包。
Genet Epidemiol. 2015 Dec;39(8):619-23. doi: 10.1002/gepi.21918. Epub 2015 Sep 23.
8
GENOME-WIDE ASSOCIATION MAPPING AND RARE ALLELES: FROM POPULATION GENOMICS TO PERSONALIZED MEDICINE - Session Introduction.全基因组关联图谱与罕见等位基因:从群体基因组学到个性化医学——会议介绍
Pac Symp Biocomput. 2011:74-5. doi: 10.1142/9789814335058_0008.
9
synbreed: a framework for the analysis of genomic prediction data using R.synbreed:一个使用 R 进行基因组预测数据分析的框架。
Bioinformatics. 2012 Aug 1;28(15):2086-7. doi: 10.1093/bioinformatics/bts335. Epub 2012 Jun 10.
10
PyHLA: tests for the association between HLA alleles and diseases.PyHLA:用于检测HLA等位基因与疾病之间关联的测试。
BMC Bioinformatics. 2017 Feb 6;18(1):90. doi: 10.1186/s12859-017-1496-0.

引用本文的文献

1
Decreased Immunoglobulin G Core Fucosylation, A Player in Antibody-dependent Cell-mediated Cytotoxicity, is Associated with Autoimmune Thyroid Diseases.免疫球蛋白 G 核心岩藻糖基化降低,抗体依赖的细胞介导的细胞毒性的一个参与者,与自身免疫性甲状腺疾病相关。
Mol Cell Proteomics. 2020 May;19(5):774-792. doi: 10.1074/mcp.RA119.001860. Epub 2020 Feb 5.
2
In trans variant calling reveals enrichment for compound heterozygous variants in genes involved in neuronal development and growth.在反式变异检测中发现,参与神经元发育和生长的基因中复合杂合变异出现富集。
Genet Res (Camb). 2019 Jun 13;101:e8. doi: 10.1017/S0016672319000065.
3
DNA sequence-level analyses reveal potential phenotypic modifiers in a large family with psychiatric disorders.

本文引用的文献

1
The MC1R Gene and Youthful Looks.MC1R基因与年轻容貌。
Curr Biol. 2016 May 9;26(9):1213-20. doi: 10.1016/j.cub.2016.03.008. Epub 2016 Apr 28.
2
The Rotterdam Study: 2016 objectives and design update.鹿特丹研究:2016年目标与设计更新
Eur J Epidemiol. 2015 Aug;30(8):661-708. doi: 10.1007/s10654-015-0082-x. Epub 2015 Sep 19.
3
Second-generation PLINK: rising to the challenge of larger and richer datasets.第二代PLINK:应对更大、更丰富数据集的挑战
DNA 序列水平分析揭示了一个有精神障碍的大家族中的潜在表型修饰因子。
Mol Psychiatry. 2018 Dec;23(12):2254-2265. doi: 10.1038/s41380-018-0087-4. Epub 2018 Jun 7.
4
Genome-wide compound heterozygote analysis highlights alleles associated with adult height in Europeans.全基因组复合杂合子分析揭示了与欧洲成年人身高相关的等位基因。
Hum Genet. 2017 Nov;136(11-12):1407-1417. doi: 10.1007/s00439-017-1842-3. Epub 2017 Sep 18.
5
DNA methylation-based forensic age prediction using artificial neural networks and next generation sequencing.使用人工神经网络和下一代测序技术基于DNA甲基化的法医年龄预测
Forensic Sci Int Genet. 2017 May;28:225-236. doi: 10.1016/j.fsigen.2017.02.009. Epub 2017 Feb 28.
6
The GenABEL Project for statistical genomics.用于统计基因组学的GenABEL项目。
F1000Res. 2016 May 19;5:914. doi: 10.12688/f1000research.8733.1. eCollection 2016.
Gigascience. 2015 Feb 25;4:7. doi: 10.1186/s13742-015-0047-8. eCollection 2015.
4
Testing the effect of rare compound-heterozygous and recessive mutations in case--parent sequencing studies.在病例-亲本测序研究中检测罕见复合杂合突变和隐性突变的效应。
Genet Epidemiol. 2015 Mar;39(3):166-72. doi: 10.1002/gepi.21885. Epub 2015 Jan 28.
5
Testing for modes of inheritance involving compound heterozygotes.检测涉及复合杂合子的遗传方式。
Genet Epidemiol. 2013 Jul;37(5):522-8. doi: 10.1002/gepi.21732. Epub 2013 Apr 30.
6
iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies.iLOCi:一种 SNP 相互作用优先级技术,用于检测全基因组关联研究中的上位性。
BMC Genomics. 2012;13 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2164-13-S7-S2. Epub 2012 Dec 13.
7
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
8
BiForce Toolbox: powerful high-throughput computational analysis of gene-gene interactions in genome-wide association studies.BiForce 工具包:全基因组关联研究中基因-基因相互作用的强大高通量计算分析。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W628-32. doi: 10.1093/nar/gks550. Epub 2012 Jun 11.
9
Detecting low frequent loss-of-function alleles in genome wide association studies with red hair color as example.以红发颜色为例,在全基因组关联研究中检测低频功能丧失等位基因。
PLoS One. 2011;6(11):e28145. doi: 10.1371/journal.pone.0028145. Epub 2011 Nov 29.
10
Model-based prediction of human hair color using DNA variants.基于模型的人类头发颜色 DNA 变体预测。
Hum Genet. 2011 Apr;129(4):443-54. doi: 10.1007/s00439-010-0939-8. Epub 2011 Jan 4.