• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用单倍型调用拷贝数多态性的方法。

A method for calling copy number polymorphism using haplotypes.

机构信息

Department of Biostatistics and Epidemiology, Center for Clinical Epidemiology and Biostatistics, University of Pennsylvania Philadelphia, PA, USA.

出版信息

Front Genet. 2013 Sep 23;4:165. doi: 10.3389/fgene.2013.00165. eCollection 2013.

DOI:10.3389/fgene.2013.00165
PMID:24069028
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3780619/
Abstract

Single nucleotide polymorphism (SNP) and copy number variation (CNV) are both widespread characteristic of the human genome, but are often called separately on common genotyping platforms. To capture integrated SNP and CNV information, methods have been developed for calling allelic specific copy numbers or so called copy number polymorphism (CNP), using limited inter-marker correlation. In this paper, we proposed a haplotype-based maximum likelihood method to call CNP, which takes advantage of the valuable multi-locus linkage disequilibrium (LD) information in the population. We also developed a computationally efficient algorithm to estimate haplotype frequencies and optimize individual CNP calls iteratively, even at presence of missing data. Through simulations, we demonstrated our model is more sensitive and accurate in detecting various CNV regions, compared with commonly-used CNV calling methods including PennCNV, another hidden Markov model (HMM) using CNP, a scan statistic, segCNV, and cnvHap. Our method often performs better in the regions with higher LD, in longer CNV regions, and in common CNV than the opposite. We implemented our method on the genotypes of 90 HapMap CEU samples and 23 patients with acute lung injury (ALI). For each ALI patient the genotyping was performed twice. The CNPs from our method show good consistency and accuracy comparable to others.

摘要

单核苷酸多态性(SNP)和拷贝数变异(CNV)都是人类基因组的广泛特征,但在常见的基因分型平台上通常分别进行研究。为了获取 SNP 和 CNV 的综合信息,已经开发了一些方法来调用等位基因特异性拷贝数或所谓的拷贝数多态性(CNP),这些方法利用了有限的标记间相关性。在本文中,我们提出了一种基于单倍型的最大似然方法来调用 CNP,该方法利用了群体中宝贵的多基因座连锁不平衡(LD)信息。我们还开发了一种计算效率高的算法来估计单倍型频率,并通过迭代优化个体 CNP 调用,即使存在缺失数据。通过模拟,我们表明与常用的 CNV 调用方法(包括 PennCNV、另一种使用 CNP 的隐马尔可夫模型(HMM)、扫描统计、segCNV 和 cnvHap)相比,我们的模型在检测各种 CNV 区域方面更敏感和准确。与相反的情况相比,我们的方法在 LD 较高的区域、较长的 CNV 区域和常见的 CNV 中表现更好。我们在 90 个 HapMap CEU 样本和 23 名急性肺损伤(ALI)患者的基因型上实现了我们的方法。每位 ALI 患者进行了两次基因分型。我们方法的 CNP 显示出良好的一致性和可与他人相媲美的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/945e/3780619/bef1e2d0a924/fgene-04-00165-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/945e/3780619/7b1a539816eb/fgene-04-00165-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/945e/3780619/bef1e2d0a924/fgene-04-00165-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/945e/3780619/7b1a539816eb/fgene-04-00165-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/945e/3780619/bef1e2d0a924/fgene-04-00165-g0002.jpg

相似文献

1
A method for calling copy number polymorphism using haplotypes.一种使用单倍型调用拷贝数多态性的方法。
Front Genet. 2013 Sep 23;4:165. doi: 10.3389/fgene.2013.00165. eCollection 2013.
2
HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.单倍型拷贝数:利用隐马尔可夫模型和局部单倍型聚类进行拷贝数单倍型推断
PLoS One. 2014 May 21;9(5):e96841. doi: 10.1371/journal.pone.0096841. eCollection 2014.
3
Genome-wide algorithm for detecting CNV associations with diseases.全基因组算法检测与疾病相关的 CNV 关联。
BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.
4
Identification of Copy Number Variants from SNP Arrays Using PennCNV.使用PennCNV从SNP阵列中鉴定拷贝数变异
Methods Mol Biol. 2018;1833:1-28. doi: 10.1007/978-1-4939-8666-8_1.
5
Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle.使用高密度或中密度单核苷酸多态性基因分型面板检测到的拷贝数变异与从牛侧翼高密度单核苷酸多态性单倍型推断拷贝数变异的一致性。
BMC Genomics. 2020 Mar 4;21(1):205. doi: 10.1186/s12864-020-6627-8.
6
Inferring combined CNV/SNP haplotypes from genotype data.从基因型数据推断 CNV/SNP 单体型。
Bioinformatics. 2010 Jun 1;26(11):1437-45. doi: 10.1093/bioinformatics/btq157. Epub 2010 Apr 20.
7
Effect of Combining Multiple CNV Defining Algorithms on the Reliability of CNV Calls from SNP Genotyping Data.多种拷贝数变异(CNV)定义算法相结合对基于单核苷酸多态性(SNP)基因分型数据的CNV检测可靠性的影响
Genomics Inform. 2012 Sep;10(3):194-9. doi: 10.5808/GI.2012.10.3.194. Epub 2012 Sep 28.
8
Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.单核苷酸多态性(SNPs)、常见拷贝数多态性和罕见拷贝数变异(CNVs)的整合基因型分型与关联分析。
Nat Genet. 2008 Oct;40(10):1253-60. doi: 10.1038/ng.237. Epub 2008 Sep 7.
9
Genomic and genetic variability of six chicken populations using single nucleotide polymorphism and copy number variants as markers.使用单核苷酸多态性和拷贝数变异作为标记对六个鸡群体的基因组和遗传变异性进行研究。
Animal. 2017 May;11(5):737-745. doi: 10.1017/S1751731116002135. Epub 2016 Nov 7.
10
Copy Number Variation Detection via High-Density SNP Genotyping.通过高密度单核苷酸多态性基因分型进行拷贝数变异检测。
CSH Protoc. 2008 Jun 1;2008:pdb.top46. doi: 10.1101/pdb.top46.

引用本文的文献

1
A hidden Markov model for haplotype inference for present-absent data of clustered genes using identified haplotypes and haplotype patterns.使用已识别的单倍型和单倍型模式,为聚类基因的存在-缺失数据进行单倍型推断的隐马尔可夫模型。
Front Genet. 2014 Aug 12;5:267. doi: 10.3389/fgene.2014.00267. eCollection 2014.
2
VTET: a variable threshold exact test for identifying disease-associated copy number variations enriched in short genomic regions.VTET:一种用于识别短基因组区域中富集的与疾病相关的拷贝数变异的可变阈值精确检验方法。
Front Genet. 2014 Mar 18;5:53. doi: 10.3389/fgene.2014.00053. eCollection 2014.

本文引用的文献

1
Optimal Sparse Segment Identification with Application in Copy Number Variation Analysis.用于拷贝数变异分析的最优稀疏片段识别
J Am Stat Assoc. 2010 Apr 1;105(491):1156-1166. doi: 10.1198/jasa.2010.tm10083. Epub 2012 Jan 1.
2
An integrative segmentation method for detecting germline copy number variations in SNP arrays.一种用于 SNP 阵列中检测种系拷贝数变异的综合分割方法。
Genet Epidemiol. 2012 May;36(4):373-83. doi: 10.1002/gepi.21631.
3
Genome wide association identifies PPFIA1 as a candidate gene for acute lung injury risk following major trauma.
全基因组关联分析鉴定 PPFIA1 为重大创伤后急性肺损伤风险的候选基因。
PLoS One. 2012;7(1):e28268. doi: 10.1371/journal.pone.0028268. Epub 2012 Jan 25.
4
Genome-wide detection of allele specific copy number variation associated with insulin resistance in African Americans from the HyperGEN study.全基因组检测与非洲裔美国人胰岛素抵抗相关的等位基因特异性拷贝数变异:来自 HyperGEN 研究。
PLoS One. 2011;6(8):e24052. doi: 10.1371/journal.pone.0024052. Epub 2011 Aug 25.
5
Accuracy of CNV Detection from GWAS Data.从 GWAS 数据中检测 CNV 的准确性。
PLoS One. 2011 Jan 13;6(1):e14511. doi: 10.1371/journal.pone.0014511.
6
cnvHap: an integrative population and haplotype-based multiplatform model of SNPs and CNVs.cnvHap:一种基于人群和单倍型的整合 SNP 和 CNV 的多平台模型。
Nat Methods. 2010 Jul;7(7):541-6. doi: 10.1038/nmeth.1466. Epub 2010 May 30.
7
Inferring combined CNV/SNP haplotypes from genotype data.从基因型数据推断 CNV/SNP 单体型。
Bioinformatics. 2010 Jun 1;26(11):1437-45. doi: 10.1093/bioinformatics/btq157. Epub 2010 Apr 20.
8
Origins and functional impact of copy number variation in the human genome.人类基因组中拷贝数变异的起源和功能影响。
Nature. 2010 Apr 1;464(7289):704-12. doi: 10.1038/nature08516. Epub 2009 Oct 7.
9
Comparing CNV detection methods for SNP arrays.比较单核苷酸多态性(SNP)阵列的拷贝数变异(CNV)检测方法。
Brief Funct Genomic Proteomic. 2009 Sep;8(5):353-66. doi: 10.1093/bfgp/elp017. Epub 2009 Sep 8.
10
Markov Models for inferring copy number variations from genotype data on Illumina platforms.用于从Illumina平台的基因型数据推断拷贝数变异的马尔可夫模型。
Hum Hered. 2009;68(1):1-22. doi: 10.1159/000210445. Epub 2009 Apr 1.