• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单倍型拷贝数:利用隐马尔可夫模型和局部单倍型聚类进行拷贝数单倍型推断

HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.

作者信息

Lin Yen-Jen, Chen Yu-Tin, Hsu Shu-Ni, Peng Chien-Hua, Tang Chuan-Yi, Yen Tzu-Chen, Hsieh Wen-Ping

机构信息

Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan.

Institute of Statistics, National Tsing Hua University, Hsinchu, Taiwan.

出版信息

PLoS One. 2014 May 21;9(5):e96841. doi: 10.1371/journal.pone.0096841. eCollection 2014.

DOI:10.1371/journal.pone.0096841
PMID:24849202
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4029584/
Abstract

Copy number variation (CNV) has been reported to be associated with disease and various cancers. Hence, identifying the accurate position and the type of CNV is currently a critical issue. There are many tools targeting on detecting CNV regions, constructing haplotype phases on CNV regions, or estimating the numerical copy numbers. However, none of them can do all of the three tasks at the same time. This paper presents a method based on Hidden Markov Model to detect parent specific copy number change on both chromosomes with signals from SNP arrays. A haplotype tree is constructed with dynamic branch merging to model the transition of the copy number status of the two alleles assessed at each SNP locus. The emission models are constructed for the genotypes formed with the two haplotypes. The proposed method can provide the segmentation points of the CNV regions as well as the haplotype phasing for the allelic status on each chromosome. The estimated copy numbers are provided as fractional numbers, which can accommodate the somatic mutation in cancer specimens that usually consist of heterogeneous cell populations. The algorithm is evaluated on simulated data and the previously published regions of CNV of the 270 HapMap individuals. The results were compared with five popular methods: PennCNV, genoCN, COKGEN, QuantiSNP and cnvHap. The application on oral cancer samples demonstrates how the proposed method can facilitate clinical association studies. The proposed algorithm exhibits comparable sensitivity of the CNV regions to the best algorithm in our genome-wide study and demonstrates the highest detection rate in SNP dense regions. In addition, we provide better haplotype phasing accuracy than similar approaches. The clinical association carried out with our fractional estimate of copy numbers in the cancer samples provides better detection power than that with integer copy number states.

摘要

据报道,拷贝数变异(CNV)与疾病及各种癌症相关。因此,确定CNV的准确位置和类型是当前的关键问题。有许多工具旨在检测CNV区域、构建CNV区域的单倍型相位或估计拷贝数的数值。然而,它们中没有一个能同时完成这三项任务。本文提出了一种基于隐马尔可夫模型的方法,利用SNP阵列的信号检测两条染色体上亲本特异性的拷贝数变化。通过动态分支合并构建单倍型树,以模拟在每个SNP位点评估的两个等位基因拷贝数状态的转变。为两个单倍型形成的基因型构建发射模型。所提出的方法可以提供CNV区域的分割点以及每条染色体上等位基因状态的单倍型相位。估计的拷贝数以分数形式提供,这可以适应癌症标本中通常由异质细胞群体组成的体细胞突变。该算法在模拟数据和先前发表的270个HapMap个体的CNV区域上进行了评估。将结果与五种常用方法进行了比较:PennCNV、genoCN、COKGEN、QuantiSNP和cnvHap。在口腔癌样本上的应用展示了所提出的方法如何促进临床关联研究。在我们全基因组研究中,所提出的算法对CNV区域表现出与最佳算法相当的敏感性,并在SNP密集区域显示出最高的检测率。此外,我们提供了比类似方法更好的单倍型相位准确性。在癌症样本中使用我们对拷贝数的分数估计进行临床关联研究,比使用整数拷贝数状态提供了更好的检测能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/975573ee5ea8/pone.0096841.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/bc3723baa3e9/pone.0096841.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/9df18a06fbf2/pone.0096841.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/4052a2802a9b/pone.0096841.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/8fafa0f153e0/pone.0096841.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/23dc9c277953/pone.0096841.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/975573ee5ea8/pone.0096841.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/bc3723baa3e9/pone.0096841.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/9df18a06fbf2/pone.0096841.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/4052a2802a9b/pone.0096841.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/8fafa0f153e0/pone.0096841.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/23dc9c277953/pone.0096841.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fec/4029584/975573ee5ea8/pone.0096841.g006.jpg

相似文献

1
HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.单倍型拷贝数:利用隐马尔可夫模型和局部单倍型聚类进行拷贝数单倍型推断
PLoS One. 2014 May 21;9(5):e96841. doi: 10.1371/journal.pone.0096841. eCollection 2014.
2
Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform.用于评估 Affymetrix 6.0 SNP 阵列平台的基因组拷贝数变异的软件比较。
BMC Bioinformatics. 2011 May 31;12:220. doi: 10.1186/1471-2105-12-220.
3
A method for calling copy number polymorphism using haplotypes.一种使用单倍型调用拷贝数多态性的方法。
Front Genet. 2013 Sep 23;4:165. doi: 10.3389/fgene.2013.00165. eCollection 2013.
4
Inferring combined CNV/SNP haplotypes from genotype data.从基因型数据推断 CNV/SNP 单体型。
Bioinformatics. 2010 Jun 1;26(11):1437-45. doi: 10.1093/bioinformatics/btq157. Epub 2010 Apr 20.
5
Haplotype phasing and inheritance of copy number variants in nuclear families.核心家庭中单体型定相及拷贝数变异的遗传
PLoS One. 2015 Apr 8;10(4):e0122713. doi: 10.1371/journal.pone.0122713. eCollection 2015.
6
Genome-wide algorithm for detecting CNV associations with diseases.全基因组算法检测与疾病相关的 CNV 关联。
BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.
7
Identification of Copy Number Variants from SNP Arrays Using PennCNV.使用PennCNV从SNP阵列中鉴定拷贝数变异
Methods Mol Biol. 2018;1833:1-28. doi: 10.1007/978-1-4939-8666-8_1.
8
A Hidden Markov Model to estimate population mixture and allelic copy-numbers in cancers using Affymetrix SNP arrays.一种使用Affymetrix SNP阵列估计癌症群体混合和等位基因拷贝数的隐马尔可夫模型。
BMC Bioinformatics. 2007 Nov 9;8:434. doi: 10.1186/1471-2105-8-434.
9
Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle.使用高密度或中密度单核苷酸多态性基因分型面板检测到的拷贝数变异与从牛侧翼高密度单核苷酸多态性单倍型推断拷贝数变异的一致性。
BMC Genomics. 2020 Mar 4;21(1):205. doi: 10.1186/s12864-020-6627-8.
10
Inference of haplotypic phase and missing genotypes in polyploid organisms and variable copy number genomic regions.多倍体生物和可变拷贝数基因组区域中单体型相位及缺失基因型的推断。
BMC Bioinformatics. 2008 Dec 1;9:513. doi: 10.1186/1471-2105-9-513.

引用本文的文献

1
Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure.充分利用 SNP 阵列:提取潜在基因组结构的工具的系统评价。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac043.
2
The Role of Constitutional Copy Number Variants in Breast Cancer.体质性拷贝数变异在乳腺癌中的作用
Microarrays (Basel). 2015 Sep 8;4(3):407-23. doi: 10.3390/microarrays4030407.
3
Haplotype phasing and inheritance of copy number variants in nuclear families.核心家庭中单体型定相及拷贝数变异的遗传

本文引用的文献

1
Somatic mutations in the HLA genes of patients with hematological malignancy.血液系统恶性肿瘤患者HLA基因的体细胞突变
Tissue Antigens. 2012 May;79(5):359-66. doi: 10.1111/j.1399-0039.2012.01868.x.
2
Evolution of the cancer genome.癌症基因组的演变。
Trends Genet. 2012 Apr;28(4):155-63. doi: 10.1016/j.tig.2012.01.003. Epub 2012 Feb 16.
3
Allele-specific copy number analysis of tumor samples with aneuploidy and tumor heterogeneity.肿瘤样本的等位基因特异性拷贝数分析,包括非整倍体和肿瘤异质性。
PLoS One. 2015 Apr 8;10(4):e0122713. doi: 10.1371/journal.pone.0122713. eCollection 2015.
Genome Biol. 2011 Oct 24;12(10):R108. doi: 10.1186/gb-2011-12-10-r108.
4
A novel molecular signature identified by systems genetics approach predicts prognosis in oral squamous cell carcinoma.系统遗传学方法鉴定的新型分子特征可预测口腔鳞状细胞癌的预后。
PLoS One. 2011;6(8):e23452. doi: 10.1371/journal.pone.0023452. Epub 2011 Aug 11.
5
Parent-specific copy number in paired tumor-normal studies using circular binary segmentation.基于环形二元分割的配对肿瘤-正常样本中父系特异性拷贝数分析。
Bioinformatics. 2011 Aug 1;27(15):2038-46. doi: 10.1093/bioinformatics/btr329. Epub 2011 Jun 11.
6
Inference of chromosome-specific copy numbers using population haplotypes.基于群体单体型推断染色体特异性拷贝数。
BMC Bioinformatics. 2011 May 24;12:194. doi: 10.1186/1471-2105-12-194.
7
Estimation of parent specific DNA copy number in tumors using high-density genotyping arrays.利用高密度基因分型阵列估计肿瘤中的亲本特异性 DNA 拷贝数。
PLoS Comput Biol. 2011 Jan 27;7(1):e1001060. doi: 10.1371/journal.pcbi.1001060.
8
Allele-specific copy number analysis of tumors.肿瘤的等位基因特异性拷贝数分析。
Proc Natl Acad Sci U S A. 2010 Sep 28;107(39):16910-5. doi: 10.1073/pnas.1009843107. Epub 2010 Sep 13.
9
cnvHap: an integrative population and haplotype-based multiplatform model of SNPs and CNVs.cnvHap:一种基于人群和单倍型的整合 SNP 和 CNV 的多平台模型。
Nat Methods. 2010 Jul;7(7):541-6. doi: 10.1038/nmeth.1466. Epub 2010 May 30.
10
COKGEN: a software for the identification of rare copy number variation from SNP microarrays.COKGEN:一款用于从单核苷酸多态性微阵列中识别罕见拷贝数变异的软件。
Pac Symp Biocomput. 2010:371-82.