• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用Affymetrix SNP阵列估计癌症群体混合和等位基因拷贝数的隐马尔可夫模型。

A Hidden Markov Model to estimate population mixture and allelic copy-numbers in cancers using Affymetrix SNP arrays.

作者信息

Lamy Philippe, Andersen Claus L, Dyrskjot Lars, Torring Niels, Wiuf Carsten

机构信息

Bioinformatics Research Center, University of Aarhus, Hoegh-Guldbergsgade 10, Bldg 1090, 8000 Aarhus C, Denmark.

出版信息

BMC Bioinformatics. 2007 Nov 9;8:434. doi: 10.1186/1471-2105-8-434.

DOI:10.1186/1471-2105-8-434
PMID:17996079
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2206057/
Abstract

BACKGROUND

Affymetrix SNP arrays can interrogate thousands of SNPs at the same time. This allows us to look at the genomic content of cancer cells and to investigate the underlying events leading to cancer. Genomic copy-numbers are today routinely derived from SNP array data, but the proposed algorithms for this task most often disregard the genotype information available from germline cells in paired germline-tumour samples. Including this information may deepen our understanding of the "true" biological situation e.g. by enabling analysis of allele specific copy-numbers. Here we rely on matched germline-tumour samples and have developed a Hidden Markov Model (HMM) to estimate allelic copy-number changes in tumour cells. Further with this approach we are able to estimate the proportion of normal cells in the tumour (mixture proportion).

RESULTS

We show that our method is able to recover the underlying copy-number changes in simulated data sets with high accuracy (above 97.71%). Moreover, although the known copy-numbers could be well recovered in simulated cancer samples with more than 70% cancer cells (and less than 30% normal cells), we demonstrate that including the mixture proportion in the HMM increases the accuracy of the method. Finally, the method is tested on HapMap samples and on bladder and prostate cancer samples.

CONCLUSION

The HMM method developed here uses the genotype calls of germline DNA and the allelic SNP intensities from the tumour DNA to estimate allelic copy-numbers (including changes) in the tumour. It differentiates between different events like uniparental disomy and allelic imbalances. Moreover, the HMM can estimate the mixture proportion, and thus inform about the purity of the tumour sample.

摘要

背景

Affymetrix SNP 芯片可同时检测数千个单核苷酸多态性(SNP)。这使我们能够研究癌细胞的基因组内容,并探究导致癌症的潜在事件。如今,基因组拷贝数通常从 SNP 芯片数据中推导得出,但针对此任务所提出的算法大多忽略了配对的种系 - 肿瘤样本中种系细胞的基因型信息。纳入这些信息可能会加深我们对“真实”生物学状况的理解,例如通过实现对等位基因特异性拷贝数的分析。在此,我们依赖配对的种系 - 肿瘤样本,并开发了一种隐马尔可夫模型(HMM)来估计肿瘤细胞中的等位基因拷贝数变化。通过这种方法,我们还能够估计肿瘤中正常细胞的比例(混合比例)。

结果

我们表明,我们的方法能够高精度地恢复模拟数据集中的潜在拷贝数变化(准确率高于 97.71%)。此外,尽管在癌细胞比例超过 70%(正常细胞比例低于 30%)的模拟癌症样本中能够很好地恢复已知拷贝数,但我们证明在 HMM 中纳入混合比例可提高该方法的准确性。最后,该方法在 HapMap 样本以及膀胱癌和前列腺癌样本上进行了测试。

结论

这里开发的 HMM 方法利用种系 DNA 的基因型调用和肿瘤 DNA 的等位基因 SNP 强度来估计肿瘤中的等位基因拷贝数(包括变化)。它能够区分单亲二体和等位基因失衡等不同事件。此外,HMM 可以估计混合比例,从而告知肿瘤样本的纯度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/bd77e4be5991/1471-2105-8-434-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/40c01d7fd066/1471-2105-8-434-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/04665bdfbd42/1471-2105-8-434-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/27d96f85550b/1471-2105-8-434-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/7b9a37b42d97/1471-2105-8-434-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/bd77e4be5991/1471-2105-8-434-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/40c01d7fd066/1471-2105-8-434-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/04665bdfbd42/1471-2105-8-434-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/27d96f85550b/1471-2105-8-434-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/7b9a37b42d97/1471-2105-8-434-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f80d/2206057/bd77e4be5991/1471-2105-8-434-5.jpg

相似文献

1
A Hidden Markov Model to estimate population mixture and allelic copy-numbers in cancers using Affymetrix SNP arrays.一种使用Affymetrix SNP阵列估计癌症群体混合和等位基因拷贝数的隐马尔可夫模型。
BMC Bioinformatics. 2007 Nov 9;8:434. doi: 10.1186/1471-2105-8-434.
2
Major copy proportion analysis of tumor samples using SNP arrays.使用单核苷酸多态性(SNP)阵列对肿瘤样本进行主要拷贝比例分析。
BMC Bioinformatics. 2008 Apr 21;9:204. doi: 10.1186/1471-2105-9-204.
3
CARAT: a novel method for allelic detection of DNA copy number changes using high density oligonucleotide arrays.CARAT:一种使用高密度寡核苷酸阵列检测DNA拷贝数变化等位基因的新方法。
BMC Bioinformatics. 2006 Feb 21;7:83. doi: 10.1186/1471-2105-7-83.
4
HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.单倍型拷贝数:利用隐马尔可夫模型和局部单倍型聚类进行拷贝数单倍型推断
PLoS One. 2014 May 21;9(5):e96841. doi: 10.1371/journal.pone.0096841. eCollection 2014.
5
MixHMM: inferring copy number variation and allelic imbalance using SNP arrays and tumor samples mixed with stromal cells.MixHMM:利用 SNP 阵列和混合有基质细胞的肿瘤样本推断拷贝数变异和等位基因失衡。
PLoS One. 2010 Jun 1;5(6):e10909. doi: 10.1371/journal.pone.0010909.
6
A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.一种用于Affymetrix SNP微阵列的多阵列多SNP基因分型算法。
Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25.
7
Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios.Illumina Infinium全基因组单核苷酸多态性(SNP)数据的标准化可改善拷贝数估计和等位基因强度比。
BMC Bioinformatics. 2008 Oct 2;9:409. doi: 10.1186/1471-2105-9-409.
8
Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform.用于评估 Affymetrix 6.0 SNP 阵列平台的基因组拷贝数变异的软件比较。
BMC Bioinformatics. 2011 May 31;12:220. doi: 10.1186/1471-2105-12-220.
9
Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.单核苷酸多态性(SNPs)、常见拷贝数多态性和罕见拷贝数变异(CNVs)的整合基因型分型与关联分析。
Nat Genet. 2008 Oct;40(10):1253-60. doi: 10.1038/ng.237. Epub 2008 Sep 7.
10
Genome-wide copy number analysis using copy number inferring tool (CNIT) and DNA pooling.使用拷贝数推断工具(CNIT)和DNA池进行全基因组拷贝数分析。
Hum Mutat. 2008 Aug;29(8):1055-62. doi: 10.1002/humu.20760.

引用本文的文献

1
Directional allelic imbalance profiling and visualization from multi-sample data with RECUR.使用 RECUR 从多样本数据中进行定向等位基因不平衡分析和可视化。
Bioinformatics. 2019 Jul 1;35(13):2300-2302. doi: 10.1093/bioinformatics/bty885.
2
Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data.Sequenza:来自肿瘤测序数据的等位基因特异性拷贝数和突变图谱。
Ann Oncol. 2015 Jan;26(1):64-70. doi: 10.1093/annonc/mdu479. Epub 2014 Oct 15.
3
Application of SNP microarrays to the genome-wide analysis of chromosomal instability in premalignant airway lesions.

本文引用的文献

1
DNA copy number alterations in prostate cancers: a combined analysis of published CGH studies.前列腺癌中的DNA拷贝数改变:已发表的比较基因组杂交研究的综合分析
Prostate. 2007 May 15;67(7):692-700. doi: 10.1002/pros.20543.
2
Genome-wide analysis of allelic imbalance in prostate cancer using the Affymetrix 50K SNP mapping array.使用Affymetrix 50K SNP基因分型芯片对前列腺癌等位基因不平衡进行全基因组分析。
Br J Cancer. 2007 Feb 12;96(3):499-506. doi: 10.1038/sj.bjc.6603476. Epub 2007 Jan 23.
3
Genotyping and annotation of Affymetrix SNP arrays.
SNP 微阵列在癌前气道病变中染色体不稳定性的全基因组分析中的应用。
Cancer Prev Res (Phila). 2014 Feb;7(2):255-65. doi: 10.1158/1940-6207.CAPR-12-0485. Epub 2013 Dec 17.
4
Biologic tumor behavior in pilocytic astrocytomas.毛细胞型星形细胞瘤的生物学肿瘤行为。
Childs Nerv Syst. 2012 Mar;28(3):375-89. doi: 10.1007/s00381-011-1676-6. Epub 2012 Jan 14.
5
Model-integrated estimation of normal tissue contamination for cancer SNP allelic copy number data.癌症单核苷酸多态性等位基因拷贝数数据正常组织污染的模型整合估计
Cancer Inform. 2011;10:159-73. doi: 10.4137/CIN.S6873. Epub 2011 May 25.
6
Parent-specific copy number in paired tumor-normal studies using circular binary segmentation.基于环形二元分割的配对肿瘤-正常样本中父系特异性拷贝数分析。
Bioinformatics. 2011 Aug 1;27(15):2038-46. doi: 10.1093/bioinformatics/btr329. Epub 2011 Jun 11.
7
BACOM: in silico detection of genomic deletion types and correction of normal cell contamination in copy number data.BACOM:在基因组缺失类型的计算检测中,对拷贝数数据中的正常细胞污染进行校正。
Bioinformatics. 2011 Jun 1;27(11):1473-80. doi: 10.1093/bioinformatics/btr183. Epub 2011 Apr 15.
8
A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data.一种基于单核苷酸多态性基因分型数据检测异质性肿瘤样本中基因组异常的统计方法。
Genome Biol. 2010;11(9):R92. doi: 10.1186/gb-2010-11-9-r92. Epub 2010 Sep 21.
9
An integrated Bayesian analysis of LOH and copy number data.LOH 和拷贝数数据的综合贝叶斯分析。
BMC Bioinformatics. 2010 Jun 15;11:321. doi: 10.1186/1471-2105-11-321.
10
Robust unmixing of tumor states in array comparative genomic hybridization data.阵列比较基因组杂交数据中肿瘤状态的稳健解混。
Bioinformatics. 2010 Jun 15;26(12):i106-14. doi: 10.1093/bioinformatics/btq213.
Affymetrix单核苷酸多态性阵列的基因分型与注释
Nucleic Acids Res. 2006;34(14):e100. doi: 10.1093/nar/gkl475. Epub 2006 Aug 9.
4
Frequent occurrence of uniparental disomy in colorectal cancer.单亲二体在结直肠癌中频繁出现。
Carcinogenesis. 2007 Jan;28(1):38-48. doi: 10.1093/carcin/bgl086. Epub 2006 Jun 13.
5
Inferring loss-of-heterozygosity from unpaired tumors using high-density oligonucleotide SNP arrays.使用高密度寡核苷酸单核苷酸多态性阵列从不配对肿瘤中推断杂合性缺失。
PLoS Comput Biol. 2006 May;2(5):e41. doi: 10.1371/journal.pcbi.0020041. Epub 2006 May 12.
6
CARAT: a novel method for allelic detection of DNA copy number changes using high density oligonucleotide arrays.CARAT:一种使用高密度寡核苷酸阵列检测DNA拷贝数变化等位基因的新方法。
BMC Bioinformatics. 2006 Feb 21;7:83. doi: 10.1186/1471-2105-7-83.
7
Allele-specific amplification in cancer revealed by SNP array analysis.通过单核苷酸多态性(SNP)阵列分析揭示癌症中的等位基因特异性扩增。
PLoS Comput Biol. 2005 Nov;1(6):e65. doi: 10.1371/journal.pcbi.0010065. Epub 2005 Nov 25.
8
Role of activating fibroblast growth factor receptor 3 mutations in the development of bladder tumors.激活成纤维细胞生长因子受体3突变在膀胱肿瘤发生发展中的作用
Clin Cancer Res. 2005 Nov 1;11(21):7709-19. doi: 10.1158/1078-0432.CCR-05-1130.
9
A genotype calling algorithm for affymetrix SNP arrays.一种用于Affymetrix SNP阵列的基因型分型算法。
Bioinformatics. 2006 Jan 1;22(1):7-12. doi: 10.1093/bioinformatics/bti741. Epub 2005 Nov 2.
10
The significance of tumor heterogeneity for prediction of DNA ploidy of prostate cancer.肿瘤异质性对前列腺癌DNA倍体预测的意义。
Scand J Urol Nephrol. 2005;39(5):387-92. doi: 10.1080/00365590500239883.