• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CLImAT-HET:从全基因组测序数据中检测异质性肿瘤样本中的亚克隆拷贝数改变和杂合性缺失

CLImAT-HET: detecting subclonal copy number alterations and loss of heterozygosity in heterogeneous tumor samples from whole-genome sequencing data.

作者信息

Yu Zhenhua, Li Ao, Wang Minghui

机构信息

School of Information Science and Technology, University of Science and Technology of China, Hefei, AH230027, China.

Centers for Biomedical Engineering, University of Science and Technology of China, Hefei, AH230027, China.

出版信息

BMC Med Genomics. 2017 Mar 15;10(1):15. doi: 10.1186/s12920-017-0255-4.

DOI:10.1186/s12920-017-0255-4
PMID:28298214
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5351278/
Abstract

BACKGROUND

Copy number alterations (CNA) and loss of heterozygosity (LOH) represent a large proportion of genetic structural variations of cancer genomes. These aberrations are continuously accumulated during the procedure of clonal evolution and patterned by phylogenetic branching. This invariably results in the emergence of multiple cell populations with distinct complement of mutational landscapes in tumor sample. With the advent of next-generation sequencing technology, inference of subclonal populations has become one of the focused interests in cancer-associated studies, and is usually based on the assessment of combinations of somatic single-nucleotide variations (SNV), CNA and LOH. However, cancer samples often have several inherent issues, such as contamination of normal stroma, tumor aneuploidy and intra-tumor heterogeneity. Addressing these critical issues is imperative for accurate profiling of clonal architecture.

METHODS

We present CLImAT-HET, a computational method designed for capturing clonal diversity in the CNA/LOH dimensions by taking into account the intra-tumor heterogeneity issue, in the case where a reference or matched normal sample is absent. The algorithm quantitatively represents the clonal identification problem using a factorial hidden Markov model, and takes an integrated analysis of read counts and allele frequency data. It is able to infer subclonal CNA and LOH events as well as the fraction of cells harboring each event.

RESULTS

The results on simulated datasets indicate that CLImAT-HET has high power to identify CNA/LOH segments, it achieves an average accuracy of 0.87. It can also accurately infer proportion of each clonal population with an overall Pearson correlation coefficient of 0.99 and a mean absolute error of 0.02. CLImAT-HET shows significant advantages when compared with other existing methods. Application of CLImAT-HET to 5 primary triple negative breast cancer samples demonstrates its ability to capture clonal diversity in the CAN/LOH dimensions. It detects two clonal populations in one sample, and three clonal populations in one other sample.

CONCLUSIONS

CLImAT-HET, a novel algorithm is introduced to infer CNA/LOH segments from heterogeneous tumor samples. We demonstrate CLImAT-HET's ability to accurately recover clonal compositions using tumor WGS data without a match normal sample.

摘要

背景

拷贝数改变(CNA)和杂合性缺失(LOH)占癌症基因组遗传结构变异的很大比例。这些畸变在克隆进化过程中不断积累,并由系统发育分支形成模式。这必然导致肿瘤样本中出现具有不同突变景观互补的多个细胞群体。随着下一代测序技术的出现,亚克隆群体的推断已成为癌症相关研究的重点关注之一,通常基于对体细胞单核苷酸变异(SNV)、CNA和LOH组合的评估。然而,癌症样本往往存在几个固有问题,如正常基质的污染、肿瘤非整倍性和肿瘤内异质性。解决这些关键问题对于准确描绘克隆结构至关重要。

方法

我们提出了CLImAT-HET,这是一种计算方法,旨在通过考虑肿瘤内异质性问题,在没有参考或匹配正常样本的情况下,在CNA/LOH维度中捕获克隆多样性。该算法使用因子隐马尔可夫模型定量表示克隆识别问题,并对读取计数和等位基因频率数据进行综合分析。它能够推断亚克隆CNA和LOH事件以及携带每个事件的细胞比例。

结果

模拟数据集的结果表明,CLImAT-HET具有很高的识别CNA/LOH片段的能力,平均准确率达到0.87。它还可以准确推断每个克隆群体的比例,总体皮尔逊相关系数为0.99,平均绝对误差为0.02。与其他现有方法相比,CLImAT-HET显示出显著优势。将CLImAT-HET应用于5个原发性三阴性乳腺癌样本,证明了其在CAN/LOH维度中捕获克隆多样性的能力。它在一个样本中检测到两个克隆群体,在另一个样本中检测到三个克隆群体。

结论

引入了一种新算法CLImAT-HET,用于从异质性肿瘤样本中推断CNA/LOH片段。我们证明了CLImAT-HET能够使用肿瘤全基因组测序(WGS)数据准确恢复克隆组成,而无需匹配的正常样本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/5d8ab4b366d2/12920_2017_255_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/7e354f4b081f/12920_2017_255_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/2a3768acb723/12920_2017_255_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/7124c1c9bf0f/12920_2017_255_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/ce306899809e/12920_2017_255_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/4e7444f4a401/12920_2017_255_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/11ff0d0cd36c/12920_2017_255_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/328aa5f78192/12920_2017_255_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/5d8ab4b366d2/12920_2017_255_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/7e354f4b081f/12920_2017_255_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/2a3768acb723/12920_2017_255_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/7124c1c9bf0f/12920_2017_255_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/ce306899809e/12920_2017_255_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/4e7444f4a401/12920_2017_255_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/11ff0d0cd36c/12920_2017_255_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/328aa5f78192/12920_2017_255_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5cc0/5351278/5d8ab4b366d2/12920_2017_255_Fig8_HTML.jpg

相似文献

1
CLImAT-HET: detecting subclonal copy number alterations and loss of heterozygosity in heterogeneous tumor samples from whole-genome sequencing data.CLImAT-HET:从全基因组测序数据中检测异质性肿瘤样本中的亚克隆拷贝数改变和杂合性缺失
BMC Med Genomics. 2017 Mar 15;10(1):15. doi: 10.1186/s12920-017-0255-4.
2
CLImAT: accurate detection of copy number alteration and loss of heterozygosity in impure and aneuploid tumor samples using whole-genome sequencing data.CLImAT:利用全基因组测序数据准确检测不纯和非整倍体肿瘤样本中的拷贝数改变和杂合性缺失。
Bioinformatics. 2014 Sep 15;30(18):2576-83. doi: 10.1093/bioinformatics/btu346. Epub 2014 May 19.
3
CloneCNA: detecting subclonal somatic copy number alterations in heterogeneous tumor samples from whole-exome sequencing data.CloneCNA:从全外显子测序数据中检测异质性肿瘤样本中的亚克隆体细胞拷贝数改变。
BMC Bioinformatics. 2016 Aug 19;17:310. doi: 10.1186/s12859-016-1174-7.
4
TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data.TITAN:从肿瘤全基因组序列数据推断克隆细胞群体中的拷贝数结构
Genome Res. 2014 Nov;24(11):1881-93. doi: 10.1101/gr.180281.114. Epub 2014 Jul 24.
5
Hierarchical discovery of large-scale and focal copy number alterations in low-coverage cancer genomes.在低覆盖度癌症基因组中进行大规模和焦点拷贝数改变的层次式发现。
BMC Bioinformatics. 2020 Apr 16;21(1):147. doi: 10.1186/s12859-020-3480-3.
6
MixClone: a mixture model for inferring tumor subclonal populations.MixClone:一种用于推断肿瘤亚克隆群体的混合模型。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2164-16-S2-S1. Epub 2015 Jan 21.
7
Landscape of somatic allelic imbalances and copy number alterations in HER2-amplified breast cancer.HER2 扩增型乳腺癌中的体细胞等位基因失衡和拷贝数改变的全景图。
Breast Cancer Res. 2011;13(6):R129. doi: 10.1186/bcr3075. Epub 2011 Dec 14.
8
FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing.FACETS:用于高通量DNA测序的等位基因特异性拷贝数和克隆异质性分析工具。
Nucleic Acids Res. 2016 Sep 19;44(16):e131. doi: 10.1093/nar/gkw520. Epub 2016 Jun 7.
9
Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.通过对选定区域进行靶向测序来鉴定全人类基因组的拷贝数变异(CNV)、杂合性缺失(LOH)和单亲二倍体(UPD)。
PLoS One. 2015 Apr 28;10(4):e0123081. doi: 10.1371/journal.pone.0123081. eCollection 2014.
10
BiTSC 2: Bayesian inference of tumor clonal tree by joint analysis of single-cell SNV and CNA data.BiTSC 2:通过单细胞 SNV 和 CNA 数据的联合分析进行肿瘤克隆树的贝叶斯推断。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac092.

引用本文的文献

1
Characterizing the allele-specific gene expression landscape in high hyperdiploid acute lymphoblastic leukemia with BASE.利用 BASE 描绘高倍体超二倍体急性淋巴细胞白血病的等位基因特异性基因表达图谱。
Sci Rep. 2024 Oct 5;14(1):23181. doi: 10.1038/s41598-024-73743-8.
2
On the core segmentation algorithms of copy number variation detection tools.基于拷贝数变异检测工具的核心分割算法。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae022.
3
Inferring single-cell copy number profiles through cross-cell segmentation of read counts.

本文引用的文献

1
Quantifying tumor heterogeneity in whole-genome and whole-exome sequencing data.量化全基因组和全外显子组测序数据中的肿瘤异质性。
Bioinformatics. 2014 Dec 15;30(24):3532-40. doi: 10.1093/bioinformatics/btu651. Epub 2014 Oct 8.
2
TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data.TITAN:从肿瘤全基因组序列数据推断克隆细胞群体中的拷贝数结构
Genome Res. 2014 Nov;24(11):1881-93. doi: 10.1101/gr.180281.114. Epub 2014 Jul 24.
3
CLImAT: accurate detection of copy number alteration and loss of heterozygosity in impure and aneuploid tumor samples using whole-genome sequencing data.
通过读取计数的跨细胞分割推断单细胞拷贝数谱。
BMC Genomics. 2024 Jan 2;25(1):25. doi: 10.1186/s12864-023-09901-5.
4
JLOH: Inferring loss of heterozygosity blocks from sequencing data.《人类遗传学杂志》:从测序数据推断杂合性缺失区域。
Comput Struct Biotechnol J. 2023 Nov 7;21:5738-5750. doi: 10.1016/j.csbj.2023.11.003. eCollection 2023.
5
Application of third-generation sequencing in cancer research.第三代测序技术在癌症研究中的应用。
Med Rev (2021). 2021 Oct 21;1(2):150-171. doi: 10.1515/mr-2021-0013. eCollection 2021 Dec.
6
SCClone: Accurate Clustering of Tumor Single-Cell DNA Sequencing Data.SCClone:肿瘤单细胞DNA测序数据的精确聚类
Front Genet. 2022 Jan 27;13:823941. doi: 10.3389/fgene.2022.823941. eCollection 2022.
7
JAX-CNV: A Whole-genome Sequencing-based Algorithm for Copy Number Detection at Clinical Grade Level.JAX-CNV:一种基于全基因组测序的拷贝数变异检测算法,可达到临床级别的水平。
Genomics Proteomics Bioinformatics. 2022 Dec;20(6):1197-1206. doi: 10.1016/j.gpb.2021.06.003. Epub 2022 Jan 25.
8
CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data.CIRCNV:基于测序数据读深度的圆形分布检测拷贝数变异
Biology (Basel). 2021 Jun 25;10(7):584. doi: 10.3390/biology10070584.
9
STARCH: copy number and clone inference from spatial transcriptomics data.STARCH:从空间转录组学数据中推断拷贝数和克隆。
Phys Biol. 2021 Mar 9;18(3):035001. doi: 10.1088/1478-3975/abbe99.
10
SimuSCoP: reliably simulate Illumina sequencing data based on position and context dependent profiles.SimuSCoP:基于位置和上下文相关的分布可靠地模拟 Illumina 测序数据。
BMC Bioinformatics. 2020 Jul 23;21(1):331. doi: 10.1186/s12859-020-03665-5.
CLImAT:利用全基因组测序数据准确检测不纯和非整倍体肿瘤样本中的拷贝数改变和杂合性缺失。
Bioinformatics. 2014 Sep 15;30(18):2576-83. doi: 10.1093/bioinformatics/btu346. Epub 2014 May 19.
4
OncoSNP-SEQ: a statistical approach for the identification of somatic copy number alterations from next-generation sequencing of cancer genomes.OncoSNP-SEQ:一种从癌症基因组的下一代测序中鉴定体细胞拷贝数改变的统计方法。
Bioinformatics. 2013 Oct 1;29(19):2482-4. doi: 10.1093/bioinformatics/btt416. Epub 2013 Aug 7.
5
PurBayes: estimating tumor cellularity and subclonality in next-generation sequencing data.PurBayes:在下一代测序数据中估计肿瘤细胞含量和亚克隆性。
Bioinformatics. 2013 Aug 1;29(15):1888-9. doi: 10.1093/bioinformatics/btt293. Epub 2013 Jun 6.
6
Evolution of the cancer genome.癌症基因组的演变。
Nat Rev Genet. 2012 Nov;13(11):795-806. doi: 10.1038/nrg3317. Epub 2012 Oct 9.
7
Integrative analysis of genome-wide loss of heterozygosity and monoallelic expression at nucleotide resolution reveals disrupted pathways in triple-negative breast cancer.全基因组杂合性缺失和核苷酸分辨率下单等位基因表达的综合分析揭示了三阴性乳腺癌中失调的通路。
Genome Res. 2012 Oct;22(10):1995-2007. doi: 10.1101/gr.137570.112. Epub 2012 May 25.
8
Absolute quantification of somatic DNA alterations in human cancer.人类癌症中体细胞 DNA 改变的绝对定量。
Nat Biotechnol. 2012 May;30(5):413-21. doi: 10.1038/nbt.2203.
9
The clonal and mutational evolution spectrum of primary triple-negative breast cancers.原发性三阴性乳腺癌的克隆和突变进化图谱。
Nature. 2012 Apr 4;486(7403):395-9. doi: 10.1038/nature10933.
10
Clonal evolution in cancer.癌症中的克隆进化。
Nature. 2012 Jan 18;481(7381):306-13. doi: 10.1038/nature10762.