鉴定稀疏单细胞突变数据中的肿瘤克隆。

Identifying tumor clones in sparse single-cell mutation data.

机构信息

Department of Computer Science, Princeton University, Princeton, NJ 08544, USA.

出版信息

Bioinformatics. 2020 Jul 1;36(Suppl_1):i186-i193. doi: 10.1093/bioinformatics/btaa449.

DOI:10.1093/bioinformatics/btaa449

PMID:32657385

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7355247/

Abstract

MOTIVATION

Recent single-cell DNA sequencing technologies enable whole-genome sequencing of hundreds to thousands of individual cells. However, these technologies have ultra-low sequencing coverage (<0.5× per cell) which has limited their use to the analysis of large copy-number aberrations (CNAs) in individual cells. While CNAs are useful markers in cancer studies, single-nucleotide mutations are equally important, both in cancer studies and in other applications. However, ultra-low coverage sequencing yields single-nucleotide mutation data that are too sparse for current single-cell analysis methods.

RESULTS

We introduce SBMClone, a method to infer clusters of cells, or clones, that share groups of somatic single-nucleotide mutations. SBMClone uses a stochastic block model to overcome sparsity in ultra-low coverage single-cell sequencing data, and we show that SBMClone accurately infers the true clonal composition on simulated datasets with coverage at low as 0.2×. We applied SBMClone to single-cell whole-genome sequencing data from two breast cancer patients obtained using two different sequencing technologies. On the first patient, sequenced using the 10X Genomics CNV solution with sequencing coverage ≈0.03×, SBMClone recovers the major clonal composition when incorporating a small amount of additional information. On the second patient, where pre- and post-treatment tumor samples were sequenced using DOP-PCR with sequencing coverage ≈0.5×, SBMClone shows that tumor cells are present in the post-treatment sample, contrary to published analysis of this dataset.

AVAILABILITY AND IMPLEMENTATION

SBMClone is available on the GitHub repository https://github.com/raphael-group/SBMClone.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

最近的单细胞 DNA 测序技术能够对数百到数千个单个细胞进行全基因组测序。然而，这些技术的测序覆盖度超低（每个细胞<0.5×），这限制了它们在单个细胞中大拷贝数异常（CNA）分析中的应用。虽然 CNA 是癌症研究中的有用标记物，但单核苷酸突变在癌症研究和其他应用中同样重要。然而，超低覆盖度测序产生的单核苷酸突变数据对于当前的单细胞分析方法来说过于稀疏。

结果

我们引入了 SBMClone 方法，用于推断共享体细胞单核苷酸突变群的细胞簇或克隆。SBMClone 使用随机块模型来克服超低覆盖度单细胞测序数据的稀疏性，我们表明 SBMClone 可以在覆盖度低至 0.2×的模拟数据集上准确推断真实的克隆组成。我们将 SBMClone 应用于从两名乳腺癌患者获得的两种不同测序技术的单细胞全基因组测序数据。在第一个患者中，使用 10X Genomics CNV 解决方案进行测序，测序覆盖度约为 0.03×，当纳入少量额外信息时，SBMClone 恢复了主要的克隆组成。在第二个患者中，使用 DOP-PCR 对治疗前后的肿瘤样本进行测序，测序覆盖度约为 0.5×，SBMClone 表明肿瘤细胞存在于治疗后的样本中，与该数据集的已发表分析结果相反。

可用性和实现

SBMClone 可在 GitHub 存储库 https://github.com/raphael-group/SBMClone 上获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/226b/7355247/bf8034489120/btaa449f1.jpg

相似文献

Identifying tumor clones in sparse single-cell mutation data.

Bioinformatics. 2020 Jul 1;36(Suppl_1):i186-i193. doi: 10.1093/bioinformatics/btaa449.

SECEDO: SNV-based subclone detection using ultra-low coverage single-cell DNA sequencing.

Bioinformatics. 2022 Sep 15;38(18):4293-4300. doi: 10.1093/bioinformatics/btac510.

Haplotype phasing in single-cell DNA-sequencing data.

Bioinformatics. 2018 Jul 1;34(13):i211-i217. doi: 10.1093/bioinformatics/bty286.

RobustClone: a robust PCA method for tumor clone and evolution inference from single-cell sequencing data.

Bioinformatics. 2020 Jun 1;36(11):3299-3306. doi: 10.1093/bioinformatics/btaa172.

QuantumClone: clonal assessment of functional mutations in cancer based on a genotype-aware method for clonal reconstruction.

Bioinformatics. 2018 Jun 1;34(11):1808-1816. doi: 10.1093/bioinformatics/bty016.

SCONCE: a method for profiling copy number alterations in cancer evolution using single-cell whole genome sequencing.

Bioinformatics. 2022 Mar 28;38(7):1801-1808. doi: 10.1093/bioinformatics/btac041.

Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data.

Bioinformatics. 2017 Dec 1;33(23):3726-3732. doi: 10.1093/bioinformatics/btx494.

Copy number evolution with weighted aberrations in cancer.

Bioinformatics. 2020 Jul 1;36(Suppl_1):i344-i352. doi: 10.1093/bioinformatics/btaa470.

Phertilizer: Growing a clonal tree from ultra-low coverage single-cell DNA sequencing of tumors.

PLoS Comput Biol. 2023 Oct 11;19(10):e1011544. doi: 10.1371/journal.pcbi.1011544. eCollection 2023 Oct.

SCSsim: an integrated tool for simulating single-cell genome sequencing data.

Bioinformatics. 2020 Feb 15;36(4):1281-1282. doi: 10.1093/bioinformatics/btz713.

引用本文的文献

Ongoing genome doubling shapes evolvability and immunity in ovarian cancer.

Nature. 2025 Jul 16. doi: 10.1038/s41586-025-09240-3.

Inferring active mutational processes in cancer using single cell sequencing and evolutionary constraints.

bioRxiv. 2025 Feb 27:2025.02.24.639589. doi: 10.1101/2025.02.24.639589.

SCGclust: Single Cell Graph clustering using graph autoencoders integrating SNVs and CNAs.

bioRxiv. 2025 Feb 1:2025.01.28.635357. doi: 10.1101/2025.01.28.635357.

Assessing the merits: an opinion on the effectiveness of simulation techniques in tumor subclonal reconstruction.

Bioinform Adv. 2024 Jun 26;4(1):vbae094. doi: 10.1093/bioadv/vbae094. eCollection 2024.

Joint inference of cell lineage and mitochondrial evolution from single-cell sequencing data.

Bioinformatics. 2024 Jun 28;40(Suppl 1):i218-i227. doi: 10.1093/bioinformatics/btae231.

Evaluation of simulation methods for tumor subclonal reconstruction.

ArXiv. 2024 Feb 14:arXiv:2402.09599v1.

Disease-Associated Neurotoxic Astrocyte Markers in Alzheimer Disease Based on Integrative Single-Nucleus RNA Sequencing.

Cell Mol Neurobiol. 2024 Feb 12;44(1):20. doi: 10.1007/s10571-024-01453-w.

Assessing the performance of methods for cell clustering from single-cell DNA sequencing data.

PLoS Comput Biol. 2023 Oct 12;19(10):e1010480. doi: 10.1371/journal.pcbi.1010480. eCollection 2023 Oct.

Phertilizer: Growing a clonal tree from ultra-low coverage single-cell DNA sequencing of tumors.

PLoS Comput Biol. 2023 Oct 11;19(10):e1011544. doi: 10.1371/journal.pcbi.1011544. eCollection 2023 Oct.

Computational Methods Summarizing Mutational Patterns in Cancer: Promise and Limitations for Clinical Applications.

Cancers (Basel). 2023 Mar 24;15(7):1958. doi: 10.3390/cancers15071958.

本文引用的文献

SCARLET: Single-cell tumor phylogeny inference with copy-number constrained mutation losses.

Cell Syst. 2020 Apr 22;10(4):323-332.e8. doi: 10.1016/j.cels.2020.04.001.

Inferring cancer progression from Single-Cell Sequencing while allowing mutation losses.

Bioinformatics. 2021 Apr 20;37(3):326-333. doi: 10.1093/bioinformatics/btaa722.

BnpC: Bayesian non-parametric clustering of single-cell mutation profiles.

Bioinformatics. 2020 Dec 8;36(19):4854-4859. doi: 10.1093/bioinformatics/btaa599.

Clonal Decomposition and DNA Replication States Defined by Scaled Single-Cell Genome Sequencing.

Cell. 2019 Nov 14;179(5):1207-1221.e22. doi: 10.1016/j.cell.2019.10.026.

SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data.

Genome Res. 2019 Nov;29(11):1847-1859. doi: 10.1101/gr.243121.118. Epub 2019 Oct 18.

PhISCS: a combinatorial approach for subperfect tumor phylogeny reconstruction via integrative use of single-cell and bulk sequencing data.

Genome Res. 2019 Nov;29(11):1860-1877. doi: 10.1101/gr.234435.118. Epub 2019 Oct 18.

Single-cell mutation identification via phylogenetic inference.

Nat Commun. 2018 Dec 4;9(1):5144. doi: 10.1038/s41467-018-07627-7.

SPhyR: tumor phylogeny estimation from single-cell sequencing data under loss and error.

Bioinformatics. 2018 Sep 1;34(17):i671-i679. doi: 10.1093/bioinformatics/bty589.

Chemoresistance Evolution in Triple-Negative Breast Cancer Delineated by Single-Cell Sequencing.

Cell. 2018 May 3;173(4):879-893.e13. doi: 10.1016/j.cell.2018.03.041. Epub 2018 Apr 19.

Multiclonal Invasion in Breast Tumors Identified by Topographic Single Cell Sequencing.

Cell. 2018 Jan 11;172(1-2):205-217.e12. doi: 10.1016/j.cell.2017.12.007. Epub 2018 Jan 4.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

鉴定稀疏单细胞突变数据中的肿瘤克隆。

Identifying tumor clones in sparse single-cell mutation data.

机构信息

Department of Computer Science, Princeton University, Princeton, NJ 08544, USA.