• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解析大型数据集的单细胞拷贝数分析。

Resolving single-cell copy number profiling for large datasets.

机构信息

Department of Computer Science at City University of Hong Kong.

School of Software, Northwestern Polytechnical University.

出版信息

Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac264.

DOI:10.1093/bib/bbac264
PMID:35801503
Abstract

The advances of single-cell DNA sequencing (scDNA-seq) enable us to characterize the genetic heterogeneity of cancer cells. However, the high noise and low coverage of scDNA-seq impede the estimation of copy number variations (CNVs). In addition, existing tools suffer from intensive execution time and often fail on large datasets. Here, we propose SeCNV, an efficient method that leverages structural entropy, to profile the copy numbers. SeCNV adopts a local Gaussian kernel to construct a matrix, depth congruent map (DCM), capturing the similarities between any two bins along the genome. Then, SeCNV partitions the genome into segments by minimizing the structural entropy from the DCM. With the partition, SeCNV estimates the copy numbers within each segment for cells. We simulate nine datasets with various breakpoint distributions and amplitudes of noise to benchmark SeCNV. SeCNV achieves a robust performance, i.e. the F1-scores are higher than 0.95 for breakpoint detections, significantly outperforming state-of-the-art methods. SeCNV successfully processes large datasets (>50 000 cells) within 4 min, while other tools fail to finish within the time limit, i.e. 120 h. We apply SeCNV to single-nucleus sequencing datasets from two breast cancer patients and acoustic cell tagmentation sequencing datasets from eight breast cancer patients. SeCNV successfully reproduces the distinct subclones and infers tumor heterogeneity. SeCNV is available at https://github.com/deepomicslab/SeCNV.

摘要

单细胞 DNA 测序(scDNA-seq)的进步使我们能够描绘癌细胞的遗传异质性。然而,scDNA-seq 的高噪声和低覆盖率阻碍了拷贝数变异(CNVs)的估计。此外,现有的工具执行时间长,并且经常在大型数据集上失败。在这里,我们提出了 SeCNV,一种利用结构熵来分析拷贝数的有效方法。SeCNV 采用局部高斯核来构建矩阵,深度一致映射(DCM),捕捉基因组中任意两个 bin 之间的相似性。然后,SeCNV 通过最小化 DCM 的结构熵将基因组划分为片段。通过分区,SeCNV 为每个细胞估计每个片段内的拷贝数。我们模拟了九个具有不同断点分布和噪声幅度的数据集,以对 SeCNV 进行基准测试。SeCNV 实现了稳健的性能,即断点检测的 F1 分数高于 0.95,明显优于最先进的方法。SeCNV 能够在 4 分钟内处理超过 50000 个细胞的大型数据集,而其他工具在时间限制内(即 120 小时)无法完成。我们将 SeCNV 应用于来自两名乳腺癌患者的单细胞测序数据集和来自八名乳腺癌患者的声学细胞标记测序数据集。SeCNV 成功地重现了不同的亚克隆,并推断了肿瘤异质性。SeCNV 可在 https://github.com/deepomicslab/SeCNV 上获得。

相似文献

1
Resolving single-cell copy number profiling for large datasets.解析大型数据集的单细胞拷贝数分析。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac264.
2
Improved allele-specific single-cell copy number estimation in low-coverage DNA-sequencing.提高低覆盖度 DNA 测序中单核苷酸多态性特异性单细胞拷贝数估计的方法。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae506.
3
SCCNAInfer: a robust and accurate tool to infer the absolute copy number on scDNA-seq data.SCCNAInfer:一种用于推断单细胞DNA测序数据绝对拷贝数的强大且准确的工具。
Bioinformatics. 2024 Jul 27;40(7). doi: 10.1093/bioinformatics/btae454.
4
Copy Number Variation Detection by Single-Cell DNA Sequencing with SCOPE.利用 SCOPE 进行单细胞 DNA 测序的拷贝数变异检测。
Methods Mol Biol. 2022;2493:279-288. doi: 10.1007/978-1-0716-2293-3_18.
5
SCOPE: A Normalization and Copy-Number Estimation Method for Single-Cell DNA Sequencing.范围:单细胞 DNA 测序的标准化和拷贝数估计方法。
Cell Syst. 2020 May 20;10(5):445-452.e6. doi: 10.1016/j.cels.2020.03.005.
6
Hierarchical discovery of large-scale and focal copy number alterations in low-coverage cancer genomes.在低覆盖度癌症基因组中进行大规模和焦点拷贝数改变的层次式发现。
BMC Bioinformatics. 2020 Apr 16;21(1):147. doi: 10.1186/s12859-020-3480-3.
7
Inferring single-cell copy number profiles through cross-cell segmentation of read counts.通过读取计数的跨细胞分割推断单细胞拷贝数谱。
BMC Genomics. 2024 Jan 2;25(1):25. doi: 10.1186/s12864-023-09901-5.
8
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
9
SCYN: single cell CNV profiling method using dynamic programming.SCYN:一种使用动态规划进行单细胞 CNV 分析的方法。
BMC Genomics. 2021 Nov 16;22(Suppl 5):651. doi: 10.1186/s12864-021-07941-3.
10
CONY: A Bayesian procedure for detecting copy number variations from sequencing read depths.CONY:一种基于测序深度的拷贝数变异检测的贝叶斯方法。
Sci Rep. 2020 Jun 26;10(1):10493. doi: 10.1038/s41598-020-64353-1.

引用本文的文献

1
Benchmarking copy number aberrations inference tools using single-cell multi-omics datasets.使用单细胞多组学数据集对拷贝数变异推断工具进行基准测试。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf076.
2
Single-cell copy number calling and event history reconstruction.单细胞拷贝数检测与事件历史重建。
Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf072.
3
Improved allele-specific single-cell copy number estimation in low-coverage DNA-sequencing.提高低覆盖度 DNA 测序中单核苷酸多态性特异性单细胞拷贝数估计的方法。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae506.
4
SCCNAInfer: a robust and accurate tool to infer the absolute copy number on scDNA-seq data.SCCNAInfer:一种用于推断单细胞DNA测序数据绝对拷贝数的强大且准确的工具。
Bioinformatics. 2024 Jul 27;40(7). doi: 10.1093/bioinformatics/btae454.
5
CoT: a transformer-based method for inferring tumor clonal copy number substructure from scDNA-seq data.CoT:一种基于 Transformer 的方法,用于从 scDNA-seq 数据推断肿瘤克隆拷贝数亚结构。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae187.
6
Inferring single-cell copy number profiles through cross-cell segmentation of read counts.通过读取计数的跨细胞分割推断单细胞拷贝数谱。
BMC Genomics. 2024 Jan 2;25(1):25. doi: 10.1186/s12864-023-09901-5.
7
The utilization of single-cell sequencing technology in investigating the immune microenvironment of ccRCC.单细胞测序技术在研究 ccRCC 免疫微环境中的应用。
Front Immunol. 2023 Nov 27;14:1276658. doi: 10.3389/fimmu.2023.1276658. eCollection 2023.