• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PEcnv:精确且高效地检测各种长度的拷贝数变异。

PEcnv: accurate and efficient detection of copy number variations of various lengths.

机构信息

Department of Computer Science and Technology, School of Electronics and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China.

Institute of Data Science and Information Quality, Shaanxi Engineering Research Center of Medical and Health Big Data, Xi'an Jiaotong University, Xi'an 710049, China.

出版信息

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac375.

DOI:10.1093/bib/bbac375
PMID:36056740
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9487654/
Abstract

Copy number variation (CNV) is a class of key biomarkers in many complex traits and diseases. Detecting CNV from sequencing data is a substantial bioinformatics problem and a standard requirement in clinical practice. Although many proposed CNV detection approaches exist, the core statistical model at their foundation is weakened by two critical computational issues: (i) identifying the optimal setting on the sliding window and (ii) correcting for bias and noise. We designed a statistical process model to overcome these limitations by calculating regional read depths via an exponentially weighted moving average strategy. A one-run detection of CNVs of various lengths is then achieved by a dynamic sliding window, whose size is self-adopted according to the weighted averages. We also designed a novel bias/noise reduction model, accompanied by the moving average, which can handle complicated patterns and extend training data. This model, called PEcnv, accurately detects CNVs ranging from kb-scale to chromosome-arm level. The model performance was validated with simulation samples and real samples. Comparative analysis showed that PEcnv outperforms current popular approaches. Notably, PEcnv provided considerable advantages in detecting small CNVs (1 kb-1 Mb) in panel sequencing data. Thus, PEcnv fills the gap left by existing methods focusing on large CNVs. PEcnv may have broad applications in clinical testing where panel sequencing is the dominant strategy. Availability and implementation: Source code is freely available at https://github.com/Sherwin-xjtu/PEcnv.

摘要

拷贝数变异 (CNV) 是许多复杂特征和疾病的关键生物标志物类别。从测序数据中检测 CNV 是一个重要的生物信息学问题,也是临床实践的标准要求。尽管存在许多提出的 CNV 检测方法,但它们基础的核心统计模型受到两个关键计算问题的削弱:(i) 确定滑动窗口的最佳设置,(ii) 纠正偏差和噪声。我们设计了一种统计过程模型,通过使用指数加权移动平均策略计算区域读取深度来克服这些限制。然后,通过动态滑动窗口实现了各种长度的 CNV 的一次性检测,其大小根据加权平均值自适应调整。我们还设计了一种新的偏差/噪声降低模型,与移动平均相结合,可以处理复杂的模式并扩展训练数据。该模型称为 PEcnv,可以准确检测从 kb 级到染色体臂级的 CNV。该模型的性能通过模拟样本和真实样本进行了验证。比较分析表明,PEcnv 优于当前流行的方法。值得注意的是,PEcnv 在面板测序数据中小 CNV(1 kb-1 Mb)的检测中具有显著优势。因此,PEcnv 填补了现有方法专注于大 CNV 留下的空白。PEcnv 可能在面板测序是主要策略的临床测试中有广泛的应用。

可用性和实施

源代码可在 https://github.com/Sherwin-xjtu/PEcnv 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/435540d03ffb/bbac375f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/a554a84e8feb/bbac375f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/5e4128c32402/bbac375f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/79306d1ebb9b/bbac375f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/435540d03ffb/bbac375f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/a554a84e8feb/bbac375f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/5e4128c32402/bbac375f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/79306d1ebb9b/bbac375f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6b3/9487654/435540d03ffb/bbac375f4.jpg

相似文献

1
PEcnv: accurate and efficient detection of copy number variations of various lengths.PEcnv:精确且高效地检测各种长度的拷贝数变异。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac375.
2
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
3
Detecting copy number variation in next generation sequencing data from diagnostic gene panels.检测下一代测序数据中的拷贝数变异,来自诊断基因面板。
BMC Med Genomics. 2021 Aug 31;14(1):214. doi: 10.1186/s12920-021-01059-x.
4
iCopyDAV: Integrated platform for copy number variations-Detection, annotation and visualization.iCopyDAV:用于拷贝数变异检测、注释和可视化的集成平台。
PLoS One. 2018 Apr 5;13(4):e0195334. doi: 10.1371/journal.pone.0195334. eCollection 2018.
5
An evaluation of copy number variation detection tools for cancer using whole exome sequencing data.使用全外显子组测序数据对癌症拷贝数变异检测工具的评估
BMC Bioinformatics. 2017 May 31;18(1):286. doi: 10.1186/s12859-017-1705-x.
6
CNV_IFTV: An Isolation Forest and Total Variation-Based Detection of CNVs from Short-Read Sequencing Data.CNV_IFTV:一种基于孤立森林和全变差的短读测序数据 CNV 检测方法。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):539-549. doi: 10.1109/TCBB.2019.2920889. Epub 2021 Apr 8.
7
CNV-RF Is a Random Forest-Based Copy Number Variation Detection Method Using Next-Generation Sequencing.CNV-RF是一种基于随机森林的利用下一代测序技术进行拷贝数变异检测的方法。
J Mol Diagn. 2016 Nov;18(6):872-881. doi: 10.1016/j.jmoldx.2016.07.001. Epub 2016 Sep 3.
8
CNVcaller: highly efficient and widely applicable software for detecting copy number variations in large populations.CNVcaller:一款高效、广泛适用的软件,可用于检测大群体中的拷贝数变异。
Gigascience. 2017 Dec 1;6(12):1-12. doi: 10.1093/gigascience/gix115.
9
Shape-based retrieval of CNV regions in read coverage data.基于形状的读取覆盖数据中CNV区域检索
Int J Data Min Bioinform. 2014;9(3):254-76. doi: 10.1504/ijdmb.2014.060051.
10
CNV-BAC: Copy number Variation Detection in Bacterial Circular Genome.CNV-BAC:细菌环状基因组中的拷贝数变异检测。
Bioinformatics. 2020 Jun 1;36(12):3890-3891. doi: 10.1093/bioinformatics/btaa208.

引用本文的文献

1
Multi-analyte liquid biopsy approaches for early detection of esophageal cancer: the expanding role of ctDNA.用于食管癌早期检测的多分析物液体活检方法:循环肿瘤DNA的作用不断扩大
Front Oncol. 2025 Aug 14;15:1622984. doi: 10.3389/fonc.2025.1622984. eCollection 2025.
2
Comparative study of tools for copy number variation detection using next-generation sequencing data.使用下一代测序数据进行拷贝数变异检测工具的比较研究
Sci Rep. 2025 Jul 1;15(1):22145. doi: 10.1038/s41598-025-06527-3.
3
EMcnv: enhancing CNV detection performance through ensemble strategies with heterogeneous meta-graph neural networks.
EMcnv:通过使用异构元图神经网络的集成策略提高拷贝数变异(CNV)检测性能。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf135.
4
Deciphering response dynamics and treatment resistance from circulating tumor DNA after CAR T-cells in multiple myeloma.解读多发性骨髓瘤中嵌合抗原受体T细胞治疗后循环肿瘤DNA的反应动态及治疗耐药性
Nat Commun. 2025 Feb 20;16(1):1824. doi: 10.1038/s41467-025-56486-6.
5
A copy number variation detection method based on OCSVM algorithm using multi strategies integration.一种基于多策略集成的支持向量数据描述(OCSVM)算法的拷贝数变异检测方法。
Sci Rep. 2025 Jan 28;15(1):3526. doi: 10.1038/s41598-025-88143-9.
6
OTSUCNV: an adaptive segmentation and OTSU-based anomaly classification method for CNV detection using NGS data.OTSUCNV:一种自适应分割和基于 OTSU 的异常分类方法,用于使用 NGS 数据进行 CNV 检测。
BMC Genomics. 2024 Jan 30;25(1):126. doi: 10.1186/s12864-024-10018-6.
7
An update of clinical value of circulating tumor DNA in esophageal cancer: a systematic review and meta-analysis.循环肿瘤 DNA 在食管癌中的临床价值更新:系统评价和荟萃分析。
BMC Cancer. 2024 Jan 24;24(1):129. doi: 10.1186/s12885-024-11879-6.
8
What makes TMB an ambivalent biomarker for immunotherapy? A subtle mismatch between the sample-based design of variant callers and real clinical cohort.TMB 为何成为免疫治疗的一个矛盾生物标志物?变异 caller 的基于样本的设计与真实临床队列之间存在微妙的不匹配。
Front Immunol. 2023 May 25;14:1151224. doi: 10.3389/fimmu.2023.1151224. eCollection 2023.