• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合基因组相关结构可改善拷贝数变异检测。

Integrating genomic correlation structure improves copy number variations detection.

作者信息

Luo Xizhi, Qin Fei, Cai Guoshuai, Xiao Feifei

机构信息

Department of Epidemiology and Biostatistics, Arnold School of Public Health, University of South Carolina, Columbia, SC 29208, USA.

Department of Environmental Health Science, Arnold School of Public Health, University of South Carolina, Columbia, SC 29208, USA.

出版信息

Bioinformatics. 2021 Apr 20;37(3):312-317. doi: 10.1093/bioinformatics/btaa737.

DOI:10.1093/bioinformatics/btaa737
PMID:32805016
Abstract

MOTIVATION

Copy number variation plays important roles in human complex diseases. The detection of copy number variants (CNVs) is identifying mean shift in genetic intensities to locate chromosomal breakpoints, the step of which is referred to as chromosomal segmentation. Many segmentation algorithms have been developed with a strong assumption of independent observations in the genetic loci, and they assume each locus has an equal chance to be a breakpoint (i.e. boundary of CNVs). However, this assumption is violated in the genetics perspective due to the existence of correlation among genomic positions, such as linkage disequilibrium (LD). Our study showed that the LD structure is related to the location distribution of CNVs, which indeed presents a non-random pattern on the genome. To generate more accurate CNVs, we proposed a novel algorithm, LDcnv, that models the CNV data with its biological characteristics relating to genetic dependence structure (i.e. LD).

RESULTS

We theoretically demonstrated the correlation structure of CNV data in SNP array, which further supports the necessity of integrating biological structure in statistical methods for CNV detection. Therefore, we developed the LDcnv that integrated the genomic correlation structure with a local search strategy into statistical modeling of the CNV intensities. To evaluate the performance of LDcnv, we conducted extensive simulations and analyzed large-scale HapMap datasets. We showed that LDcnv presented high accuracy, stability and robustness in CNV detection and higher precision in detecting short CNVs compared to existing methods. This new segmentation algorithm has a wide scope of potential application with data from various high-throughput technology platforms.

AVAILABILITY AND IMPLEMENTATION

https://github.com/FeifeiXiaoUSC/LDcnv.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

拷贝数变异在人类复杂疾病中发挥着重要作用。拷贝数变异(CNV)的检测是通过识别基因强度中的均值偏移来定位染色体断点,这一步骤被称为染色体分割。已经开发了许多分割算法,这些算法在基因座上有一个很强的独立观察假设,并且它们假设每个基因座成为断点(即CNV边界)的机会均等。然而,从遗传学角度来看,由于基因组位置之间存在相关性,如连锁不平衡(LD),这一假设并不成立。我们的研究表明,LD结构与CNV的位置分布相关,其在基因组上确实呈现出非随机模式。为了生成更准确的CNV,我们提出了一种新算法LDcnv,该算法利用与遗传依赖结构(即LD)相关的生物学特征对CNV数据进行建模。

结果

我们从理论上证明了SNP阵列中CNV数据的相关结构,这进一步支持了在CNV检测的统计方法中整合生物学结构的必要性。因此,我们开发了LDcnv,它将基因组相关结构与局部搜索策略整合到CNV强度的统计建模中。为了评估LDcnv的性能,我们进行了广泛的模拟并分析了大规模的HapMap数据集。我们表明,与现有方法相比,LDcnv在CNV检测中具有高精度、稳定性和鲁棒性,在检测短CNV方面具有更高的精度。这种新的分割算法在处理来自各种高通量技术平台的数据时具有广泛的潜在应用。

可用性与实现

https://github.com/FeifeiXiaoUSC/LDcnv。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
Integrating genomic correlation structure improves copy number variations detection.整合基因组相关结构可改善拷贝数变异检测。
Bioinformatics. 2021 Apr 20;37(3):312-317. doi: 10.1093/bioinformatics/btaa737.
2
Shall genomic correlation structure be considered in copy number variants detection?在检测拷贝数变异时是否应考虑基因组相关性结构?
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab215.
3
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
4
Modified screening and ranking algorithm for copy number variation detection.用于拷贝数变异检测的改进筛选与排序算法
Bioinformatics. 2015 May 1;31(9):1341-8. doi: 10.1093/bioinformatics/btu850. Epub 2014 Dec 25.
5
An accurate and powerful method for copy number variation detection.一种精确而强大的拷贝数变异检测方法。
Bioinformatics. 2019 Sep 1;35(17):2891-2898. doi: 10.1093/bioinformatics/bty1041.
6
Genome-wide algorithm for detecting CNV associations with diseases.全基因组算法检测与疾病相关的 CNV 关联。
BMC Bioinformatics. 2011 Aug 9;12:331. doi: 10.1186/1471-2105-12-331.
7
Integrative DNA copy number detection and genotyping from sequencing and array-based platforms.整合测序和基于阵列平台的 DNA 拷贝数检测和基因分型。
Bioinformatics. 2018 Jul 15;34(14):2349-2355. doi: 10.1093/bioinformatics/bty104.
8
A multi-sample based method for identifying common CNVs in normal human genomic structure using high-resolution aCGH data.基于多样本的方法,利用高分辨率 aCGH 数据识别正常人类基因组结构中的常见 CNV。
PLoS One. 2011;6(10):e26975. doi: 10.1371/journal.pone.0026975. Epub 2011 Oct 31.
9
Copy number variations in the genome of the Qatari population.卡塔尔人群基因组中的拷贝数变异
BMC Genomics. 2015 Oct 22;16:834. doi: 10.1186/s12864-015-1991-5.
10
Modeling genetic inheritance of copy number variations.对拷贝数变异的遗传继承进行建模。
Nucleic Acids Res. 2008 Dec;36(21):e138. doi: 10.1093/nar/gkn641. Epub 2008 Oct 2.

引用本文的文献

1
A new insight into the impact of copy number variations on cell cycle deregulation of luminal-type breast cancer.对拷贝数变异对管腔型乳腺癌细胞周期失调影响的新见解。
Oncol Rev. 2025 Feb 12;19:1516409. doi: 10.3389/or.2025.1516409. eCollection 2025.
2
Identification of copy number variation in Tibetan sheep using whole genome resequencing reveals evidence of genomic selection.利用全基因组重测序鉴定藏羊的拷贝数变异揭示了基因组选择的证据。
BMC Genomics. 2023 Sep 19;24(1):555. doi: 10.1186/s12864-023-09672-z.
3
Shall genomic correlation structure be considered in copy number variants detection?
在检测拷贝数变异时是否应考虑基因组相关性结构?
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab215.