• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用具有相关结构的多个样本的DNA测序数据检测拷贝数变异区域

Detection of Copy Number Variation Regions Using the DNA-Sequencing Data from Multiple Profiles with Correlated Structure.

作者信息

Chen Jie, Deng Shirong

机构信息

1 Division of Biostatistics and Data Science, Department of Population Health Sciences, Medical College of Georgia, Augusta University , Augusta, Georgia .

2 School of Mathematics and Statistics, Wuhan University , Wuhan, China .

出版信息

J Comput Biol. 2018 Oct;25(10):1128-1140. doi: 10.1089/cmb.2018.0053. Epub 2018 Jul 27.

DOI:10.1089/cmb.2018.0053
PMID:30052071
Abstract

In this article, we investigate the problem of detecting boundaries of DNA copy number variation (CNV) regions using the DNA-sequencing data from multiple subject samples. Genomic features along the linear realization of the actual genome are correlated, especially within vicinity of a locus, so are the sequencing reads along the genome. It is then crucial to take the correlated structure of such high-throughput genomic data into consideration when modeling DNA-sequencing data for CNV detection from statistical and computational viewpoints. We use the framework of a fused Lasso latent feature model to solve the problem, and propose a modified information criterion for selecting the tuning parameter when search for common CNVs is shared by multiple subjects. Simulation studies and application on multiple subjects' next-generation sequencing data, downloaded from the 1000 Genome Project, showed that the proposed approach can effectively identify individual CNVs of a single subject profile and common CNVs shared by multiple subjects.

摘要

在本文中,我们研究了利用来自多个个体样本的DNA测序数据检测DNA拷贝数变异(CNV)区域边界的问题。沿着实际基因组的线性实现的基因组特征是相关的,特别是在一个基因座附近,沿着基因组的测序读数也是如此。因此,从统计和计算的角度对用于CNV检测的DNA测序数据进行建模时,考虑这种高通量基因组数据的相关结构至关重要。我们使用融合套索潜在特征模型的框架来解决该问题,并提出了一种改进的信息准则,用于在多个个体共享寻找常见CNV时选择调谐参数。对从千人基因组计划下载的多个个体的下一代测序数据进行的模拟研究和应用表明,所提出的方法可以有效地识别单个个体图谱的个体CNV以及多个个体共享的常见CNV。

相似文献

1
Detection of Copy Number Variation Regions Using the DNA-Sequencing Data from Multiple Profiles with Correlated Structure.利用具有相关结构的多个样本的DNA测序数据检测拷贝数变异区域
J Comput Biol. 2018 Oct;25(10):1128-1140. doi: 10.1089/cmb.2018.0053. Epub 2018 Jul 27.
2
A penalized regression approach for DNA copy number study using the sequencing data.一种使用测序数据进行DNA拷贝数研究的惩罚回归方法。
Stat Appl Genet Mol Biol. 2019 May 30;18(4):sagmb-2018-0001. doi: 10.1515/sagmb-2018-0001.
3
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
4
Detection of Significant Copy Number Variations From Multiple Samples in Next-Generation Sequencing Data.从下一代测序数据中检测多个样本中的显著拷贝数变异。
IEEE Trans Nanobioscience. 2018 Mar;17(1):12-20. doi: 10.1109/TNB.2017.2783910.
5
Modeling the next generation sequencing read count data for DNA copy number variant study.为DNA拷贝数变异研究对下一代测序读段计数数据进行建模。
Stat Appl Genet Mol Biol. 2015 Aug;14(4):361-74. doi: 10.1515/sagmb-2014-0054.
6
SM-RCNV: a statistical method to detect recurrent copy number variations in sequenced samples.SM-RCNV:一种用于检测测序样本中重现性拷贝数变异的统计方法。
Genes Genomics. 2019 May;41(5):529-536. doi: 10.1007/s13258-019-00788-9. Epub 2019 Feb 18.
7
An evaluation of copy number variation detection tools for cancer using whole exome sequencing data.使用全外显子组测序数据对癌症拷贝数变异检测工具的评估
BMC Bioinformatics. 2017 May 31;18(1):286. doi: 10.1186/s12859-017-1705-x.
8
CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data.CNV-CH:一种基于凸包的分割方法,用于使用下一代测序数据检测拷贝数变异(CNV)。
PLoS One. 2015 Aug 20;10(8):e0135895. doi: 10.1371/journal.pone.0135895. eCollection 2015.
9
iCopyDAV: Integrated platform for copy number variations-Detection, annotation and visualization.iCopyDAV:用于拷贝数变异检测、注释和可视化的集成平台。
PLoS One. 2018 Apr 5;13(4):e0195334. doi: 10.1371/journal.pone.0195334. eCollection 2018.
10
Copy number variations in the genome of the Qatari population.卡塔尔人群基因组中的拷贝数变异
BMC Genomics. 2015 Oct 22;16:834. doi: 10.1186/s12864-015-1991-5.

引用本文的文献

1
Mixed Lasso estimator for stochastic restricted regression models.随机受限回归模型的混合套索估计量
J Appl Stat. 2021 May 4;48(13-15):2795-2808. doi: 10.1080/02664763.2021.1922614. eCollection 2021.
2
Statistical Considerations on NGS Data for Inferring Copy Number Variations.关于推断拷贝数变异的 NGS 数据的统计考虑。
Methods Mol Biol. 2021;2243:27-58. doi: 10.1007/978-1-0716-1103-6_2.