• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用测序数据进行DNA拷贝数研究的惩罚回归方法。

A penalized regression approach for DNA copy number study using the sequencing data.

作者信息

Lee Jaeeun, Chen Jie

机构信息

Division of Biostatistics and Data Science, Department of Population Health Sciences, Medical College of Georgia, Augusta University, Augusta, GA 30912, USA.

出版信息

Stat Appl Genet Mol Biol. 2019 May 30;18(4):sagmb-2018-0001. doi: 10.1515/sagmb-2018-0001.

DOI:10.1515/sagmb-2018-0001
PMID:31145697
Abstract

Modeling the high-throughput next generation sequencing (NGS) data, resulting from experiments with the goal of profiling tumor and control samples for the study of DNA copy number variants (CNVs), remains to be a challenge in various ways. In this application work, we provide an efficient method for detecting multiple CNVs using NGS reads ratio data. This method is based on a multiple statistical change-points model with the penalized regression approach, 1d fused LASSO, that is designed for ordered data in a one-dimensional structure. In addition, since the path algorithm traces the solution as a function of a tuning parameter, the number and locations of potential CNV region boundaries can be estimated simultaneously in an efficient way. For tuning parameter selection, we then propose a new modified Bayesian information criterion, called JMIC, and compare the proposed JMIC with three different Bayes information criteria used in the literature. Simulation results have shown the better performance of JMIC for tuning parameter selection, in comparison with the other three criterion. We applied our approach to the sequencing data of reads ratio between the breast tumor cell lines HCC1954 and its matched normal cell line BL 1954 and the results are in-line with those discovered in the literature.

摘要

对以分析肿瘤样本和对照样本的DNA拷贝数变异(CNV)为目的的实验所产生的高通量下一代测序(NGS)数据进行建模,在诸多方面仍然是一项挑战。在本应用工作中,我们提供了一种利用NGS读段比率数据检测多个CNV的有效方法。该方法基于一个带有惩罚回归方法(一维融合套索回归,专为一维结构的有序数据设计)的多重统计变化点模型。此外,由于路径算法将解作为一个调优参数的函数进行追踪,潜在CNV区域边界的数量和位置能够以一种有效的方式同时得到估计。对于调优参数选择,我们随后提出了一种新的改进型贝叶斯信息准则,称为JMIC,并将所提出的JMIC与文献中使用的三种不同贝叶斯信息准则进行比较。模拟结果表明,与其他三种准则相比,JMIC在调优参数选择方面具有更好的性能。我们将我们的方法应用于乳腺癌细胞系HCC1954与其匹配的正常细胞系BL 1954之间的读段比率测序数据,结果与文献中发现的结果一致。

相似文献

1
A penalized regression approach for DNA copy number study using the sequencing data.一种使用测序数据进行DNA拷贝数研究的惩罚回归方法。
Stat Appl Genet Mol Biol. 2019 May 30;18(4):sagmb-2018-0001. doi: 10.1515/sagmb-2018-0001.
2
Detection of Copy Number Variation Regions Using the DNA-Sequencing Data from Multiple Profiles with Correlated Structure.利用具有相关结构的多个样本的DNA测序数据检测拷贝数变异区域
J Comput Biol. 2018 Oct;25(10):1128-1140. doi: 10.1089/cmb.2018.0053. Epub 2018 Jul 27.
3
Modeling the next generation sequencing read count data for DNA copy number variant study.为DNA拷贝数变异研究对下一代测序读段计数数据进行建模。
Stat Appl Genet Mol Biol. 2015 Aug;14(4):361-74. doi: 10.1515/sagmb-2014-0054.
4
SeqCNV: a novel method for identification of copy number variations in targeted next-generation sequencing data.SeqCNV:一种用于在靶向新一代测序数据中识别拷贝数变异的新方法。
BMC Bioinformatics. 2017 Mar 3;18(1):147. doi: 10.1186/s12859-017-1566-3.
5
CNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data.CNV-CH:一种基于凸包的分割方法,用于使用下一代测序数据检测拷贝数变异(CNV)。
PLoS One. 2015 Aug 20;10(8):e0135895. doi: 10.1371/journal.pone.0135895. eCollection 2015.
6
Modified screening and ranking algorithm for copy number variation detection.用于拷贝数变异检测的改进筛选与排序算法
Bioinformatics. 2015 May 1;31(9):1341-8. doi: 10.1093/bioinformatics/btu850. Epub 2014 Dec 25.
7
Detection Copy Number Variants from NGS with Sparse and Smooth Constraints.利用稀疏和平滑约束从二代测序中检测拷贝数变异
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jul-Aug;14(4):856-867. doi: 10.1109/TCBB.2016.2561933. Epub 2016 May 3.
8
Copy number variation detection using next generation sequencing read counts.使用下一代测序读段计数进行拷贝数变异检测。
BMC Bioinformatics. 2014 Apr 14;15:109. doi: 10.1186/1471-2105-15-109.
9
Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion.使用贝叶斯信息准则检测全基因组测序数据中的拷贝数变异。
Proc Natl Acad Sci U S A. 2011 Nov 15;108(46):E1128-36. doi: 10.1073/pnas.1110574108. Epub 2011 Nov 7.
10
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.

引用本文的文献

1
Mixed Lasso estimator for stochastic restricted regression models.随机受限回归模型的混合套索估计量
J Appl Stat. 2021 May 4;48(13-15):2795-2808. doi: 10.1080/02664763.2021.1922614. eCollection 2021.