• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高效的连续测量基因组序列的突变点检测。

Efficient change point detection for genomic sequences of continuous measurements.

机构信息

Dipartimento di Scienze Statistiche e Matematiche Vianelli, Università di Palermo, Palermo, Italy.

出版信息

Bioinformatics. 2011 Jan 15;27(2):161-6. doi: 10.1093/bioinformatics/btq647. Epub 2010 Nov 18.

DOI:10.1093/bioinformatics/btq647
PMID:21088029
Abstract

MOTIVATION

Knowing the exact locations of multiple change points in genomic sequences serves several biological needs, for instance when data represent aCGH profiles and it is of interest to identify possibly damaged genes involved in cancer and other diseases. Only a few of the currently available methods deal explicitly with estimation of the number and location of change points, and moreover these methods may be somewhat vulnerable to deviations of model assumptions usually employed.

RESULTS

We present a computationally efficient method to obtain estimates of the number and location of the change points. The method is based on a simple transformation of data and it provides results quite robust to model misspecifications. The efficiency of the method guarantees moderate computational times regardless of the series length and the number of change points.

AVAILABILITY

The methods described in this article are implemented in the new R package cumSeg available from the Comprehensive R Archive Network at http://CRAN.R-project.org/package=cumSeg.

摘要

动机

在基因组序列中准确地定位多个变化点,可满足多种生物学需求,例如当数据代表 aCGH 图谱,且需要鉴定可能与癌症和其他疾病相关的受损基因时。目前可用的少数方法明确涉及变化点数量和位置的估计,此外,这些方法可能对通常使用的模型假设的偏差有些敏感。

结果

我们提出了一种计算上有效的方法来获得变化点数量和位置的估计值。该方法基于数据的简单变换,并且对模型的误设具有相当稳健的结果。该方法的效率保证了即使在序列长度和变化点数量较大的情况下,计算时间也适中。

可用性

本文中描述的方法在新的 R 包 cumSeg 中实现,可从 Comprehensive R Archive Network 获取,网址为 http://CRAN.R-project.org/package=cumSeg。

相似文献

1
Efficient change point detection for genomic sequences of continuous measurements.高效的连续测量基因组序列的突变点检测。
Bioinformatics. 2011 Jan 15;27(2):161-6. doi: 10.1093/bioinformatics/btq647. Epub 2010 Nov 18.
2
Multiple Break-Points Detection in Array CGH Data via the Cross-Entropy Method.基于交叉熵方法的阵列比较基因组杂交数据中的多个断点检测
IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):487-98. doi: 10.1109/TCBB.2014.2361639.
3
Efficient change-points detection for genomic sequences via cumulative segmented regression.基于累积分段回归的基因组序列高效突变点检测
Bioinformatics. 2022 Jan 3;38(2):311-317. doi: 10.1093/bioinformatics/btab685.
4
Hybrid algorithms for multiple change-point detection in biological sequences.用于生物序列中多个变化点检测的混合算法。
Adv Exp Med Biol. 2015;823:41-61. doi: 10.1007/978-3-319-10984-8_3.
5
Finding recurrent copy number alterations preserving within-sample homogeneity.发现保留样本内同质性的复发性拷贝数改变。
Bioinformatics. 2011 Nov 1;27(21):2949-56. doi: 10.1093/bioinformatics/btr488. Epub 2011 Aug 25.
6
Smoothing waves in array CGH tumor profiles.平滑阵列比较基因组杂交肿瘤图谱中的波形。
Bioinformatics. 2009 May 1;25(9):1099-104. doi: 10.1093/bioinformatics/btp132. Epub 2009 Mar 10.
7
synbreed: a framework for the analysis of genomic prediction data using R.synbreed:一个使用 R 进行基因组预测数据分析的框架。
Bioinformatics. 2012 Aug 1;28(15):2086-7. doi: 10.1093/bioinformatics/bts335. Epub 2012 Jun 10.
8
CGHnormaliter: a Bioconductor package for normalization of array CGH data with many CNAs.CGHnormaliter:一个 Bioconductor 包,用于对包含多个拷贝数变异的阵列 CGH 数据进行标准化处理。
Bioinformatics. 2010 May 15;26(10):1366-7. doi: 10.1093/bioinformatics/btq155. Epub 2010 Apr 23.
9
Classification of array CGH data using smoothed logistic regression model.使用平滑逻辑回归模型对 array CGH 数据进行分类。
Stat Med. 2009 Dec 30;28(30):3798-810. doi: 10.1002/sim.3753.
10
Modeling the DNA copy number aberration patterns in observational high-throughput cancer data.在观察性高通量癌症数据中模拟DNA拷贝数畸变模式。
Stat Appl Genet Mol Biol. 2013 Apr 19;12(2):143-74. doi: 10.1515/sagmb-2012-0020.

引用本文的文献

1
Indicator from the graph Laplacian of stock market time series cross-sections can precisely determine the durations of market crashes.来自股票市场时间序列横截面图拉普拉斯算子的指标能够精确确定市场崩溃的持续时间。
PLoS One. 2025 Jul 18;20(7):e0327391. doi: 10.1371/journal.pone.0327391. eCollection 2025.
2
Testing for a General Changepoint in Medical and Psychometric Studies: Changes Detection and Sample Size Planning.医学与心理测量学研究中一般变化点的检验:变化检测与样本量规划
Stat Med. 2025 Jun;44(13-14):e70150. doi: 10.1002/sim.70150.
3
Estimating changepoints in extremal dependence, applied to aviation stock prices during COVID-19 pandemic.
估计极值相依性中的变化点,并应用于新冠疫情期间的航空股价。
J Appl Stat. 2024 Jul 3;52(3):525-554. doi: 10.1080/02664763.2024.2373939. eCollection 2025.
4
Change point detection in brucellosis time series from 2010 to 2023 in Xinjiang China using the BEAST algorithm.利用BEAST算法对2010年至2023年中国新疆布鲁氏菌病时间序列进行变点检测。
Sci Rep. 2025 Jan 30;15(1):3830. doi: 10.1038/s41598-025-88508-0.
5
A Selective Review on Information Criteria in Multiple Change Point Detection.多变化点检测中信息准则的选择性综述
Entropy (Basel). 2024 Jan 4;26(1):50. doi: 10.3390/e26010050.
6
Multiple change point analysis of hepatitis B reports in Xinjiang, China from 2006 to 2021.中国新疆地区 2006 年至 2021 年乙型肝炎报告的多变化点分析。
Front Public Health. 2023 Nov 14;11:1223176. doi: 10.3389/fpubh.2023.1223176. eCollection 2023.
7
A new time-varying coefficient regression approach for analyzing infectious disease data.一种用于分析传染病数据的新时变系数回归方法。
Sci Rep. 2023 Sep 6;13(1):14687. doi: 10.1038/s41598-023-41551-1.
8
Statistical Picking of Multivariate Waveforms.多元波形的统计选择。
Sensors (Basel). 2022 Dec 8;22(24):9636. doi: 10.3390/s22249636.
9
Testing for a Change in Mean After Changepoint Detection.在变点检测后对均值变化进行检验。
J R Stat Soc Series B Stat Methodol. 2022 Sep;84(4):1082-1104. doi: 10.1111/rssb.12501. Epub 2022 Apr 12.
10
Multiple change point estimation of trends in Covid-19 infections and deaths in India as compared with WHO regions.与世界卫生组织各区域相比,印度新冠病毒感染和死亡趋势的多变化点估计。
Spat Stat. 2022 Jun;49:100538. doi: 10.1016/j.spasta.2021.100538. Epub 2021 Sep 3.