• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

StrucBreak:一种用于 DNA 序列结构断裂检测的计算框架。

StrucBreak: A Computational Framework for Structural Break Detection in DNA Sequences.

机构信息

Department of Computer Science and Engineering, East West University Bangladesh, Dhaka, Bangladesh.

Department of Computer Science and Engineering, Notre Dame University Bangladesh, Dhaka, Bangladesh.

出版信息

Interdiscip Sci. 2017 Dec;9(4):512-527. doi: 10.1007/s12539-016-0158-7. Epub 2016 Mar 28.

DOI:10.1007/s12539-016-0158-7
PMID:27021490
Abstract

Damages or breaks in DNA may change the characteristics of genomes and causes various diseases. In this work we construct a system that incorporates the maximum likelihood-based probabilistic formula to assess the number of damages that have occurred in any DNA sequence. This approach has been progressively benchmarked by implementing simulated data sets so that the outcomes can be compared with a ground truth or reference value. At first the sequence data set order is checked through the statistical cumulative sum (STACUMSUM). The verified sequences are then estimated by prior and posterior probability to count the percentages of breaks and mutations. Maximum-likelihood estimation then finds out the exact numbers and positions of breaks and detections. In database manipulation, one factor that decides the orientation and order of the sequence is geometric distance between consecutive sequences. The geometric distance is measured for smooth representation of the genome or DNA sequences. Finally, we compared the performance of our system with DAMBE5: (A Comprehensive Software Package for Data Analysis in Molecular Biology and Evaluation), and in response to time and space complexity, StrucBreak is much faster and consumes much less space due to our algorithmic approaches.

摘要

DNA 的损伤或断裂可能会改变基因组的特征,导致各种疾病。在这项工作中,我们构建了一个系统,该系统结合了基于最大似然的概率公式来评估任何 DNA 序列中发生的损伤数量。通过实现模拟数据集,逐步对这种方法进行基准测试,以便可以将结果与真实值或参考值进行比较。首先,通过统计累积和 (STACUMSUM) 检查序列数据集的顺序。然后,通过先验和后验概率对经过验证的序列进行估计,以计算断裂和突变的百分比。最大似然估计然后找出断裂和检测的确切数量和位置。在数据库操作中,决定序列方向和顺序的一个因素是连续序列之间的几何距离。几何距离用于平滑表示基因组或 DNA 序列。最后,我们将我们的系统与 DAMBE5 的性能进行了比较:(用于分子生物学数据分析和评估的综合软件包),并且由于我们的算法方法,StrucBreak 在时间和空间复杂度方面的表现要快得多,消耗的空间也少得多。

相似文献

1
StrucBreak: A Computational Framework for Structural Break Detection in DNA Sequences.StrucBreak:一种用于 DNA 序列结构断裂检测的计算框架。
Interdiscip Sci. 2017 Dec;9(4):512-527. doi: 10.1007/s12539-016-0158-7. Epub 2016 Mar 28.
2
A structural EM algorithm for phylogenetic inference.一种用于系统发育推断的结构化期望最大化算法。
J Comput Biol. 2002;9(2):331-53. doi: 10.1089/10665270252935494.
3
Vestige: maximum likelihood phylogenetic footprinting.痕迹:最大似然系统发育足迹法。
BMC Bioinformatics. 2005 May 29;6:130. doi: 10.1186/1471-2105-6-130.
4
Minimal entropy probability paths between genome families.基因组家族之间的最小熵概率路径。
J Math Biol. 2004 May;48(5):563-90. doi: 10.1007/s00285-003-0248-0. Epub 2003 Dec 2.
5
Toward extracting all phylogenetic information from matrices of evolutionary distances.从进化距离矩阵中提取所有系统发育信息。
Science. 2010 Mar 12;327(5971):1376-9. doi: 10.1126/science.1182300.
6
Ancestral sequence alignment under optimal conditions.在最佳条件下进行祖先序列比对。
BMC Bioinformatics. 2005 Nov 17;6:273. doi: 10.1186/1471-2105-6-273.
7
Fast model-based protein homology detection without alignment.基于快速模型的无需比对的蛋白质同源性检测。
Bioinformatics. 2007 Jul 15;23(14):1728-36. doi: 10.1093/bioinformatics/btm247. Epub 2007 May 8.
8
A novel method for comparative analysis of DNA sequences by Ramanujan-Fourier transform.一种通过拉马努金-傅里叶变换对DNA序列进行比较分析的新方法。
J Comput Biol. 2014 Dec;21(12):867-79. doi: 10.1089/cmb.2014.0120.
9
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
10
A configuration space of homologous proteins conserving mutual information and allowing a phylogeny inference based on pair-wise Z-score probabilities.同源蛋白质的一种构象空间,其保留互信息并允许基于成对Z分数概率进行系统发育推断。
BMC Bioinformatics. 2005 Mar 10;6:49. doi: 10.1186/1471-2105-6-49.

引用本文的文献

1
Realizing drug repositioning by adapting a recommendation system to handle the process.通过调整推荐系统来处理流程,实现药物再定位。
BMC Bioinformatics. 2018 Apr 12;19(1):136. doi: 10.1186/s12859-018-2142-1.
2
HashClone: a new tool to quantify the minimal residual disease in B-cell lymphoma from deep sequencing data.HashClone:一种从深度测序数据中量化B细胞淋巴瘤微小残留病的新工具。
BMC Bioinformatics. 2017 Nov 23;18(1):516. doi: 10.1186/s12859-017-1923-2.