Suppr超能文献

一种在应用于电泳图谱分析时具有新颖改进的峰对齐算法。

A peak alignment algorithm with novel improvements in application to electropherogram analysis.

作者信息

Karabiber Fethullah

机构信息

Department of Computer Engineering, Yildiz Technical University, 34220, Istanbul, Turkey.

出版信息

J Bioinform Comput Biol. 2013 Oct;11(5):1350011. doi: 10.1142/S021972001350011X. Epub 2013 Jul 29.

Abstract

Alignment of peaks in electropherograms or chromatograms obtained from experimental techniques such capillary electrophoresis remains a significant challenge. Accurate alignment is critical for accurate interpretation of various classes of nucleic acid analysis technologies, including conventional DNA sequencing and new RNA structure probing technologies. An automated alignment algorithm was developed based on dynamic programming to align multiple-peak time-series data both globally and locally. This algorithm relies on a new peak similarity measure and other features such as time penalties, global constraints, and minimum-similarity scores and results in rapid, highly accurate comparisons of complex time-series datasets. As a demonstrative case study, the developed algorithm was applied to analysis of capillary electrophoresis data from a Selective 2'-Hydroxyl Acylation analyzed by Primer Extension (SHAPE) evaluation of RNA secondary structure. The algorithm yielded robust analysis of challenging SHAPE probing data. Experimental results show that the peak alignment algorithm corrects retention time variation efficiently due to the presence of fluorescent tags on fragments and differences in capillaries. The tools can be readily adapted for the analysis other biological datasets in which peak retention times vary.

摘要

从诸如毛细管电泳等实验技术获得的电泳图或色谱图中的峰对齐仍然是一项重大挑战。准确对齐对于准确解释各类核酸分析技术至关重要,包括传统的DNA测序和新的RNA结构探测技术。基于动态规划开发了一种自动对齐算法,用于对多峰时间序列数据进行全局和局部对齐。该算法依赖于一种新的峰相似性度量以及其他特征,如时间惩罚、全局约束和最小相似性分数,能够快速、高度准确地比较复杂的时间序列数据集。作为一个示范性案例研究,将所开发的算法应用于对通过RNA二级结构的选择性2'-羟基酰化引物延伸分析(SHAPE)评估得到的毛细管电泳数据进行分析。该算法对具有挑战性的SHAPE探测数据进行了稳健的分析。实验结果表明,由于片段上存在荧光标签以及毛细管的差异,峰对齐算法能够有效地校正保留时间变化。这些工具可以很容易地适用于分析峰保留时间变化的其他生物数据集。

相似文献

1
A peak alignment algorithm with novel improvements in application to electropherogram analysis.
J Bioinform Comput Biol. 2013 Oct;11(5):1350011. doi: 10.1142/S021972001350011X. Epub 2013 Jul 29.
2
RiboCAT: a new capillary electrophoresis data analysis tool for nucleic acid probing.
RNA. 2017 Feb;23(2):240-249. doi: 10.1261/rna.058404.116. Epub 2016 Nov 7.
4
RNA secondary structure prediction using high-throughput SHAPE.
J Vis Exp. 2013 May 31(75):e50243. doi: 10.3791/50243.
6
Automated band annotation for RNA structure probing experiments with numerous capillary electrophoresis profiles.
Bioinformatics. 2015 Sep 1;31(17):2808-15. doi: 10.1093/bioinformatics/btv282. Epub 2015 May 5.
7
High-throughput single-nucleotide structural mapping by capillary automated footprinting analysis.
Nucleic Acids Res. 2008 Jun;36(11):e63. doi: 10.1093/nar/gkn267. Epub 2008 May 13.
8
Simplified RNA secondary structure mapping by automation of SHAPE data analysis.
Nucleic Acids Res. 2011 Dec;39(22):e151. doi: 10.1093/nar/gkr773. Epub 2011 Sep 30.
10
Efficient alignment of RNA secondary structures using sparse dynamic programming.
BMC Bioinformatics. 2013 Sep 8;14:269. doi: 10.1186/1471-2105-14-269.

本文引用的文献

1
SHAPE-directed RNA secondary structure prediction.
Methods. 2010 Oct;52(2):150-8. doi: 10.1016/j.ymeth.2010.06.007. Epub 2010 Jun 8.
2
Enhancing metabolomic data analysis with Progressive Consensus Alignment of NMR Spectra (PCANS).
BMC Bioinformatics. 2010 Mar 9;11:123. doi: 10.1186/1471-2105-11-123.
6
What is dynamic programming?
Nat Biotechnol. 2004 Jul;22(7):909-10. doi: 10.1038/nbt0704-909.
7
A general method applicable to the search for similarities in the amino acid sequence of two proteins.
J Mol Biol. 1970 Mar;48(3):443-53. doi: 10.1016/0022-2836(70)90057-4.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验