• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ReAlign-N:一种用于多核酸序列比对的综合重排方法,结合了全局和局部重排。

ReAlign-N: an integrated realignment approach for multiple nucleic acid sequence alignment, combining global and local realignments.

作者信息

Zhai Yixiao, Zhou Tong, Wei Yanming, Zou Quan, Wang Yansu

机构信息

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, No.2006, Xiyuan Avenue, Pidu Zone, Chengdu 610054, China.

Institute of Digital Health, Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, No.1, Chengdian Road, Kecheng Zone, Quzhou 324003, China.

出版信息

NAR Genom Bioinform. 2024 Dec 18;6(4):lqae170. doi: 10.1093/nargab/lqae170. eCollection 2024 Dec.

DOI:10.1093/nargab/lqae170
PMID:39703429
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11655299/
Abstract

Ensuring accurate multiple sequence alignment (MSA) is essential for comprehensive biological sequence analysis. However, the complexity of evolutionary relationships often results in variations that generic alignment tools may not adequately address. Realignment is crucial to remedy this issue. Currently, there is a lack of realignment methods tailored for nucleic acid sequences, particularly for lengthy sequences. Thus, there's an urgent need for the development of realignment methods better suited to address these challenges. This study presents ReAlign-N, a realignment method explicitly designed for multiple nucleic acid sequence alignment. ReAlign-N integrates both global and local realignment strategies for improved accuracy. In the global realignment phase, ReAlign-N incorporates K-Band and innovative memory-saving technology into the dynamic programming approach, ensuring high efficiency and minimal memory requirements for large-scale realignment tasks. The local realignment stage employs full matching and entropy scoring methods to identify low-quality regions and conducts realignment through MAFFT. Experimental results demonstrate that ReAlign-N consistently outperforms initial alignments on simulated and real datasets. Furthermore, compared to ReformAlign, the only existing multiple nucleic acid sequence realignment tool, ReAlign-N, exhibits shorter running times and occupies less memory space. The source code and test data for ReAlign-N are available on GitHub (https://github.com/malabz/ReAlign-N).

摘要

确保准确的多序列比对(MSA)对于全面的生物序列分析至关重要。然而,进化关系的复杂性常常导致通用比对工具可能无法充分解决的变异。重新比对对于解决这个问题至关重要。目前,缺乏专门针对核酸序列,特别是长序列的重新比对方法。因此,迫切需要开发更适合应对这些挑战的重新比对方法。本研究提出了ReAlign-N,一种专门为多个核酸序列比对设计的重新比对方法。ReAlign-N整合了全局和局部重新比对策略以提高准确性。在全局重新比对阶段,ReAlign-N将K波段和创新的内存节省技术纳入动态规划方法,确保大规模重新比对任务的高效率和最小内存需求。局部重新比对阶段采用完全匹配和熵评分方法来识别低质量区域,并通过MAFFT进行重新比对。实验结果表明,ReAlign-N在模拟和真实数据集上始终优于初始比对。此外,与现有的唯一多个核酸序列重新比对工具ReformAlign相比,ReAlign-N运行时间更短,占用内存空间更少。ReAlign-N的源代码和测试数据可在GitHub(https://github.com/malabz/ReAlign-N)上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/8393f2215f4a/lqae170fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/4e1f7f8bf65c/lqae170fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/5c82d6d25419/lqae170fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/29240da14375/lqae170fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/e10328c12bca/lqae170fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/8393f2215f4a/lqae170fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/4e1f7f8bf65c/lqae170fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/5c82d6d25419/lqae170fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/29240da14375/lqae170fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/e10328c12bca/lqae170fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8481/11655299/8393f2215f4a/lqae170fig5.jpg

相似文献

1
ReAlign-N: an integrated realignment approach for multiple nucleic acid sequence alignment, combining global and local realignments.ReAlign-N:一种用于多核酸序列比对的综合重排方法,结合了全局和局部重排。
NAR Genom Bioinform. 2024 Dec 18;6(4):lqae170. doi: 10.1093/nargab/lqae170. eCollection 2024 Dec.
2
TPMA: A two pointers meta-alignment tool to ensemble different multiple nucleic acid sequence alignments.TPMA:一种双指针元比对工具,用于集成不同的多个核酸序列比对。
PLoS Comput Biol. 2024 Apr 1;20(4):e1011988. doi: 10.1371/journal.pcbi.1011988. eCollection 2024 Apr.
3
WMSA: a novel method for multiple sequence alignment of DNA sequences.WMSA:一种用于 DNA 序列多重序列比对的新方法。
Bioinformatics. 2022 Nov 15;38(22):5019-5025. doi: 10.1093/bioinformatics/btac658.
4
SpliVert: A Protein Multiple Sequence Alignment Refinement Method Based on Splitting-Splicing Vertically.SpliVert:一种基于垂直拆分-拼接的蛋白质多序列比对优化方法。
Protein Pept Lett. 2020;27(4):295-302. doi: 10.2174/0929866526666190806143959.
5
EMMA: a new method for computing multiple sequence alignments given a constraint subset alignment.EMMA:一种在给定约束子集比对的情况下计算多序列比对的新方法。
Algorithms Mol Biol. 2023 Dec 7;18(1):21. doi: 10.1186/s13015-023-00247-x.
6
ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach.ReformAlign:基于轮廓的元对齐方法改进的多重序列比对。
BMC Bioinformatics. 2014 Aug 7;15(1):265. doi: 10.1186/1471-2105-15-265.
7
FMAlign2: a novel fast multiple nucleotide sequence alignment method for ultralong datasets.FMAlign2:一种新颖的快速多核苷酸序列比对方法,适用于超大数据集。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae014.
8
Adaptive Local Realignment of Protein Sequences.蛋白质序列的适应性局部重排
J Comput Biol. 2018 Jul;25(7):780-793. doi: 10.1089/cmb.2018.0045. Epub 2018 Jun 11.
9
WMSA 2: a multiple DNA/RNA sequence alignment tool implemented with accurate progressive mode and a fast win-win mode combining the center star and progressive strategies.WMSA 2:一种采用精确渐进模式和快速双赢模式(结合中心星和渐进策略)的多 DNA/RNA 序列比对工具。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad190.
10
MAGUS: Multiple sequence Alignment using Graph clUStering.MAGUS:基于图聚类的多重序列比对。
Bioinformatics. 2021 Jul 19;37(12):1666-1672. doi: 10.1093/bioinformatics/btaa992.

引用本文的文献

1
ReAlign-P: a vertical iterative realignment method for protein multiple sequence alignment.ReAlign-P:一种用于蛋白质多序列比对的垂直迭代重排方法。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf421.

本文引用的文献

1
FMAlign2: a novel fast multiple nucleotide sequence alignment method for ultralong datasets.FMAlign2:一种新颖的快速多核苷酸序列比对方法,适用于超大数据集。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae014.
2
WMSA 2: a multiple DNA/RNA sequence alignment tool implemented with accurate progressive mode and a fast win-win mode combining the center star and progressive strategies.WMSA 2:一种采用精确渐进模式和快速双赢模式(结合中心星和渐进策略)的多 DNA/RNA 序列比对工具。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad190.
3
WMSA: a novel method for multiple sequence alignment of DNA sequences.
WMSA:一种用于 DNA 序列多重序列比对的新方法。
Bioinformatics. 2022 Nov 15;38(22):5019-5025. doi: 10.1093/bioinformatics/btac658.
4
HAlign 3: Fast Multiple Alignment of Ultra-Large Numbers of Similar DNA/RNA Sequences.HAlign 3:快速对齐超大量相似 DNA/RNA 序列。
Mol Biol Evol. 2022 Aug 3;39(8). doi: 10.1093/molbev/msac166.
5
Developments in Algorithms for Sequence Alignment: A Review.序列比对算法的发展:综述。
Biomolecules. 2022 Apr 6;12(4):546. doi: 10.3390/biom12040546.
6
RPfam: A refiner towards curated-like multiple sequence alignments of the Pfam protein families.RPfam:一个针对 Pfam 蛋白质家族精心整理的多重序列比对的工具。
J Bioinform Comput Biol. 2022 Aug;20(4):2240002. doi: 10.1142/S0219720022400029. Epub 2022 Apr 14.
7
A novel fast multiple nucleotide sequence alignment method based on FM-index.基于 FM-index 的新型快速多核苷酸序列比对方法。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab519.
8
IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era.IQ-TREE 2:基因组时代系统发育推断的新模型和有效方法。
Mol Biol Evol. 2020 May 1;37(5):1530-1534. doi: 10.1093/molbev/msaa015.
9
Kalign 3: multiple sequence alignment of large data sets.Kalign 3:大型数据集的多序列比对
Bioinformatics. 2019 Oct 26;36(6):1928-9. doi: 10.1093/bioinformatics/btz795.
10
SpliVert: A Protein Multiple Sequence Alignment Refinement Method Based on Splitting-Splicing Vertically.SpliVert:一种基于垂直拆分-拼接的蛋白质多序列比对优化方法。
Protein Pept Lett. 2020;27(4):295-302. doi: 10.2174/0929866526666190806143959.