• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度强化学习的成对启发式序列比对算法

Pairwise Heuristic Sequence Alignment Algorithm Based on Deep Reinforcement Learning.

作者信息

Song Yong-Joon, Ji Dong Jin, Seo Hyein, Han Gyu-Bum, Cho Dong-Ho

机构信息

School of Electrical EngineeringKorea Advanced Institute of Science and Technology Daejeon 305-701 South Korea.

出版信息

IEEE Open J Eng Med Biol. 2021 Jan 29;2:36-43. doi: 10.1109/OJEMB.2021.3055424. eCollection 2021.

DOI:10.1109/OJEMB.2021.3055424
PMID:35402983
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8901008/
Abstract

Various methods have been developed to analyze the association between organisms and their genomic sequences. Among them, sequence alignment is the most frequently used method for comparative analysis of biological genomes. We intend to propose a novel pairwise sequence alignment method using deep reinforcement learning to break out the old pairwise alignment algorithms. We defined the environment and agent to enable reinforcement learning in the sequence alignment system. This novel method, named DQNalign, can immediately determine the next direction by observing the subsequences within the moving window. DQNalign shows superiority in the dissimilar sequence pairs that have low identity values. And theoretically, we confirm that DQNalign has a low dimension for the sequence length in view of the complexity. This research shows the application method of deep reinforcement learning to the sequence alignment system and how deep reinforcement learning can improve the conventional sequence alignment method.

摘要

已经开发出各种方法来分析生物体与其基因组序列之间的关联。其中,序列比对是生物基因组比较分析中最常用的方法。我们打算提出一种使用深度强化学习的新型成对序列比对方法,以突破旧的成对比对算法。我们定义了环境和智能体,以便在序列比对系统中进行强化学习。这种名为DQNalign的新方法可以通过观察移动窗口内的子序列立即确定下一个方向。DQNalign在具有低同一性值的不相似序列对中表现出优越性。并且从理论上讲,鉴于复杂性,我们确认DQNalign对于序列长度具有低维度。这项研究展示了深度强化学习在序列比对系统中的应用方法,以及深度强化学习如何改进传统的序列比对方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/011bedd52b67/cho6-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/8f919de2fb6f/cho1-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/69038befa56b/cho2-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/6b10ac0129bc/cho3-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/72f978a3da97/cho4-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/abc185d8dbdf/cho5-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/011bedd52b67/cho6-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/8f919de2fb6f/cho1-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/69038befa56b/cho2-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/6b10ac0129bc/cho3-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/72f978a3da97/cho4-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/abc185d8dbdf/cho5-3055424.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c448/8901008/011bedd52b67/cho6-3055424.jpg

相似文献

1
Pairwise Heuristic Sequence Alignment Algorithm Based on Deep Reinforcement Learning.基于深度强化学习的成对启发式序列比对算法
IEEE Open J Eng Med Biol. 2021 Jan 29;2:36-43. doi: 10.1109/OJEMB.2021.3055424. eCollection 2021.
2
Local Alignment of DNA Sequence Based on Deep Reinforcement Learning.基于深度强化学习的DNA序列局部比对
IEEE Open J Eng Med Biol. 2021 Apr 27;2:170-178. doi: 10.1109/OJEMB.2021.3076156. eCollection 2021.
3
Deep reinforcement learning-based pairwise DNA sequence alignment method compatible with embedded edge devices.基于深度强化学习的兼容嵌入式边缘设备的两两 DNA 序列比对方法。
Sci Rep. 2023 Feb 16;13(1):2773. doi: 10.1038/s41598-023-29277-6.
4
Multiple sequence alignment based on deep reinforcement learning with self-attention and positional encoding.基于深度强化学习的具有自注意力和位置编码的多序列比对。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad636.
5
Multiple Sequence Alignment based on deep Q network with negative feedback policy.基于深度 Q 网络与负反馈策略的多重序列比对。
Comput Biol Chem. 2022 Dec;101:107780. doi: 10.1016/j.compbiolchem.2022.107780. Epub 2022 Oct 27.
6
Super pairwise alignment (SPA): an efficient approach to global alignment for homologous sequences.超双序列比对(SPA):一种用于同源序列全局比对的高效方法。
J Comput Biol. 2002;9(3):477-86. doi: 10.1089/106652702760138574.
7
MTRAP: pairwise sequence alignment algorithm by a new measure based on transition probability between two consecutive pairs of residues.MTRAP:一种基于两个连续残基对之间转移概率的新度量的两两序列比对算法。
BMC Bioinformatics. 2010 May 8;11:235. doi: 10.1186/1471-2105-11-235.
8
NdPASA: a novel pairwise protein sequence alignment algorithm that incorporates neighbor-dependent amino acid propensities.NdPASA:一种整合了邻域依赖氨基酸倾向的新型双序列蛋白质序列比对算法。
Proteins. 2005 Feb 15;58(3):628-37. doi: 10.1002/prot.20359.
9
SGA: a grammar-based alignment algorithm.SGA:一种基于语法的比对算法。
Comput Methods Programs Biomed. 2007 Apr;86(1):17-20. doi: 10.1016/j.cmpb.2006.12.007. Epub 2007 Jan 30.
10
Detailed protein sequence alignment based on Spectral Similarity Score (SSS).基于光谱相似性评分(SSS)的详细蛋白质序列比对。
BMC Bioinformatics. 2005 Apr 23;6:105. doi: 10.1186/1471-2105-6-105.

引用本文的文献

1
Deep reinforcement learning-based pairwise DNA sequence alignment method compatible with embedded edge devices.基于深度强化学习的兼容嵌入式边缘设备的两两 DNA 序列比对方法。
Sci Rep. 2023 Feb 16;13(1):2773. doi: 10.1038/s41598-023-29277-6.
2
learnMSA: learning and aligning large protein families.learnMSA:学习和对齐大型蛋白质家族。
Gigascience. 2022 Nov 18;11. doi: 10.1093/gigascience/giac104.
3
Heuristic Pairwise Alignment in Database Environments.启发式两两比对在数据库环境中的应用。

本文引用的文献

1
Rat Genome Assemblies, Annotation, and Variant Repository.大鼠基因组组装、注释及变异体数据库
Methods Mol Biol. 2019;2018:43-70. doi: 10.1007/978-1-4939-9581-3_2.
2
The Genome of C57BL/6J "Eve", the Mother of the Laboratory Mouse Genome Reference Strain.C57BL/6J“Eve”基因组,实验鼠基因组参考株的“母亲”。
G3 (Bethesda). 2019 Jun 5;9(6):1795-1805. doi: 10.1534/g3.119.400071.
3
MUMmer4: A fast and versatile genome alignment system.MUMmer4:一种快速且通用的基因组比对系统。
Genes (Basel). 2022 Nov 2;13(11):2005. doi: 10.3390/genes13112005.
4
Local Alignment of DNA Sequence Based on Deep Reinforcement Learning.基于深度强化学习的DNA序列局部比对
IEEE Open J Eng Med Biol. 2021 Apr 27;2:170-178. doi: 10.1109/OJEMB.2021.3076156. eCollection 2021.
PLoS Comput Biol. 2018 Jan 26;14(1):e1005944. doi: 10.1371/journal.pcbi.1005944. eCollection 2018 Jan.
4
Maximum-likelihood estimation of the statistical distribution of Smith-Waterman local sequence similarity scores.史密斯-沃特曼局部序列相似性得分统计分布的最大似然估计。
Bull Math Biol. 1992 Jan;54(1):59-75. doi: 10.1007/BF02458620.
5
A novel k-word relative measure for sequence comparison.一种用于序列比较的新型k字相对度量。
Comput Biol Chem. 2014 Dec;53PB:331-338. doi: 10.1016/j.compbiolchem.2014.10.007. Epub 2014 Nov 7.
6
The ETS domain transcription factor ELK1 directs a critical component of growth signaling by the androgen receptor in prostate cancer cells.ETS 结构域转录因子 Elk1 指导雄激素受体在前列腺癌细胞中生长信号的关键组成部分。
J Biol Chem. 2013 Apr 19;288(16):11047-65. doi: 10.1074/jbc.M112.438473. Epub 2013 Feb 20.
7
MAFFT multiple sequence alignment software version 7: improvements in performance and usability.MAFFT 多序列比对软件版本 7:性能和易用性的改进。
Mol Biol Evol. 2013 Apr;30(4):772-80. doi: 10.1093/molbev/mst010. Epub 2013 Jan 16.
8
Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.使用 Clustal Omega 快速、可扩展地生成高质量蛋白质多重序列比对。
Mol Syst Biol. 2011 Oct 11;7:539. doi: 10.1038/msb.2011.75.
9
Novel association strategy with copy number variation for identifying new risk Loci of human diseases.新型关联策略结合拷贝数变异分析鉴定人类疾病新的风险基因座。
PLoS One. 2010 Aug 20;5(8):e12185. doi: 10.1371/journal.pone.0012185.
10
BLAST+: architecture and applications.BLAST+:体系结构与应用。
BMC Bioinformatics. 2009 Dec 15;10:421. doi: 10.1186/1471-2105-10-421.