• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iEnhancer-EBLSTM:通过双向长短期记忆集成识别增强子及其强度

iEnhancer-EBLSTM: Identifying Enhancers and Strengths by Ensembles of Bidirectional Long Short-Term Memory.

作者信息

Niu Kun, Luo Ximei, Zhang Shumei, Teng Zhixia, Zhang Tianjiao, Zhao Yuming

机构信息

College of Information and Computer Engineering, Northeast Forestry University, Harbin, China.

School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.

出版信息

Front Genet. 2021 Mar 23;12:665498. doi: 10.3389/fgene.2021.665498. eCollection 2021.

DOI:10.3389/fgene.2021.665498
PMID:33833783
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8021722/
Abstract

Enhancers are regulatory DNA sequences that could be bound by specific proteins named transcription factors (TFs). The interactions between enhancers and TFs regulate specific genes by increasing the target gene expression. Therefore, enhancer identification and classification have been a critical issue in the enhancer field. Unfortunately, so far there has been a lack of suitable methods to identify enhancers. Previous research has mainly focused on the features of the enhancer's function and interactions, which ignores the sequence information. As we know, the recurrent neural network (RNN) and long short-term memory (LSTM) models are currently the most common methods for processing time series data. LSTM is more suitable than RNN to address the DNA sequence. In this paper, we take the advantages of LSTM to build a method named iEnhancer-EBLSTM to identify enhancers. iEnhancer-ensembles of bidirectional LSTM (EBLSTM) consists of two steps. In the first step, we extract subsequences by sliding a 3-mer window along the DNA sequence as features. Second, EBLSTM model is used to identify enhancers from the candidate input sequences. We use the dataset from the study of Quang H et al. as the benchmarks. The experimental results from the datasets demonstrate the efficiency of our proposed model.

摘要

增强子是可被名为转录因子(TFs)的特定蛋白质结合的调控DNA序列。增强子与转录因子之间的相互作用通过增加靶基因表达来调控特定基因。因此,增强子的识别和分类一直是增强子领域的关键问题。不幸的是,到目前为止一直缺乏合适的方法来识别增强子。先前的研究主要集中在增强子的功能和相互作用的特征上,而忽略了序列信息。众所周知,循环神经网络(RNN)和长短期记忆(LSTM)模型是目前处理时间序列数据最常用的方法。LSTM比RNN更适合处理DNA序列。在本文中,我们利用LSTM的优势构建了一种名为iEnhancer-EBLSTM的方法来识别增强子。双向LSTM增强子集成(EBLSTM)由两个步骤组成。第一步,我们通过沿着DNA序列滑动一个3聚体窗口来提取子序列作为特征。第二步,使用EBLSTM模型从候选输入序列中识别增强子。我们将Quang H等人研究中的数据集用作基准。来自这些数据集的实验结果证明了我们所提出模型的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/854262cff026/fgene-12-665498-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/a412e3b79dfb/fgene-12-665498-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/5db78cbbb4de/fgene-12-665498-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/d416dd1f6525/fgene-12-665498-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/c591fbf78ed4/fgene-12-665498-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/854262cff026/fgene-12-665498-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/a412e3b79dfb/fgene-12-665498-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/5db78cbbb4de/fgene-12-665498-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/d416dd1f6525/fgene-12-665498-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/c591fbf78ed4/fgene-12-665498-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c8e/8021722/854262cff026/fgene-12-665498-g005.jpg

相似文献

1
iEnhancer-EBLSTM: Identifying Enhancers and Strengths by Ensembles of Bidirectional Long Short-Term Memory.iEnhancer-EBLSTM:通过双向长短期记忆集成识别增强子及其强度
Front Genet. 2021 Mar 23;12:665498. doi: 10.3389/fgene.2021.665498. eCollection 2021.
2
iEnhancer-DLRA: identification of enhancers and their strengths by a self-attention fusion strategy for local and global features.iEnhancer-DLRA:通过自注意力融合策略识别增强子及其强度,用于局部和全局特征。
Brief Funct Genomics. 2022 Sep 16;21(5):399-407. doi: 10.1093/bfgp/elac023.
3
iEnhancer-SKNN: a stacking ensemble learning-based method for enhancer identification and classification using sequence information.iEnhancer-SKNN:一种基于堆叠集成学习的方法,用于使用序列信息进行增强子识别和分类。
Brief Funct Genomics. 2023 May 18;22(3):302-311. doi: 10.1093/bfgp/elac057.
4
iEnhancer-DCLA: using the original sequence to identify enhancers and their strength based on a deep learning framework.iEnhancer-DCLA:基于深度学习框架,使用原始序列识别增强子及其强度。
BMC Bioinformatics. 2022 Nov 14;23(1):480. doi: 10.1186/s12859-022-05033-x.
5
iEnhancer-ECNN: identifying enhancers and their strength using ensembles of convolutional neural networks.iEnhancer-ECNN:使用卷积神经网络的集合来识别增强子及其强度。
BMC Genomics. 2019 Dec 24;20(Suppl 9):951. doi: 10.1186/s12864-019-6336-3.
6
iEnhancer-MFGBDT: Identifying enhancers and their strength by fusing multiple features and gradient boosting decision tree.iEnhancer-MFGBDT:通过融合多种特征和梯度提升决策树来识别增强子及其强度。
Math Biosci Eng. 2021 Oct 14;18(6):8797-8814. doi: 10.3934/mbe.2021434.
7
iEnhancer-GAN: A Deep Learning Framework in Combination with Word Embedding and Sequence Generative Adversarial Net to Identify Enhancers and Their Strength.iEnhancer-GAN:一种结合词嵌入和序列生成对抗网络以识别增强子及其强度的深度学习框架。
Int J Mol Sci. 2021 Mar 30;22(7):3589. doi: 10.3390/ijms22073589.
8
iEnhancer-RD: Identification of enhancers and their strength using RKPK features and deep neural networks.iEnhancer-RD:利用RKPK特征和深度神经网络识别增强子及其强度。
Anal Biochem. 2021 Oct 1;630:114318. doi: 10.1016/j.ab.2021.114318. Epub 2021 Aug 5.
9
iEnhancer-MRBF: Identifying enhancers and their strength with a multiple Laplacian-regularized radial basis function network.iEnhancer-MRBF:基于多重拉普拉斯正则化径向基函数网络的增强子识别及其强度预测。
Methods. 2022 Dec;208:1-8. doi: 10.1016/j.ymeth.2022.10.001. Epub 2022 Oct 8.
10
iEnhancer-5Step: Identifying enhancers using hidden information of DNA sequences via Chou's 5-step rule and word embedding.iEnhancer-5Step:通过 Chou 的 5 步规则和词嵌入利用 DNA 序列的隐藏信息识别增强子。
Anal Biochem. 2019 Apr 15;571:53-61. doi: 10.1016/j.ab.2019.02.017. Epub 2019 Feb 26.

引用本文的文献

1
AttnW2V-Enhancer: Leveraging attention and Word2Vec for enhanced enhancer prediction.注意力加权词向量增强器:利用注意力机制和词向量进行增强的增强子预测。
Comput Struct Biotechnol J. 2025 Jul 23;27:3275-3284. doi: 10.1016/j.csbj.2025.07.008. eCollection 2025.
2
iKcr-DRC: prediction of lysine crotonylation sites in proteins based on a novel attention module and DenseNet.iKcr-DRC:基于新型注意力模块和密集连接网络的蛋白质赖氨酸巴豆酰化位点预测
Front Genet. 2025 Jun 11;16:1574832. doi: 10.3389/fgene.2025.1574832. eCollection 2025.
3
iEnhancer-GDM: A Deep Learning Framework Based on Generative Adversarial Network and Multi-head Attention Mechanism to Identify Enhancers and Their Strength.

本文引用的文献

1
A sequence-based deep learning approach to predict CTCF-mediated chromatin loop.基于序列的深度学习方法预测 CTCF 介导的染色质环。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab031.
2
iBLP: An XGBoost-Based Predictor for Identifying Bioluminescent Proteins.iBLP:一种基于 XGBoost 的生物发光蛋白鉴定预测器。
Comput Math Methods Med. 2021 Jan 7;2021:6664362. doi: 10.1155/2021/6664362. eCollection 2021.
3
Prediction of bio-sequence modifications and the associations with diseases.生物序列修饰的预测及其与疾病的关联。
iEnhancer-GDM:一种基于生成对抗网络和多头注意力机制的深度学习框架,用于识别增强子及其强度。
Interdiscip Sci. 2025 May 7. doi: 10.1007/s12539-025-00703-9.
4
Utilizing a deep learning model based on BERT for identifying enhancers and their strength.利用基于BERT的深度学习模型来识别增强子及其强度。
PLoS One. 2025 Apr 9;20(4):e0320085. doi: 10.1371/journal.pone.0320085. eCollection 2025.
5
DeepEnhancerPPO: An Interpretable Deep Learning Approach for Enhancer Classification.深度增强子近端策略优化算法:一种用于增强子分类的可解释深度学习方法。
Int J Mol Sci. 2024 Dec 2;25(23):12942. doi: 10.3390/ijms252312942.
6
DeepDualEnhancer: A Dual-Feature Input DNABert Based Deep Learning Method for Enhancer Recognition.DeepDualEnhancer:一种基于双特征输入的 DNA 语言模型的深度学习方法,用于增强子识别。
Int J Mol Sci. 2024 Nov 1;25(21):11744. doi: 10.3390/ijms252111744.
7
CapsEnhancer: An Effective Computational Framework for Identifying Enhancers Based on Chaos Game Representation and Capsule Network.CapsEnhancer:一种基于混沌游戏表示和胶囊网络的有效识别增强子的计算框架。
J Chem Inf Model. 2024 Jul 22;64(14):5725-5736. doi: 10.1021/acs.jcim.4c00546. Epub 2024 Jun 30.
8
A deep learning model for DNA enhancer prediction based on nucleotide position aware feature encoding.一种基于核苷酸位置感知特征编码的DNA增强子预测深度学习模型。
iScience. 2024 May 19;27(6):110030. doi: 10.1016/j.isci.2024.110030. eCollection 2024 Jun 21.
9
iPro2L-DG: Hybrid network based on improved densenet and global attention mechanism for identifying promoter sequences.iPro2L-DG:基于改进型密集连接网络和全局注意力机制的混合网络用于识别启动子序列。
Heliyon. 2024 Mar 6;10(6):e27364. doi: 10.1016/j.heliyon.2024.e27364. eCollection 2024 Mar 30.
10
Enhancer-MDLF: a novel deep learning framework for identifying cell-specific enhancers.增强型 MDLF:一种用于识别细胞特异性增强子的新型深度学习框架。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae083.
Brief Funct Genomics. 2021 Mar 2;20(1):1-18. doi: 10.1093/bfgp/elaa023.
4
Deep-Kcr: accurate detection of lysine crotonylation sites using deep learning method.Deep-Kcr:一种使用深度学习方法准确检测赖氨酸巴豆酰化位点的技术。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa255.
5
ProtFold-DFG: protein fold recognition by combining Directed Fusion Graph and PageRank algorithm.ProtFold-DFG:通过结合定向融合图和 PageRank 算法进行蛋白质折叠识别。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa192.
6
Prediction of antioxidant proteins using hybrid feature representation method and random forest.基于混合特征表示方法和随机森林的抗氧化蛋白预测。
Genomics. 2020 Nov;112(6):4666-4674. doi: 10.1016/j.ygeno.2020.08.016. Epub 2020 Aug 17.
7
Prediction of drug response in multilayer networks based on fusion of multiomics data.基于多组学数据融合的多层网络药物反应预测。
Methods. 2021 Aug;192:85-92. doi: 10.1016/j.ymeth.2020.08.006. Epub 2020 Aug 13.
8
iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features.iCarPS:一种通过新型编码特征识别蛋白质羰基化位点的计算工具。
Bioinformatics. 2021 Apr 19;37(2):171-177. doi: 10.1093/bioinformatics/btaa702.
9
Exploring Drug Treatment Patterns Based on the Action of Drug and Multilayer Network Model.基于药物作用和多层网络模型探索药物治疗模式。
Int J Mol Sci. 2020 Jul 16;21(14):5014. doi: 10.3390/ijms21145014.
10
IDP-Seq2Seq: identification of intrinsically disordered regions based on sequence to sequence learning.IDP-Seq2Seq:基于序列到序列学习的无规卷曲区域鉴定。
Bioinformatics. 2021 Jan 29;36(21):5177-5186. doi: 10.1093/bioinformatics/btaa667.