• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepIDP-2L:通过组合卷积注意网络和层次注意网络进行蛋白质固有无序区域预测。

DeepIDP-2L: protein intrinsically disordered region prediction by combining convolutional attention network and hierarchical attention network.

机构信息

School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China.

Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing 100081, China.

出版信息

Bioinformatics. 2022 Feb 7;38(5):1252-1260. doi: 10.1093/bioinformatics/btab810.

DOI:10.1093/bioinformatics/btab810
PMID:34864847
Abstract

MOTIVATION

Intrinsically disordered regions (IDRs) are widely distributed in proteins. Accurate prediction of IDRs is critical for the protein structure and function analysis. The IDRs are divided into long disordered regions (LDRs) and short disordered regions (SDRs) according to their lengths. Previous studies have shown that LDRs and SDRs have different proprieties. However, the existing computational methods fail to extract different features for LDRs and SDRs separately. As a result, they achieve unstable performance on datasets with different ratios of LDRs and SDRs.

RESULTS

In this study, a two-layer predictor was proposed called DeepIDP-2L. In the first layer, two kinds of attention-based models are used to extract different features for LDRs and SDRs, respectively. The hierarchical attention network is used to capture the distribution pattern features of LDRs, and convolutional attention network is used to capture the local correlation features of SDRs. The second layer of DeepIDP-2L maps the feature extracted in the first layer into a new feature space. Convolutional network and bidirectional long short term memory are used to capture the local and long-range information for predicting both SDRs and LDRs. Experimental results show that DeepIDP-2L can achieve more stable performance than other exiting predictors on independent test sets with different ratios of SDRs and LDRs.

AVAILABILITY AND IMPLEMENTATION

For the convenience of most experimental scientists, a user-friendly and publicly accessible web-server for the new predictor has been established at http://bliulab.net/DeepIDP-2L/. It is anticipated that DeepIDP-2L will become a very useful tool for identification of intrinsically disordered regions.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

无序区域(IDR)广泛存在于蛋白质中。准确预测 IDR 对于蛋白质结构和功能分析至关重要。根据长度,IDR 分为长无序区域(LDR)和短无序区域(SDR)。先前的研究表明,LDR 和 SDR 具有不同的特性。然而,现有的计算方法无法分别提取 LDR 和 SDR 的不同特征。因此,它们在 LDR 和 SDR 比例不同的数据集上表现不稳定。

结果

在这项研究中,提出了一种称为 DeepIDP-2L 的两层预测器。在第一层中,使用两种基于注意力的模型分别提取 LDR 和 SDR 的不同特征。使用层次注意力网络捕获 LDR 的分布模式特征,使用卷积注意力网络捕获 SDR 的局部相关特征。DeepIDP-2L 的第二层将第一层提取的特征映射到新的特征空间。卷积网络和双向长短期记忆用于捕获局部和远程信息,以预测 SDR 和 LDR。实验结果表明,与其他现有预测器相比,DeepIDP-2L 在 SDR 和 LDR 比例不同的独立测试集上可以实现更稳定的性能。

可用性和实现

为了方便大多数实验科学家,我们在 http://bliulab.net/DeepIDP-2L/ 上建立了一个用户友好且可公开访问的新预测器的网络服务器。预计 DeepIDP-2L 将成为识别内在无序区域的非常有用的工具。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
DeepIDP-2L: protein intrinsically disordered region prediction by combining convolutional attention network and hierarchical attention network.DeepIDP-2L:通过组合卷积注意网络和层次注意网络进行蛋白质固有无序区域预测。
Bioinformatics. 2022 Feb 7;38(5):1252-1260. doi: 10.1093/bioinformatics/btab810.
2
Protein intrinsically disordered region prediction by combining neural architecture search and multi-objective genetic algorithm.通过结合神经结构搜索和多目标遗传算法进行蛋白质无规则区域预测。
BMC Biol. 2023 Sep 7;21(1):188. doi: 10.1186/s12915-023-01672-5.
3
IDP-Seq2Seq: identification of intrinsically disordered regions based on sequence to sequence learning.IDP-Seq2Seq:基于序列到序列学习的无规卷曲区域鉴定。
Bioinformatics. 2021 Jan 29;36(21):5177-5186. doi: 10.1093/bioinformatics/btaa667.
4
RFPR-IDP: reduce the false positive rates for intrinsically disordered protein and region prediction by incorporating both fully ordered proteins and disordered proteins.RFPR-IDP:通过同时纳入完全有序的蛋白质和无序的蛋白质,降低内在无序蛋白质和区域预测的假阳性率。
Brief Bioinform. 2021 Mar 22;22(2):2000-2011. doi: 10.1093/bib/bbaa018.
5
TransDFL: Identification of Disordered Flexible Linkers in Proteins by Transfer Learning.基于迁移学习的蛋白质无序柔性连接子识别
Genomics Proteomics Bioinformatics. 2023 Apr;21(2):359-369. doi: 10.1016/j.gpb.2022.10.004. Epub 2022 Oct 19.
6
Local Disordered Region Sampling (LDRS) for ensemble modeling of proteins with experimentally undetermined or low confidence prediction segments.针对具有实验不确定或低置信度预测片段的蛋白质进行集成建模的局部无序区域采样(LDRS)。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad739.
7
Experimentally Determined Long Intrinsically Disordered Protein Regions Are Now Abundant in the Protein Data Bank.实验确定的长固有无序蛋白质区域现在在蛋白质数据库中大量存在。
Int J Mol Sci. 2020 Jun 24;21(12):4496. doi: 10.3390/ijms21124496.
8
IDP⁻CRF: Intrinsically Disordered Protein/Region Identification Based on Conditional Random Fields.IDP⁻CRF:基于条件随机场的无序蛋白/区域识别。
Int J Mol Sci. 2018 Aug 22;19(9):2483. doi: 10.3390/ijms19092483.
9
MemDis: Predicting Disordered Regions in Transmembrane Proteins.MemDis:预测跨膜蛋白中的无序区域。
Int J Mol Sci. 2021 Nov 12;22(22):12270. doi: 10.3390/ijms222212270.
10
Genome-scale prediction of proteins with long intrinsically disordered regions.具有长内在无序区域的蛋白质的全基因组规模预测。
Proteins. 2014 Jan;82(1):145-58. doi: 10.1002/prot.24348. Epub 2013 Sep 17.

引用本文的文献

1
FusionEncoder: identification of intrinsically disordered regions based on multi-feature fusion.融合编码器:基于多特征融合的内在无序区域识别
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf362.
2
IDP-EDL: enhancing intrinsically disordered protein prediction by combining protein language model and ensemble deep learning.IDP-EDL:通过结合蛋白质语言模型和集成深度学习增强内在无序蛋白质预测
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf182.
3
Computational Insights into Captopril's Inhibitory Potential Against MMP9 and LCN2 in Bladder Cancer: Implications for Therapeutic Application.
卡托普利对膀胱癌中基质金属蛋白酶9和脂质运载蛋白2抑制潜力的计算洞察:对治疗应用的启示
Cancer Inform. 2024 Sep 19;23:11769351241276759. doi: 10.1177/11769351241276759. eCollection 2024.
4
Assessment of Disordered Linker Predictions in the CAID2 Experiment.CAID2 实验中无序连接预测的评估。
Biomolecules. 2024 Feb 28;14(3):287. doi: 10.3390/biom14030287.
5
IDP-LM: Prediction of protein intrinsic disorder and disorder functions based on language models.IDP-LM:基于语言模型的蛋白质固有无序预测和无序功能预测。
PLoS Comput Biol. 2023 Nov 22;19(11):e1011657. doi: 10.1371/journal.pcbi.1011657. eCollection 2023 Nov.
6
Protein intrinsically disordered region prediction by combining neural architecture search and multi-objective genetic algorithm.通过结合神经结构搜索和多目标遗传算法进行蛋白质无规则区域预测。
BMC Biol. 2023 Sep 7;21(1):188. doi: 10.1186/s12915-023-01672-5.
7
CAID prediction portal: a comprehensive service for predicting intrinsic disorder and binding regions in proteins.CAID 预测门户:一个用于预测蛋白质中内源性无序区域和结合区域的综合服务。
Nucleic Acids Res. 2023 Jul 5;51(W1):W62-W69. doi: 10.1093/nar/gkad430.
8
DEPICTER2: a comprehensive webserver for intrinsic disorder and disorder function prediction.DEPICTER2:一个用于固有无序和无序功能预测的综合网络服务器。
Nucleic Acids Res. 2023 Jul 5;51(W1):W141-W147. doi: 10.1093/nar/gkad330.
9
iEnhancer-ELM: improve enhancer identification by extracting position-related multiscale contextual information based on enhancer language models.iEnhancer-ELM:基于增强子语言模型提取位置相关的多尺度上下文信息来改进增强子识别。
Bioinform Adv. 2023 Mar 25;3(1):vbad043. doi: 10.1093/bioadv/vbad043. eCollection 2023.
10
ProtGPT2 is a deep unsupervised language model for protein design.ProtGPT2 是一个用于蛋白质设计的深度无监督语言模型。
Nat Commun. 2022 Jul 27;13(1):4348. doi: 10.1038/s41467-022-32007-7.