• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过具有三种新型序列特征的深度神经网络识别内在无序蛋白质区域。

Identifying Intrinsically Disordered Protein Regions through a Deep Neural Network with Three Novel Sequence Features.

作者信息

Zhao Jiaxiang, Wang Zengke

机构信息

College of Electronic Information and Optical Engineering, Nankai University, Tianjin 300350, China.

出版信息

Life (Basel). 2022 Feb 26;12(3):345. doi: 10.3390/life12030345.

DOI:10.3390/life12030345
PMID:35330096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8950681/
Abstract

The fast, reliable, and accurate identification of IDPRs is essential, as in recent years it has come to be recognized more and more that IDPRs have a wide impact on many important physiological processes, such as molecular recognition and molecular assembly, the regulation of transcription and translation, protein phosphorylation, cellular signal transduction, etc. For the sake of cost-effectiveness, it is imperative to develop computational approaches for identifying IDPRs. In this study, a deep neural structure where a variant VGG19 is situated between two MLP networks is developed for identifying IDPRs. Furthermore, for the first time, three novel sequence features-i.e., persistent entropy and the probabilities associated with two and three consecutive amino acids of the protein sequence-are introduced for identifying IDPRs. The simulation results show that our neural structure either performs considerably better than other known methods or, when relying on a much smaller training set, attains a similar performance. Our deep neural structure, which exploits the VGG19 structure, is effective for identifying IDPRs. Furthermore, three novel sequence features-i.e., the persistent entropy and the probabilities associated with two and three consecutive amino acids of the protein sequence-could be used as valuable sequence features in the further development of identifying IDPRs.

摘要

快速、可靠且准确地识别内在无序蛋白质区域(IDPRs)至关重要,因为近年来人们越来越认识到,IDPRs对许多重要的生理过程有广泛影响,如分子识别与分子组装、转录与翻译调控、蛋白质磷酸化、细胞信号转导等。为了实现成本效益,开发用于识别IDPRs的计算方法势在必行。在本研究中,构建了一种深度神经结构,其中一个变体VGG19位于两个多层感知器(MLP)网络之间,用于识别IDPRs。此外,首次引入了三种新颖的序列特征,即持久熵以及与蛋白质序列中两个和三个连续氨基酸相关的概率,用于识别IDPRs。模拟结果表明,我们的神经结构要么比其他已知方法表现得好得多,要么在依赖小得多的训练集时能达到相似的性能。我们利用VGG19结构的深度神经结构对于识别IDPRs是有效的。此外,三种新颖的序列特征,即持久熵以及与蛋白质序列中两个和三个连续氨基酸相关的概率,可作为识别IDPRs进一步发展中有价值的序列特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e9e/8950681/1a36d2d7140a/life-12-00345-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e9e/8950681/52fe4d4b2064/life-12-00345-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e9e/8950681/d3d0354e9686/life-12-00345-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e9e/8950681/1a36d2d7140a/life-12-00345-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e9e/8950681/52fe4d4b2064/life-12-00345-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e9e/8950681/d3d0354e9686/life-12-00345-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e9e/8950681/1a36d2d7140a/life-12-00345-g003.jpg

相似文献

1
Identifying Intrinsically Disordered Protein Regions through a Deep Neural Network with Three Novel Sequence Features.通过具有三种新型序列特征的深度神经网络识别内在无序蛋白质区域。
Life (Basel). 2022 Feb 26;12(3):345. doi: 10.3390/life12030345.
2
Molecular Recognition Features in Zika Virus Proteome.寨卡病毒蛋白组中的分子识别特征。
J Mol Biol. 2018 Aug 3;430(16):2372-2388. doi: 10.1016/j.jmb.2017.10.018. Epub 2017 Nov 7.
3
The Structural and Functional Diversity of Intrinsically Disordered Regions in Transmembrane Proteins.跨膜蛋白中无规则区域的结构与功能多样性。
J Membr Biol. 2019 Oct;252(4-5):273-292. doi: 10.1007/s00232-019-00069-2. Epub 2019 May 28.
4
Targeting intrinsically disordered proteins in rational drug discovery.合理药物研发中针对内在无序蛋白的研究
Expert Opin Drug Discov. 2016;11(1):65-77. doi: 10.1517/17460441.2016.1107041. Epub 2015 Nov 7.
5
Proteins without unique 3D structures: biotechnological applications of intrinsically unstable/disordered proteins.没有独特三维结构的蛋白质:内在不稳定/无序蛋白质的生物技术应用
Biotechnol J. 2015 Mar;10(3):356-66. doi: 10.1002/biot.201400374. Epub 2014 Oct 6.
6
Does lack of secondary structure imply intrinsic disorder in proteins? A sequence analysis.蛋白质缺乏二级结构是否意味着其内在无序?一项序列分析。
Biochim Biophys Acta. 2014 Oct;1844(10):1827-34. doi: 10.1016/j.bbapap.2014.07.020. Epub 2014 Aug 8.
7
Dancing while self-eating: Protein intrinsic disorder in autophagy.自噬过程中的自我吞噬:蛋白质固有无序。
Prog Mol Biol Transl Sci. 2020;174:263-305. doi: 10.1016/bs.pmbts.2020.03.002. Epub 2020 Mar 31.
8
Intrinsic Disorder and Posttranslational Modifications: The Darker Side of the Biological Dark Matter.内在无序与翻译后修饰:生物暗物质的阴暗面
Front Genet. 2018 May 4;9:158. doi: 10.3389/fgene.2018.00158. eCollection 2018.
9
Portrait of the Intrinsically Disordered Side of the HTLV-1 Proteome.人类嗜T淋巴细胞病毒1型(HTLV-1)蛋白质组内在无序一面的描绘
ACS Omega. 2019 Jun 7;4(6):10003-10018. doi: 10.1021/acsomega.9b01017. eCollection 2019 Jun 30.
10
Prevalence and functionality of intrinsic disorder in human FG-nucleoporins.人类 FG 核孔蛋白中内源性无序的普遍性和功能。
Int J Biol Macromol. 2021 Apr 1;175:156-170. doi: 10.1016/j.ijbiomac.2021.01.218. Epub 2021 Feb 3.

本文引用的文献

1
IDP-Seq2Seq: identification of intrinsically disordered regions based on sequence to sequence learning.IDP-Seq2Seq:基于序列到序列学习的无规卷曲区域鉴定。
Bioinformatics. 2021 Jan 29;36(21):5177-5186. doi: 10.1093/bioinformatics/btaa667.
2
SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning.SPOT-Disorder2:通过集成深度学习提高蛋白质固有无序预测。
Genomics Proteomics Bioinformatics. 2019 Dec;17(6):645-656. doi: 10.1016/j.gpb.2019.01.004. Epub 2020 Mar 13.
3
RFPR-IDP: reduce the false positive rates for intrinsically disordered protein and region prediction by incorporating both fully ordered proteins and disordered proteins.
RFPR-IDP:通过同时纳入完全有序的蛋白质和无序的蛋白质,降低内在无序蛋白质和区域预测的假阳性率。
Brief Bioinform. 2021 Mar 22;22(2):2000-2011. doi: 10.1093/bib/bbaa018.
4
DisProt: intrinsic protein disorder annotation in 2020.DisProt:2020 年的内在蛋白无序注释。
Nucleic Acids Res. 2020 Jan 8;48(D1):D269-D276. doi: 10.1093/nar/gkz975.
5
Intrinsically disordered proteins and phenotypic switching: Implications in cancer.无规则卷曲蛋白质与表型转换:在癌症中的意义。
Prog Mol Biol Transl Sci. 2019;166:63-84. doi: 10.1016/bs.pmbts.2019.03.013. Epub 2019 Apr 11.
6
Disordered RNA chaperones can enhance nucleic acid folding via local charge screening.RNA 分子伴侣紊乱可通过局部电荷屏蔽增强核酸折叠。
Nat Commun. 2019 Jun 5;10(1):2453. doi: 10.1038/s41467-019-10356-0.
7
Improving protein disorder prediction by deep bidirectional long short-term memory recurrent neural networks.通过深度双向长短期记忆循环神经网络改进蛋白质无序预测。
Bioinformatics. 2017 Mar 1;33(5):685-692. doi: 10.1093/bioinformatics/btw678.
8
DisProt 7.0: a major update of the database of disordered proteins.DisProt 7.0:无序蛋白质数据库的重大更新。
Nucleic Acids Res. 2017 Jan 4;45(D1):D219-D227. doi: 10.1093/nar/gkw1056. Epub 2016 Nov 28.
9
Functional roles of transiently and intrinsically disordered regions within proteins.蛋白质中瞬时和内在无序区域的功能作用。
FEBS J. 2015 Apr;282(7):1182-9. doi: 10.1111/febs.13202. Epub 2015 Jan 29.
10
DISOPRED3: precise disordered region predictions with annotated protein-binding activity.DISOPRED3:具有注释蛋白质结合活性的精确无序区域预测
Bioinformatics. 2015 Mar 15;31(6):857-63. doi: 10.1093/bioinformatics/btu744. Epub 2014 Nov 12.