• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepSS2GO:基于二级结构的蛋白质功能预测

DeepSS2GO: protein function prediction from secondary structure.

作者信息

Song Fu V, Su Jiaqi, Huang Sixing, Zhang Neng, Li Kaiyue, Ni Ming, Liao Maofu

机构信息

Department of Chemical Biology, School of Life Sciences, Southern University of Science and Technology, Xueyuan Avenue, 518055, Shenzhen, China.

Gemini Data Japan, Kitaku Oujikamiya 1-11-11, 115-0043, Tokyo, Japan.

出版信息

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae196.

DOI:10.1093/bib/bbae196
PMID:38701416
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11066904/
Abstract

Predicting protein function is crucial for understanding biological life processes, preventing diseases and developing new drug targets. In recent years, methods based on sequence, structure and biological networks for protein function annotation have been extensively researched. Although obtaining a protein in three-dimensional structure through experimental or computational methods enhances the accuracy of function prediction, the sheer volume of proteins sequenced by high-throughput technologies presents a significant challenge. To address this issue, we introduce a deep neural network model DeepSS2GO (Secondary Structure to Gene Ontology). It is a predictor incorporating secondary structure features along with primary sequence and homology information. The algorithm expertly combines the speed of sequence-based information with the accuracy of structure-based features while streamlining the redundant data in primary sequences and bypassing the time-consuming challenges of tertiary structure analysis. The results show that the prediction performance surpasses state-of-the-art algorithms. It has the ability to predict key functions by effectively utilizing secondary structure information, rather than broadly predicting general Gene Ontology terms. Additionally, DeepSS2GO predicts five times faster than advanced algorithms, making it highly applicable to massive sequencing data. The source code and trained models are available at https://github.com/orca233/DeepSS2GO.

摘要

预测蛋白质功能对于理解生物生命过程、预防疾病以及开发新的药物靶点至关重要。近年来,基于序列、结构和生物网络的蛋白质功能注释方法得到了广泛研究。尽管通过实验或计算方法获得蛋白质的三维结构可提高功能预测的准确性,但高通量技术测序的蛋白质数量庞大,这带来了巨大挑战。为解决这一问题,我们引入了一种深度神经网络模型DeepSS2GO(二级结构到基因本体)。它是一种结合了二级结构特征以及一级序列和同源性信息的预测器。该算法巧妙地将基于序列信息的速度与基于结构特征的准确性相结合,同时精简了一级序列中的冗余数据,并绕过了三级结构分析耗时的难题。结果表明,其预测性能超越了现有最先进的算法。它能够通过有效利用二级结构信息来预测关键功能,而不是宽泛地预测一般的基因本体术语。此外,DeepSS2GO的预测速度比先进算法快五倍,使其非常适用于海量测序数据。源代码和训练模型可在https://github.com/orca233/DeepSS2GO获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/0ecb6c5c803f/bbae196f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/6edd16b7f0d5/bbae196f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/4bf42e9d271a/bbae196f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/2200c98ddeed/bbae196f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/0ecb6c5c803f/bbae196f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/6edd16b7f0d5/bbae196f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/4bf42e9d271a/bbae196f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/2200c98ddeed/bbae196f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd5/11066904/0ecb6c5c803f/bbae196f4.jpg

相似文献

1
DeepSS2GO: protein function prediction from secondary structure.DeepSS2GO:基于二级结构的蛋白质功能预测
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae196.
2
ProtGO: universal protein function prediction utilizing multi-modal gene ontology knowledge.ProtGO:利用多模态基因本体知识进行通用蛋白质功能预测
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf390.
3
Short-Term Memory Impairment短期记忆障碍
4
TransPTM: a transformer-based model for non-histone acetylation site prediction.TransPTM:一种基于转换器的非组蛋白乙酰化位点预测模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae219.
5
Multi-stage attention-based extraction and fusion of protein sequence and structural features for protein function prediction.基于多阶段注意力机制的蛋白质序列与结构特征提取及融合用于蛋白质功能预测
Bioinformatics. 2025 Jun 26. doi: 10.1093/bioinformatics/btaf374.
6
Predicting Affinity Through Homology (PATH): Interpretable binding affinity prediction with persistent homology.通过同源性预测亲和力(PATH):利用持久同源性进行可解释的结合亲和力预测。
PLoS Comput Biol. 2025 Jun 27;21(6):e1013216. doi: 10.1371/journal.pcbi.1013216. eCollection 2025 Jun.
7
RMDNet: RNA-aware dung beetle optimization-based multi-branch integration network for RNA-protein binding sites prediction.RMDNet:基于RNA感知蜣螂优化算法的多分支整合网络用于RNA-蛋白质结合位点预测
BMC Bioinformatics. 2025 Jul 11;26(1):176. doi: 10.1186/s12859-025-06197-y.
8
Predicting Affinity Through Homology (PATH): Interpretable Binding Affinity Prediction with Persistent Homology.通过同源性预测亲和力(PATH):基于持久同源性的可解释结合亲和力预测
bioRxiv. 2024 Oct 21:2023.11.16.567384. doi: 10.1101/2023.11.16.567384.
9
Unveiling the evolution of policies for enhancing protein structure predictions: A comprehensive analysis.揭示增强蛋白质结构预测政策的演变:全面分析。
Comput Biol Med. 2024 Sep;179:108815. doi: 10.1016/j.compbiomed.2024.108815. Epub 2024 Jul 11.
10
DrugTar improves druggability prediction by integrating large language models and gene ontologies.DrugTar通过整合大语言模型和基因本体来改善药物可及性预测。
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf360.

引用本文的文献

1
CPE-Pro: A Structure-Sensitive Deep Learning Method for Protein Representation and Origin Evaluation.CPE-Pro:一种用于蛋白质表征和起源评估的结构敏感深度学习方法。
Interdiscip Sci. 2025 Jun 8. doi: 10.1007/s12539-025-00732-4.
2
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
3
Prediction of new candidate proteins and analysis of sub-modules and protein hubs associated with seed development in rice (Oryza sativa) using an ensemble network-based systems biology approach.

本文引用的文献

1
Conformational ensembles of the human intrinsically disordered proteome.人类内在无序蛋白质组的构象集合
Nature. 2024 Feb;626(8000):897-904. doi: 10.1038/s41586-023-07004-5. Epub 2024 Jan 31.
2
Current successes and remaining challenges in protein function prediction.蛋白质功能预测的当前成果与尚存挑战
Front Bioinform. 2023 Jul 27;3:1222182. doi: 10.3389/fbinf.2023.1222182. eCollection 2023.
3
MMSMAPlus: a multi-view multi-scale multi-attention embedding model for protein function prediction.MMSMAPlus:一种用于蛋白质功能预测的多视角多尺度多注意力嵌入模型。
利用基于集成网络的系统生物学方法预测水稻(Oryza sativa)新的候选蛋白,并分析与种子发育相关的子模块和蛋白枢纽。
BMC Plant Biol. 2025 May 8;25(1):604. doi: 10.1186/s12870-025-06595-7.
4
DeepGOMeta for functional insights into microbial communities using deep learning-based protein function prediction.使用基于深度学习的蛋白质功能预测对微生物群落进行功能洞察的DeepGOMeta
Sci Rep. 2024 Dec 30;14(1):31813. doi: 10.1038/s41598-024-82956-w.
5
DeepEnzyme: a robust deep learning model for improved enzyme turnover number prediction by utilizing features of protein 3D-structures.DeepEnzyme:一种强大的深度学习模型,通过利用蛋白质 3D 结构的特征,提高酶转化数预测的准确性。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae409.
6
A comprehensive review and comparison of existing computational methods for protein function prediction.蛋白质功能预测现有计算方法的综合回顾与比较。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae289.
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad201.
4
Fast and accurate protein function prediction from sequence through pretrained language model and homology-based label diffusion.通过预训练语言模型和基于同源性的标签扩散,从序列快速准确地预测蛋白质功能。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad117.
5
Large language models generate functional protein sequences across diverse families.大型语言模型可生成不同家族的功能性蛋白质序列。
Nat Biotechnol. 2023 Aug;41(8):1099-1106. doi: 10.1038/s41587-022-01618-2. Epub 2023 Jan 26.
6
Pseudo-Isolated α-Helix Platform for the Recognition of Deep and Narrow Targets.用于识别深而窄目标的拟孤立 α-螺旋平台。
J Am Chem Soc. 2022 Aug 31;144(34):15519-15528. doi: 10.1021/jacs.2c03858. Epub 2022 Aug 16.
7
In pursuit of next-generation therapeutics: Antimicrobial peptides against superbugs, their sources, mechanism of action, nanotechnology-based delivery, and clinical applications.追求下一代治疗方法:针对超级细菌的抗菌肽、它们的来源、作用机制、基于纳米技术的递药系统以及临床应用。
Int J Biol Macromol. 2022 Oct 1;218:135-156. doi: 10.1016/j.ijbiomac.2022.07.103. Epub 2022 Jul 20.
8
Understudied proteins: opportunities and challenges for functional proteomics.研究不足的蛋白质:功能蛋白质组学面临的机遇与挑战
Nat Methods. 2022 Jul;19(7):774-779. doi: 10.1038/s41592-022-01454-x.
9
Reaching alignment-profile-based accuracy in predicting protein secondary and tertiary structural properties without alignment.无需对齐即可达到基于对齐轮廓的预测蛋白质二级和三级结构性质的准确性。
Sci Rep. 2022 May 9;12(1):7607. doi: 10.1038/s41598-022-11684-w.
10
Using deep learning to annotate the protein universe.利用深度学习标注蛋白质宇宙。
Nat Biotechnol. 2022 Jun;40(6):932-937. doi: 10.1038/s41587-021-01179-w. Epub 2022 Feb 21.