• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AFP-LSE:使用 k 间隔氨基酸对组成的潜在空间编码预测抗冻蛋白。

AFP-LSE: Antifreeze Proteins Prediction Using Latent Space Encoding of Composition of k-Spaced Amino Acid Pairs.

机构信息

Department of Computer Engineering, Chosun University, Gwangju, 61452, Republic of Korea.

Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea.

出版信息

Sci Rep. 2020 Apr 28;10(1):7197. doi: 10.1038/s41598-020-63259-2.

DOI:10.1038/s41598-020-63259-2
PMID:32345989
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7188683/
Abstract

Species living in extremely cold environments resist the freezing conditions through antifreeze proteins (AFPs). Apart from being essential proteins for various organisms living in sub-zero temperatures, AFPs have numerous applications in different industries. They possess very small resemblance to each other and cannot be easily identified using simple search algorithms such as BLAST and PSI-BLAST. Diverse AFPs found in fishes (Type I, II, III, IV and antifreeze glycoproteins (AFGPs)), are sub-types and show low sequence and structural similarity, making their accurate prediction challenging. Although several machine-learning methods have been proposed for the classification of AFPs, prediction methods that have greater reliability are required. In this paper, we propose a novel machine-learning-based approach for the prediction of AFP sequences using latent space learning through a deep auto-encoder method. For latent space pruning, we use the output of the auto-encoder with a deep neural network classifier to learn the non-linear mapping of the protein sequence descriptor and class label. The proposed method outperformed the existing methods, yielding excellent results in comparison. A comprehensive ablation study is performed, and the proposed method is evaluated in terms of widely used performance measures. In particular, the proposed method demonstrated a high Matthews correlation coefficient of 0.52, F-score of 0.49, and Youden's index of 0.81 on an independent test dataset, thereby outperforming the existing methods for AFP prediction.

摘要

生活在极寒环境中的物种通过抗冻蛋白(AFP)来抵抗冷冻条件。除了是生活在零度以下温度的各种生物体的必需蛋白外,AFP 在不同行业也有许多应用。它们彼此之间非常相似,无法使用 BLAST 和 PSI-BLAST 等简单的搜索算法轻易识别。在鱼类中发现的不同 AFP(I 型、II 型、III 型、IV 型和抗冻糖蛋白(AFGPs))是亚型,表现出低序列和结构相似性,使得它们的准确预测具有挑战性。尽管已经提出了几种用于 AFP 分类的机器学习方法,但需要更可靠的预测方法。在本文中,我们提出了一种新的基于机器学习的 AFP 序列预测方法,该方法通过深度自动编码器方法使用潜在空间学习。对于潜在空间修剪,我们使用自动编码器的输出与深度神经网络分类器一起学习蛋白质序列描述符和类标签的非线性映射。所提出的方法优于现有方法,取得了优异的比较结果。进行了全面的消融研究,并根据广泛使用的性能指标对所提出的方法进行了评估。特别是,所提出的方法在独立测试数据集上表现出高 0.52 的马修斯相关系数、0.49 的 F 分数和 0.81 的约登指数,从而优于现有的 AFP 预测方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/33039904a5d2/41598_2020_63259_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/e53ee29090cd/41598_2020_63259_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/2bbc8cbd0823/41598_2020_63259_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/02fdfbfed7dd/41598_2020_63259_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/9a928243d77e/41598_2020_63259_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/8392c31923a2/41598_2020_63259_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/33039904a5d2/41598_2020_63259_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/e53ee29090cd/41598_2020_63259_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/2bbc8cbd0823/41598_2020_63259_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/02fdfbfed7dd/41598_2020_63259_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/9a928243d77e/41598_2020_63259_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/8392c31923a2/41598_2020_63259_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86c7/7188683/33039904a5d2/41598_2020_63259_Fig6_HTML.jpg

相似文献

1
AFP-LSE: Antifreeze Proteins Prediction Using Latent Space Encoding of Composition of k-Spaced Amino Acid Pairs.AFP-LSE:使用 k 间隔氨基酸对组成的潜在空间编码预测抗冻蛋白。
Sci Rep. 2020 Apr 28;10(1):7197. doi: 10.1038/s41598-020-63259-2.
2
AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information.AFP-CMBPred:通过将共识序列扩展到多块进化信息来计算识别抗冻蛋白。
Comput Biol Med. 2021 Dec;139:105006. doi: 10.1016/j.compbiomed.2021.105006. Epub 2021 Nov 2.
3
RAFP-Pred: Robust Prediction of Antifreeze Proteins Using Localized Analysis of n-Peptide Compositions.RAFP-Pred:使用 n-肽组成的局部分析进行抗冻蛋白的稳健预测。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Jan-Feb;15(1):244-250. doi: 10.1109/TCBB.2016.2617337. Epub 2016 Oct 13.
4
An insight into the molecular basis for convergent evolution in fish antifreeze Proteins.鱼类抗冻蛋白趋同进化的分子基础研究进展
Comput Biol Med. 2013 Aug 1;43(7):817-21. doi: 10.1016/j.compbiomed.2013.04.013. Epub 2013 Apr 30.
5
AFP-Pred: A random forest approach for predicting antifreeze proteins from sequence-derived properties.AFP-Pred:一种基于序列衍生特性预测抗冻蛋白的随机森林方法。
J Theor Biol. 2011 Feb 7;270(1):56-62. doi: 10.1016/j.jtbi.2010.10.037. Epub 2010 Nov 4.
6
Helical antifreeze proteins have independently evolved in fishes on four occasions.螺旋形抗冻蛋白在鱼类中已经独立进化了四次。
PLoS One. 2013 Dec 6;8(12):e81285. doi: 10.1371/journal.pone.0081285. eCollection 2013.
7
Lateral transfer of a lectin-like antifreeze protein gene in fishes.鱼类中一种凝集素样抗冻蛋白基因的横向转移
PLoS One. 2008 Jul 9;3(7):e2616. doi: 10.1371/journal.pone.0002616.
8
Chou's pseudo amino acid composition improves sequence-based antifreeze protein prediction.周的伪氨基酸组成改进了基于序列的抗冻蛋白预测。
J Theor Biol. 2014 Sep 7;356:30-5. doi: 10.1016/j.jtbi.2014.04.006. Epub 2014 Apr 13.
9
TargetFreeze: Identifying Antifreeze Proteins via a Combination of Weights using Sequence Evolutionary Information and Pseudo Amino Acid Composition.TargetFreeze:通过结合使用序列进化信息和伪氨基酸组成的权重来鉴定抗冻蛋白
J Membr Biol. 2015 Dec;248(6):1005-14. doi: 10.1007/s00232-015-9811-z. Epub 2015 Jun 10.
10
Fish-Derived Antifreeze Proteins and Antifreeze Glycoprotein Exhibit a Different Ice-Binding Property with Increasing Concentration.鱼类来源的抗冻蛋白和抗冻糖蛋白随着浓度的增加表现出不同的冰结合特性。
Biomolecules. 2020 Mar 9;10(3):423. doi: 10.3390/biom10030423.

引用本文的文献

1
VirusImmu: a novel ensemble machine learning approach for viral immunogenicity prediction.VirusImmu:一种用于病毒免疫原性预测的新型集成机器学习方法。
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elaf008.
2
BERT-DomainAFP: Antifreeze protein recognition and classification model based on BERT and structural domain annotation.BERT-DomainAFP:基于BERT和结构域注释的抗冻蛋白识别与分类模型
iScience. 2025 Mar 6;28(4):112077. doi: 10.1016/j.isci.2025.112077. eCollection 2025 Apr 18.
3
PMTPred: machine-learning-based prediction of protein methyltransferases using the composition of k-spaced amino acid pairs.

本文引用的文献

1
The role of antifreeze proteins in the regulation of genes involved in the response of to cold.抗冻蛋白在调控参与对寒冷反应的基因中的作用。
3 Biotech. 2019 Sep;9(9):335. doi: 10.1007/s13205-019-1859-5. Epub 2019 Aug 17.
2
A brief review of applications of antifreeze proteins in cryopreservation and metabolic genetic engineering.抗冻蛋白在冷冻保存和代谢基因工程中的应用简述。
3 Biotech. 2019 Sep;9(9):329. doi: 10.1007/s13205-019-1861-y. Epub 2019 Aug 12.
3
Expression of Ice-Binding Proteins in Caenorhabditis elegans Improves the Survival Rate upon Cold Shock and during Freezing.
PMTPred:基于k间隔氨基酸对组成的蛋白质甲基转移酶的机器学习预测
Mol Divers. 2024 Aug;28(4):2301-2315. doi: 10.1007/s11030-024-10937-2. Epub 2024 Jul 21.
4
Analysis of Ice-Binding Protein Evolution.冰结合蛋白进化分析。
Methods Mol Biol. 2024;2730:219-229. doi: 10.1007/978-1-0716-3503-2_16.
5
Prediction of antifreeze proteins using machine learning.使用机器学习预测抗冻蛋白。
Sci Rep. 2022 Nov 30;12(1):20672. doi: 10.1038/s41598-022-24501-1.
6
Cold adaptation strategies in plants-An emerging role of epigenetics and antifreeze proteins to engineer cold resilient plants.植物的冷适应策略——表观遗传学和抗冻蛋白在培育抗寒植物中的新作用
Front Genet. 2022 Aug 25;13:909007. doi: 10.3389/fgene.2022.909007. eCollection 2022.
7
Antifreeze Proteins: Novel Applications and Navigation towards Their Clinical Application in Cryobanking.抗冻蛋白:新应用及在低温生物学中的临床应用探索
Int J Mol Sci. 2022 Feb 27;23(5):2639. doi: 10.3390/ijms23052639.
8
ECM-LSE: Prediction of Extracellular Matrix Proteins Using Deep Latent Space Encoding of k-Spaced Amino Acid Pairs.ECM-LSE:利用k间隔氨基酸对的深度潜在空间编码预测细胞外基质蛋白
Front Bioeng Biotechnol. 2021 Oct 14;9:752658. doi: 10.3389/fbioe.2021.752658. eCollection 2021.
9
AoP-LSE: Antioxidant Proteins Classification Using Deep Latent Space Encoding of Sequence Features.AoP-LSE:基于序列特征的深度潜在空间编码的抗氧化蛋白分类。
Curr Issues Mol Biol. 2021 Oct 9;43(3):1489-1501. doi: 10.3390/cimb43030105.
10
Prediction and analysis of antifreeze proteins.抗冻蛋白的预测与分析
Heliyon. 2021 Sep 8;7(9):e07953. doi: 10.1016/j.heliyon.2021.e07953. eCollection 2021 Sep.
在秀丽隐杆线虫中表达冰结合蛋白可提高冷休克和冷冻过程中的存活率。
Sci Rep. 2019 May 15;9(1):6246. doi: 10.1038/s41598-019-42650-8.
4
Molecular structure of a hyperactive antifreeze protein adsorbed to ice.超活性抗冻蛋白吸附在冰上的分子结构。
J Chem Phys. 2019 Apr 7;150(13):131101. doi: 10.1063/1.5090589.
5
DeepUbi: a deep learning framework for prediction of ubiquitination sites in proteins.DeepUbi:一种用于预测蛋白质中泛素化位点的深度学习框架。
BMC Bioinformatics. 2019 Feb 18;20(1):86. doi: 10.1186/s12859-019-2677-9.
6
A deep learning method to more accurately recall known lysine acetylation sites.一种更准确地召回已知赖氨酸乙酰化位点的深度学习方法。
BMC Bioinformatics. 2019 Jan 23;20(1):49. doi: 10.1186/s12859-019-2632-9.
7
Building up bioluminescence.增强生物发光。
Nat Methods. 2019 Jan;16(1):20. doi: 10.1038/s41592-018-0274-x.
8
Beetle, Dendroides canadensis, antifreeze proteins increased high temperature survivorship in transgenic fruit flies, Drosophila melanogaster.甲虫,加拿大一枝黄花,抗冻蛋白增加了转基因为转基因果蝇,黑腹果蝇的高温存活率。
J Insect Physiol. 2019 Jan;112:68-72. doi: 10.1016/j.jinsphys.2018.12.004. Epub 2018 Dec 15.
9
Ice-binding proteins from the fungus Antarctomyces psychrotrophicus possibly originate from two different bacteria through horizontal gene transfer.真菌南极冰霉中的冰结合蛋白可能通过水平基因转移来自两种不同的细菌。
FEBS J. 2019 Mar;286(5):946-962. doi: 10.1111/febs.14725. Epub 2018 Dec 28.
10
Efficient B-Mode Ultrasound Image Reconstruction From Sub-Sampled RF Data Using Deep Learning.基于深度学习的从欠采样 RF 数据进行高效 B 模式超声图像重建。
IEEE Trans Med Imaging. 2019 Feb;38(2):325-336. doi: 10.1109/TMI.2018.2864821. Epub 2018 Aug 10.