• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ECM-LSE:利用k间隔氨基酸对的深度潜在空间编码预测细胞外基质蛋白

ECM-LSE: Prediction of Extracellular Matrix Proteins Using Deep Latent Space Encoding of k-Spaced Amino Acid Pairs.

作者信息

Al-Saggaf Ubaid M, Usman Muhammad, Naseem Imran, Moinuddin Muhammad, Jiman Ahmad A, Alsaggaf Mohammed U, Alshoubaki Hitham K, Khan Shujaat

机构信息

Center of Excellence in Intelligent Engineering Systems, King Abdulaziz University, Jeddah, Saudi Arabia.

Electrical and Computer Engineering Department, King Abdulaziz University, Jeddah, Saudi Arabia.

出版信息

Front Bioeng Biotechnol. 2021 Oct 14;9:752658. doi: 10.3389/fbioe.2021.752658. eCollection 2021.

DOI:10.3389/fbioe.2021.752658
PMID:34722479
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8552119/
Abstract

Extracelluar matrix (ECM) proteins create complex networks of macromolecules which fill-in the extracellular spaces of living tissues. They provide structural support and play an important role in maintaining cellular functions. Identification of ECM proteins can play a vital role in studying various types of diseases. Conventional wet lab-based methods are reliable; however, they are expensive and time consuming and are, therefore, not scalable. In this research, we propose a sequence-based novel machine learning approach for the prediction of ECM proteins. In the proposed method, composition of k-spaced amino acid pair (CKSAAP) features are encoded into a classifiable latent space (LS) with the help of deep latent space encoding (LSE). A comprehensive ablation analysis is conducted for performance evaluation of the proposed method. Results are compared with other state-of-the-art methods on the benchmark dataset, and the proposed ECM-LSE approach has shown to comprehensively outperform the contemporary methods.

摘要

细胞外基质(ECM)蛋白形成复杂的大分子网络,填充活组织的细胞外空间。它们提供结构支持,并在维持细胞功能方面发挥重要作用。ECM蛋白的鉴定在研究各种疾病中起着至关重要的作用。传统的基于湿实验室的方法是可靠的;然而,它们昂贵且耗时,因此不可扩展。在本研究中,我们提出了一种基于序列的新型机器学习方法来预测ECM蛋白。在所提出的方法中,借助深度潜在空间编码(LSE)将k间隔氨基酸对(CKSAAP)特征的组成编码到可分类的潜在空间(LS)中。对所提出的方法进行了全面的消融分析以进行性能评估。在基准数据集上与其他现有方法进行了结果比较,所提出的ECM-LSE方法已显示出全面优于当代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/087c2dd53309/fbioe-09-752658-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/a3d9d74be423/fbioe-09-752658-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/7f9bb03d6e55/fbioe-09-752658-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/2615c0ed3398/fbioe-09-752658-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/40b7ec634063/fbioe-09-752658-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/087c2dd53309/fbioe-09-752658-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/a3d9d74be423/fbioe-09-752658-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/7f9bb03d6e55/fbioe-09-752658-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/2615c0ed3398/fbioe-09-752658-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/40b7ec634063/fbioe-09-752658-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bca/8552119/087c2dd53309/fbioe-09-752658-g005.jpg

相似文献

1
ECM-LSE: Prediction of Extracellular Matrix Proteins Using Deep Latent Space Encoding of k-Spaced Amino Acid Pairs.ECM-LSE:利用k间隔氨基酸对的深度潜在空间编码预测细胞外基质蛋白
Front Bioeng Biotechnol. 2021 Oct 14;9:752658. doi: 10.3389/fbioe.2021.752658. eCollection 2021.
2
AoP-LSE: Antioxidant Proteins Classification Using Deep Latent Space Encoding of Sequence Features.AoP-LSE:基于序列特征的深度潜在空间编码的抗氧化蛋白分类。
Curr Issues Mol Biol. 2021 Oct 9;43(3):1489-1501. doi: 10.3390/cimb43030105.
3
AFP-LSE: Antifreeze Proteins Prediction Using Latent Space Encoding of Composition of k-Spaced Amino Acid Pairs.AFP-LSE:使用 k 间隔氨基酸对组成的潜在空间编码预测抗冻蛋白。
Sci Rep. 2020 Apr 28;10(1):7197. doi: 10.1038/s41598-020-63259-2.
4
Prediction of mucin-type O-glycosylation sites in mammalian proteins using the composition of k-spaced amino acid pairs.利用k间隔氨基酸对的组成预测哺乳动物蛋白质中的粘蛋白型O-糖基化位点
BMC Bioinformatics. 2008 Feb 18;9:101. doi: 10.1186/1471-2105-9-101.
5
Mini-review: Recent advances in post-translational modification site prediction based on deep learning.小型综述:基于深度学习的翻译后修饰位点预测的最新进展
Comput Struct Biotechnol J. 2022 Jun 30;20:3522-3532. doi: 10.1016/j.csbj.2022.06.045. eCollection 2022.
6
Identification of Human Enzymes Using Amino Acid Composition and the Composition of -Spaced Amino Acid Pairs.利用氨基酸组成和 - 间隔氨基酸对组成鉴定人类酶。
Biomed Res Int. 2020 May 22;2020:9235920. doi: 10.1155/2020/9235920. eCollection 2020.
7
Prediction of lysine crotonylation sites by incorporating the composition of k-spaced amino acid pairs into Chou's general PseAAC.通过将k间隔氨基酸对的组成纳入周的广义伪氨基酸组成来预测赖氨酸巴豆酰化位点。
J Mol Graph Model. 2017 Oct;77:200-204. doi: 10.1016/j.jmgm.2017.08.020. Epub 2017 Aug 24.
8
Predicting lysine phosphoglycerylation with fuzzy SVM by incorporating k-spaced amino acid pairs into Chou׳s general PseAAC.通过将k间隔氨基酸对纳入周氏广义伪氨基酸组成,利用模糊支持向量机预测赖氨酸磷酸甘油化。
J Theor Biol. 2016 May 21;397:145-50. doi: 10.1016/j.jtbi.2016.02.020. Epub 2016 Feb 22.
9
Prediction of citrullination sites by incorporating k-spaced amino acid pairs into Chou's general pseudo amino acid composition.通过将 k 间隔氨基酸对纳入周元的通用伪氨基酸组成来预测瓜氨酸化位点。
Gene. 2018 Jul 20;664:78-83. doi: 10.1016/j.gene.2018.04.055. Epub 2018 Apr 23.
10
Prediction of palmitoylation sites using the composition of k-spaced amino acid pairs.使用 k 间隔氨基酸对组成预测棕榈酰化位点。
Protein Eng Des Sel. 2009 Nov;22(11):707-12. doi: 10.1093/protein/gzp055. Epub 2009 Sep 25.

引用本文的文献

1
VirusImmu: a novel ensemble machine learning approach for viral immunogenicity prediction.VirusImmu:一种用于病毒免疫原性预测的新型集成机器学习方法。
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elaf008.
2
LncSL: A Novel Stacked Ensemble Computing Tool for Subcellular Localization of lncRNA by Amino Acid-Enhanced Features and Two-Stage Automated Selection Strategy.LncSL:一种通过氨基酸增强特征和两阶段自动选择策略进行长链非编码RNA亚细胞定位的新型堆叠集成计算工具。
Int J Mol Sci. 2024 Dec 23;25(24):13734. doi: 10.3390/ijms252413734.
3
GBDT_KgluSite: An improved computational prediction model for lysine glutarylation sites based on feature fusion and GBDT classifier.

本文引用的文献

1
In situ Sub-Cellular Identification of Functional Amyloids in Bacteria and Archaea by Infrared Nanospectroscopy.利用红外纳米光谱对细菌和古菌中的功能性淀粉样纤维进行原位亚细胞鉴定。
Small Methods. 2021 Jun;5(6):e2001002. doi: 10.1002/smtd.202001002. Epub 2021 May 6.
2
Advances in Computational Methodologies for Classification and Sub-Cellular Locality Prediction of Non-Coding RNAs.计算方法在非编码 RNA 分类和亚细胞定位预测中的研究进展。
Int J Mol Sci. 2021 Aug 13;22(16):8719. doi: 10.3390/ijms22168719.
3
Prediction of Genetic Factors of Hyperthyroidism Based on Gene Interaction Network.
GBDT_KgluSite:一种基于特征融合和 GBDT 分类器的赖氨酸谷氨酰化位点改进计算预测模型。
BMC Genomics. 2023 Dec 11;24(1):765. doi: 10.1186/s12864-023-09834-z.
4
AoP-LSE: Antioxidant Proteins Classification Using Deep Latent Space Encoding of Sequence Features.AoP-LSE:基于序列特征的深度潜在空间编码的抗氧化蛋白分类。
Curr Issues Mol Biol. 2021 Oct 9;43(3):1489-1501. doi: 10.3390/cimb43030105.
基于基因相互作用网络的甲状腺功能亢进症遗传因素预测
Front Cell Dev Biol. 2021 Aug 2;9:700355. doi: 10.3389/fcell.2021.700355. eCollection 2021.
4
iDeepSubMito: identification of protein submitochondrial localization with deep learning.iDeepSubMito:基于深度学习的蛋白质亚线粒体定位鉴定。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab288.
5
Magnetic Alignment of Electrospun Fiber Segments Within a Hydrogel Composite Guides Cell Spreading and Migration Phenotype Switching.水凝胶复合材料中电纺纤维段的磁排列引导细胞铺展和迁移表型转换。
Front Bioeng Biotechnol. 2021 Jun 16;9:679165. doi: 10.3389/fbioe.2021.679165. eCollection 2021.
6
Protein Secondary Structure Prediction With a Reductive Deep Learning Method.基于还原深度学习方法的蛋白质二级结构预测
Front Bioeng Biotechnol. 2021 Jun 15;9:687426. doi: 10.3389/fbioe.2021.687426. eCollection 2021.
7
Prediction of unconventional protein secretion by exosomes.外泌体中非传统蛋白分泌的预测。
BMC Bioinformatics. 2021 Jun 16;22(1):333. doi: 10.1186/s12859-021-04219-z.
8
Win, Lose, or Tie: Mathematical Modeling of Ligand Competition at the Cell-Extracellular Matrix Interface.胜、负或平:细胞 - 细胞外基质界面配体竞争的数学建模
Front Bioeng Biotechnol. 2021 Apr 29;9:657244. doi: 10.3389/fbioe.2021.657244. eCollection 2021.
9
Fibronectin in Fracture Healing: Biological Mechanisms and Regenerative Avenues.骨折愈合中的纤连蛋白:生物学机制与再生途径
Front Bioeng Biotechnol. 2021 Apr 16;9:663357. doi: 10.3389/fbioe.2021.663357. eCollection 2021.
10
Prediction and collection of protein-metabolite interactions.预测和收集蛋白质-代谢物相互作用。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab014.