• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于半胱氨酸模式条形码的数据集过滤增强了机器学习辅助的芋螺毒液肽疗法解释。

Cysteine pattern barcoding-based dataset filtration enhances the machine learning-assisted interpretation of Conus venom peptide therapeutics.

作者信息

Bibi Rimsha, Qasmi Noshaba, Rashid Sajid

机构信息

National Center for Bioinformatics, Quaid-i-Azam University, Pakistan.

出版信息

PLoS One. 2025 Jul 11;20(7):e0327578. doi: 10.1371/journal.pone.0327578. eCollection 2025.

DOI:10.1371/journal.pone.0327578
PMID:40644439
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12250603/
Abstract

Crude cone snail venom is a rich source of bioactive compounds with significant therapeutic potential. In this study, we conducted a comprehensive analysis of 5,985 cone snail peptides across 82 Conus species to identify unique cysteine (Cys) patterns and associated frameworks. The classification of these Cys patterns, based on conserved framework combinations, enabled the generation of species-level pattern barcodes. These barcodes were then evaluated to assess the species correlations of individual sequences. By analyzing 151 known Conus peptide PDB files, we computed Cys disulfide linkages to assess overall stability profiles. Incorporating barcode data allowed us to filter the dataset and prepare it for machine learning (ML) processing. Random Forest (RF) modeling, a supervised learning technique, was used to predict the therapeutic potential of venom peptides. Feature extraction was based on known venom-derived approved peptide-based drugs. The dataset was split into a 70:30 train-test ratio. A total of 6,430 peptides (5,985 from cone snails and 445 from other venomous species) were used to evaluate model prediction capability. The proposed model achieved ideal accuracy (90.48%) in peptide therapeutic classification. Subsequent model outputs underwent further structural and binding pattern analysis against known targets, revealing significant similarities between the binding patterns of approved and novel peptides. The model's performance could be further enhanced by incorporating additional datasets and optimizing feature selection, potentially broadening its applicability to larger peptide datasets. Overall, this study underscores the potential of ML in advancing pharmacological research on diverse venom peptides.

摘要

粗制芋螺毒液是生物活性化合物的丰富来源,具有巨大的治疗潜力。在本研究中,我们对82种芋螺的5985种芋螺肽进行了全面分析,以确定独特的半胱氨酸(Cys)模式及相关框架。基于保守框架组合对这些Cys模式进行分类,能够生成物种水平的模式条形码。然后对这些条形码进行评估,以评估单个序列的物种相关性。通过分析151个已知的芋螺肽PDB文件,我们计算了Cys二硫键以评估整体稳定性概况。纳入条形码数据使我们能够筛选数据集并为机器学习(ML)处理做好准备。随机森林(RF)建模是一种监督学习技术,用于预测毒液肽的治疗潜力。特征提取基于已知的源自毒液的已批准的基于肽的药物。数据集按70:30的训练-测试比例划分。总共6430种肽(5985种来自芋螺,445种来自其他有毒物种)用于评估模型预测能力。所提出的模型在肽治疗分类中达到了理想的准确率(90.48%)。随后,针对已知靶点对模型输出进行了进一步的结构和结合模式分析,揭示了已批准肽和新型肽结合模式之间的显著相似性。通过纳入更多数据集和优化特征选择,模型的性能可以进一步提高,这可能会扩大其对更大肽数据集的适用性。总体而言,本研究强调了机器学习在推进对多种毒液肽的药理学研究方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/260fa375e2df/pone.0327578.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/bc2ab8f8825d/pone.0327578.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/9ab9f007a1c1/pone.0327578.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/d4acf3ae0730/pone.0327578.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/e6105eb3edef/pone.0327578.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/0217bc5c26d9/pone.0327578.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/c81481204688/pone.0327578.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/b5b9098c842a/pone.0327578.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/d8e674bf38b7/pone.0327578.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/050d64340a24/pone.0327578.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/b24ec99004b9/pone.0327578.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/71a4ce7185a0/pone.0327578.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/260fa375e2df/pone.0327578.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/bc2ab8f8825d/pone.0327578.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/9ab9f007a1c1/pone.0327578.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/d4acf3ae0730/pone.0327578.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/e6105eb3edef/pone.0327578.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/0217bc5c26d9/pone.0327578.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/c81481204688/pone.0327578.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/b5b9098c842a/pone.0327578.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/d8e674bf38b7/pone.0327578.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/050d64340a24/pone.0327578.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/b24ec99004b9/pone.0327578.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/71a4ce7185a0/pone.0327578.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/033f/12250603/260fa375e2df/pone.0327578.g012.jpg

相似文献

1
Cysteine pattern barcoding-based dataset filtration enhances the machine learning-assisted interpretation of Conus venom peptide therapeutics.基于半胱氨酸模式条形码的数据集过滤增强了机器学习辅助的芋螺毒液肽疗法解释。
PLoS One. 2025 Jul 11;20(7):e0327578. doi: 10.1371/journal.pone.0327578. eCollection 2025.
2
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
5
Prediction of Specificity of α-Conotoxins to Subtypes of Human Nicotinic Acetylcholine Receptors with Semi-supervised Machine Learning.基于半监督机器学习预测α-芋螺毒素对人源烟碱型乙酰胆碱受体亚型的特异性
ACS Chem Neurosci. 2025 Jun 18;16(12):2196-2207. doi: 10.1021/acschemneuro.4c00760. Epub 2025 May 29.
6
iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.iACP-DPNet:一种用于可解释抗癌肽识别的双池因果扩张卷积网络。
Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.
7
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
8
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
9
Deciphering Shared Gene Signatures and Immune Infiltration Characteristics Between Gestational Diabetes Mellitus and Preeclampsia by Integrated Bioinformatics Analysis and Machine Learning.通过综合生物信息学分析和机器学习破译妊娠期糖尿病和子痫前期之间共享的基因特征及免疫浸润特征
Reprod Sci. 2025 May 15. doi: 10.1007/s43032-025-01847-1.
10
Machine Learning Did Not Outperform Conventional Competing Risk Modeling to Predict Revision Arthroplasty.在预测翻修关节成形术方面,机器学习的表现并未优于传统的竞争风险模型。
Clin Orthop Relat Res. 2024 Aug 1;482(8):1472-1482. doi: 10.1097/CORR.0000000000003018. Epub 2024 Mar 12.

本文引用的文献

1
Leveraging machine learning models for peptide-protein interaction prediction.利用机器学习模型进行肽-蛋白质相互作用预测。
RSC Chem Biol. 2024 Mar 13;5(5):401-417. doi: 10.1039/d3cb00208j. eCollection 2024 May 8.
2
Deep learning in bioinformatics.生物信息学中的深度学习。
Turk J Biol. 2023 Dec 18;47(6):366-382. doi: 10.55730/1300-0152.2671. eCollection 2023.
3
De Novo Discovery of Cysteine Frameworks for Developing Multicyclic Peptide Libraries for Ligand Discovery.从头发现半胱氨酸框架,用于开发用于配体发现的多环肽文库。
J Am Chem Soc. 2023 Dec 27;145(51):28264-28275. doi: 10.1021/jacs.3c11856. Epub 2023 Dec 13.
4
Cysteine-free cone snail venom peptides: Classification of precursor proteins and identification of mature peptides.不含半胱氨酸的芋螺毒液肽:前体蛋白的分类及成熟肽的鉴定
J Pept Sci. 2024 Apr;30(4):e3554. doi: 10.1002/psc.3554. Epub 2023 Nov 27.
5
DrugBank 6.0: the DrugBank Knowledgebase for 2024.DrugBank 6.0:2024 年版 DrugBank 知识库。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1265-D1275. doi: 10.1093/nar/gkad976.
6
Developing anti-microbial peptide database version 1 to provide comprehensive and exhaustive resource of manually curated AMPs.开发抗菌肽数据库版本 1,提供全面、详尽的人工 curated AMPs 资源。
Sci Rep. 2023 Oct 19;13(1):17843. doi: 10.1038/s41598-023-45016-3.
7
Revolutionizing Medicinal Chemistry: The Application of Artificial Intelligence (AI) in Early Drug Discovery.变革药物化学:人工智能在早期药物发现中的应用。
Pharmaceuticals (Basel). 2023 Sep 6;16(9):1259. doi: 10.3390/ph16091259.
8
A Random Forest Model for Peptide Classification Based on Virtual Docking Data.基于虚拟对接数据的肽分类随机森林模型。
Int J Mol Sci. 2023 Jul 13;24(14):11409. doi: 10.3390/ijms241411409.
9
Native and Engineered Cyclic Disulfide-Rich Peptides as Drug Leads.天然和工程化的富含环二硫键的肽类作为药物先导物。
Molecules. 2023 Apr 3;28(7):3189. doi: 10.3390/molecules28073189.
10
Current Insights in the Mechanisms of Cobra Venom Cytotoxins and Their Complexes in Inducing Toxicity: Implications in Antivenom Therapy.当前对眼镜蛇毒液细胞毒素及其复合物诱导毒性的机制的深入了解:抗蛇毒治疗的意义。
Toxins (Basel). 2022 Dec 1;14(12):839. doi: 10.3390/toxins14120839.