• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PDAUG:一个基于 Galaxy 的肽库分析、可视化和机器学习建模工具集。

PDAUG: a Galaxy based toolset for peptide library analysis, visualization, and machine learning modeling.

机构信息

Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA.

Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA.

出版信息

BMC Bioinformatics. 2022 May 28;23(1):197. doi: 10.1186/s12859-022-04727-6.

DOI:10.1186/s12859-022-04727-6
PMID:35643441
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9148462/
Abstract

BACKGROUND

Computational methods based on initial screening and prediction of peptides for desired functions have proven to be effective alternatives to lengthy and expensive biochemical experimental methods traditionally utilized in peptide research, thus saving time and effort. However, for many researchers, the lack of expertise in utilizing programming libraries, access to computational resources, and flexible pipelines are big hurdles to adopting these advanced methods.

RESULTS

To address the above mentioned barriers, we have implemented the peptide design and analysis under Galaxy (PDAUG) package, a Galaxy-based Python powered collection of tools, workflows, and datasets for rapid in-silico peptide library analysis. In contrast to existing methods like standard programming libraries or rigid single-function web-based tools, PDAUG offers an integrated GUI-based toolset, providing flexibility to build and distribute reproducible pipelines and workflows without programming expertise. Finally, we demonstrate the usability of PDAUG in predicting anticancer properties of peptides using four different feature sets and assess the suitability of various ML algorithms.

CONCLUSION

PDAUG offers tools for peptide library generation, data visualization, built-in and public database peptide sequence retrieval, peptide feature calculation, and machine learning (ML) modeling. Additionally, this toolset facilitates researchers to combine PDAUG with hundreds of compatible existing Galaxy tools for limitless analytic strategies.

摘要

背景

基于初始筛选和预测具有所需功能的肽的计算方法已被证明是替代传统肽研究中冗长且昂贵的生化实验方法的有效方法,从而节省了时间和精力。然而,对于许多研究人员来说,缺乏利用编程库、计算资源和灵活的管道的专业知识是采用这些先进方法的主要障碍。

结果

为了解决上述问题,我们实现了 Galaxy 下的肽设计和分析 (PDAUG) 包,这是一个基于 Galaxy 的 Python 驱动的工具、工作流程和数据集集合,用于快速进行计算机肽文库分析。与现有的方法(如标准编程库或刚性单功能基于网络的工具)不同,PDAUG 提供了一个集成的基于 GUI 的工具集,无需编程专业知识即可提供构建和分发可重复使用的管道和工作流程的灵活性。最后,我们使用四个不同的特征集来展示 PDAUG 在预测肽的抗癌特性方面的可用性,并评估各种 ML 算法的适用性。

结论

PDAUG 提供了用于肽文库生成、数据可视化、内置和公共数据库肽序列检索、肽特征计算和机器学习 (ML) 建模的工具。此外,该工具集还便于研究人员将 PDAUG 与数百个兼容的现有 Galaxy 工具结合使用,以实现无限的分析策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/8c41bab31339/12859_2022_4727_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/3957be25e628/12859_2022_4727_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/f3bff3145f26/12859_2022_4727_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/c0d071451fac/12859_2022_4727_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/65ce15256943/12859_2022_4727_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/eed6cef4bf43/12859_2022_4727_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/8c41bab31339/12859_2022_4727_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/3957be25e628/12859_2022_4727_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/f3bff3145f26/12859_2022_4727_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/c0d071451fac/12859_2022_4727_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/65ce15256943/12859_2022_4727_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/eed6cef4bf43/12859_2022_4727_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ecc/9148462/8c41bab31339/12859_2022_4727_Fig6_HTML.jpg

相似文献

1
PDAUG: a Galaxy based toolset for peptide library analysis, visualization, and machine learning modeling.PDAUG:一个基于 Galaxy 的肽库分析、可视化和机器学习建模工具集。
BMC Bioinformatics. 2022 May 28;23(1):197. doi: 10.1186/s12859-022-04727-6.
2
modlAMP: Python for antimicrobial peptides.modlAMP:用于抗菌肽的Python
Bioinformatics. 2017 Sep 1;33(17):2753-2755. doi: 10.1093/bioinformatics/btx285.
3
Standardizing and simplifying analysis of peptide library data.标准化和简化肽库数据分析。
J Chem Inf Model. 2013 Feb 25;53(2):493-9. doi: 10.1021/ci300484q. Epub 2013 Feb 4.
4
SciPipe: A workflow library for agile development of complex and dynamic bioinformatics pipelines.SciPipe:一个用于敏捷开发复杂和动态生物信息学管道的工作流库。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz044.
5
Galaxy for open-source computational drug discovery solutions.星系:开源计算药物发现解决方案。
Expert Opin Drug Discov. 2023 Jun;18(6):579-590. doi: 10.1080/17460441.2023.2205122. Epub 2023 Apr 23.
6
Galaxy mothur Toolset (GmT): a user-friendly application for 16S rRNA gene sequencing analysis using mothur.Galaxy mothur 工具集(GmT):一个使用 mothur 进行 16S rRNA 基因测序分析的用户友好型应用程序。
Gigascience. 2019 Feb 1;8(2). doi: 10.1093/gigascience/giy166.
7
CycloPs: generating virtual libraries of cyclized and constrained peptides including nonnatural amino acids.CycloPs:生成包括非天然氨基酸在内的环化和约束肽的虚拟库。
J Chem Inf Model. 2011 Apr 25;51(4):829-36. doi: 10.1021/ci100431r. Epub 2011 Mar 24.
8
ballaxy: web services for structural bioinformatics.Ballaxy:用于结构生物信息学的网络服务。
Bioinformatics. 2015 Jan 1;31(1):121-2. doi: 10.1093/bioinformatics/btu574. Epub 2014 Sep 2.
9
MaD GUI: An Open-Source Python Package for Annotation and Analysis of Time-Series Data.MaD GUI:一个用于时间序列数据注释和分析的开源 Python 包。
Sensors (Basel). 2022 Aug 5;22(15):5849. doi: 10.3390/s22155849.
10
Disseminating Metaproteomic Informatics Capabilities and Knowledge Using the Galaxy-P Framework.使用Galaxy-P框架传播元蛋白质组学信息学能力和知识。
Proteomes. 2018 Jan 31;6(1):7. doi: 10.3390/proteomes6010007.

引用本文的文献

1
Large-scale classification of metagenomic samples: a comparative analysis of classical machine learning techniques vs a novel brain-inspired hyperdimensional computing approach.宏基因组样本的大规模分类:经典机器学习技术与新型脑启发式高维计算方法的比较分析
bioRxiv. 2025 Jul 7:2025.07.06.663394. doi: 10.1101/2025.07.06.663394.
2
Unveiling Encrypted Antimicrobial Peptides from Cephalopods' Salivary Glands: A Proteolysis-Driven Virtual Approach.揭示头足类动物唾液腺中的加密抗菌肽:一种蛋白水解驱动的虚拟方法。
ACS Omega. 2024 Oct 14;9(43):43353-43367. doi: 10.1021/acsomega.4c01959. eCollection 2024 Oct 29.
3
Therapeutic peptides for coronary artery diseases: in silico methods and current perspectives.

本文引用的文献

1
PepFun: Open Source Protocols for Peptide-Related Computational Analysis.PepFun:用于肽相关计算分析的开源协议。
Molecules. 2021 Mar 16;26(6):1664. doi: 10.3390/molecules26061664.
2
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2020 update.Galaxy 平台,用于实现可访问、可重现和协作的生物医学分析:2020 年更新。
Nucleic Acids Res. 2020 Jul 2;48(W1):W395-W402. doi: 10.1093/nar/gkaa434.
3
Design of a peptide-based subunit vaccine against novel coronavirus SARS-CoV-2.针对新型冠状病毒 SARS-CoV-2 的基于肽的亚单位疫苗设计。
治疗冠状动脉疾病的治疗性肽:计算方法和当前观点。
Amino Acids. 2024 May 31;56(1):37. doi: 10.1007/s00726-024-03397-3.
4
Virtual Screening of Peptide Libraries: The Search for Peptide-Based Therapeutics Using Computational Tools.虚拟筛选肽库:使用计算工具寻找基于肽的治疗方法。
Int J Mol Sci. 2024 Feb 1;25(3):1798. doi: 10.3390/ijms25031798.
5
ABP-Finder: A Tool to Identify Antibacterial Peptides and the Gram-Staining Type of Targeted Bacteria.ABP-Finder:一种识别抗菌肽及靶向细菌革兰氏染色类型的工具。
Antibiotics (Basel). 2022 Nov 26;11(12):1708. doi: 10.3390/antibiotics11121708.
6
Towards rational computational peptide design.迈向合理的计算肽设计。
Front Bioinform. 2022 Oct 21;2:1046493. doi: 10.3389/fbinf.2022.1046493. eCollection 2022.
Microb Pathog. 2020 Aug;145:104236. doi: 10.1016/j.micpath.2020.104236. Epub 2020 May 4.
4
Protein sequence information extraction and subcellular localization prediction with gapped k-Mer method.使用缺口 k-Mer 方法进行蛋白质序列信息提取和亚细胞定位预测。
BMC Bioinformatics. 2019 Dec 30;20(Suppl 22):719. doi: 10.1186/s12859-019-3232-4.
5
Peptide-Based Vaccines: Current Progress and Future Challenges.基于肽的疫苗:当前进展和未来挑战。
Chem Rev. 2020 Mar 25;120(6):3210-3229. doi: 10.1021/acs.chemrev.9b00472. Epub 2019 Dec 5.
6
PTPD: predicting therapeutic peptides by deep learning and word2vec.PTPD:深度学习和词向量预测治疗性肽。
BMC Bioinformatics. 2019 Sep 6;20(1):456. doi: 10.1186/s12859-019-3006-z.
7
ACPred: A Computational Tool for the Prediction and Analysis of Anticancer Peptides.ACPred:一种用于抗癌肽预测和分析的计算工具。
Molecules. 2019 May 22;24(10):1973. doi: 10.3390/molecules24101973.
8
A Comprehensive Review on Current Advances in Peptide Drug Development and Design.关于肽类药物研发与设计的最新进展的全面综述。
Int J Mol Sci. 2019 May 14;20(10):2383. doi: 10.3390/ijms20102383.
9
Intracellular Peptides in Cell Biology and Pharmacology.细胞生物学和药理学中的细胞内肽。
Biomolecules. 2019 Apr 16;9(4):150. doi: 10.3390/biom9040150.
10
Functional characterization of 3D protein structures informed by human genetic diversity.基于人类遗传多样性的三维蛋白质结构功能特征分析。
Proc Natl Acad Sci U S A. 2019 Apr 30;116(18):8960-8965. doi: 10.1073/pnas.1820813116. Epub 2019 Apr 15.