• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GibbsCluster:肽序列的无监督聚类和对齐。

GibbsCluster: unsupervised clustering and alignment of peptide sequences.

机构信息

Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, 1650 San Martín, Argentina.

Department of Bio and Health Informatics, Technical University of Denmark, DK-2800 Lyngby, Denmark.

出版信息

Nucleic Acids Res. 2017 Jul 3;45(W1):W458-W463. doi: 10.1093/nar/gkx248.

DOI:10.1093/nar/gkx248
PMID:28407089
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5570237/
Abstract

Receptor interactions with short linear peptide fragments (ligands) are at the base of many biological signaling processes. Conserved and information-rich amino acid patterns, commonly called sequence motifs, shape and regulate these interactions. Because of the properties of a receptor-ligand system or of the assay used to interrogate it, experimental data often contain multiple sequence motifs. GibbsCluster is a powerful tool for unsupervised motif discovery because it can simultaneously cluster and align peptide data. The GibbsCluster 2.0 presented here is an improved version incorporating insertion and deletions accounting for variations in motif length in the peptide input. In basic terms, the program takes as input a set of peptide sequences and clusters them into meaningful groups. It returns the optimal number of clusters it identified, together with the sequence alignment and sequence motif characterizing each cluster. Several parameters are available to customize cluster analysis, including adjustable penalties for small clusters and overlapping groups and a trash cluster to remove outliers. As an example application, we used the server to deconvolute multiple specificities in large-scale peptidome data generated by mass spectrometry. The server is available at http://www.cbs.dtu.dk/services/GibbsCluster-2.0.

摘要

受体与短线性肽片段(配体)的相互作用是许多生物信号转导过程的基础。保守且信息丰富的氨基酸模式,通常称为序列基序,决定并调节这些相互作用。由于受体-配体系统的特性或用于检测它的检测方法,实验数据通常包含多个序列基序。GibbsCluster 是一种强大的无监督基序发现工具,因为它可以同时对肽数据进行聚类和对齐。这里介绍的 GibbsCluster 2.0 是一个改进的版本,它考虑了插入和缺失,以适应输入肽中基序长度的变化。简而言之,该程序将一组肽序列作为输入,并将它们聚类为有意义的组。它返回识别的最佳聚类数量,以及每个聚类的序列比对和序列基序。有几个参数可用于自定义聚类分析,包括对小聚类和重叠组的可调节惩罚以及用于去除异常值的垃圾聚类。作为一个示例应用,我们使用该服务器从质谱生成的大规模肽组学数据中推断出多种特异性。该服务器可在 http://www.cbs.dtu.dk/services/GibbsCluster-2.0 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43e2/5570237/283d916e3344/gkx248fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43e2/5570237/0795866b3541/gkx248fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43e2/5570237/5aced0d96c43/gkx248fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43e2/5570237/283d916e3344/gkx248fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43e2/5570237/0795866b3541/gkx248fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43e2/5570237/5aced0d96c43/gkx248fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43e2/5570237/283d916e3344/gkx248fig3.jpg

相似文献

1
GibbsCluster: unsupervised clustering and alignment of peptide sequences.GibbsCluster:肽序列的无监督聚类和对齐。
Nucleic Acids Res. 2017 Jul 3;45(W1):W458-W463. doi: 10.1093/nar/gkx248.
2
Simultaneous alignment and clustering of peptide data using a Gibbs sampling approach.使用 Gibbs 采样方法同时对齐和聚类肽数据。
Bioinformatics. 2013 Jan 1;29(1):8-14. doi: 10.1093/bioinformatics/bts621. Epub 2012 Oct 24.
3
NNAlign: a platform to construct and evaluate artificial neural network models of receptor-ligand interactions.NNAlign:构建和评估受体-配体相互作用的人工神经网络模型的平台。
Nucleic Acids Res. 2017 Jul 3;45(W1):W344-W349. doi: 10.1093/nar/gkx276.
4
Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets.吊床:一种基于隐马尔可夫模型的肽聚类算法,用于在大型数据集中识别蛋白质相互作用共有基序。
Bioinformatics. 2016 Jan 1;32(1):9-16. doi: 10.1093/bioinformatics/btv522. Epub 2015 Sep 5.
5
MHCpLogics: an interactive machine learning-based tool for unsupervised data visualization and cluster analysis of immunopeptidomes.MHCpLogics:一种基于机器学习的交互式工具,用于免疫肽组学数据的无监督可视化和聚类分析。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae087.
6
Unsupervised HLA Peptidome Deconvolution Improves Ligand Prediction Accuracy and Predicts Cooperative Effects in Peptide-HLA Interactions.无监督的HLA肽组去卷积提高了配体预测准确性,并预测了肽-HLA相互作用中的协同效应。
J Immunol. 2016 Sep 15;197(6):2492-9. doi: 10.4049/jimmunol.1600808. Epub 2016 Aug 10.
7
NNAlign: a web-based prediction method allowing non-expert end-user discovery of sequence motifs in quantitative peptide data.NNAlign:一种基于网络的预测方法,允许非专业的终端用户在定量肽数据中发现序列基序。
PLoS One. 2011;6(11):e26781. doi: 10.1371/journal.pone.0026781. Epub 2011 Nov 2.
8
GuiTope: an application for mapping random-sequence peptides to protein sequences.GuiTope:一种将随机序列肽映射到蛋白质序列的应用程序。
BMC Bioinformatics. 2012 Jan 3;13:1. doi: 10.1186/1471-2105-13-1.
9
RevTrans: Multiple alignment of coding DNA from aligned amino acid sequences.RevTrans:从比对的氨基酸序列进行编码DNA的多序列比对。
Nucleic Acids Res. 2003 Jul 1;31(13):3537-9. doi: 10.1093/nar/gkg609.
10
transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.transAlign:利用氨基酸促进蛋白质编码DNA序列的多重比对。
BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.

引用本文的文献

1
Upstream open reading frame translation enhances immunogenic peptide presentation in mitotically arrested cancer cells.上游开放阅读框翻译增强有丝分裂停滞癌细胞中的免疫原性肽呈递。
Nat Commun. 2025 Aug 27;16(1):8008. doi: 10.1038/s41467-025-63405-2.
2
MHC1-TIP enables single-tube multimodal immunopeptidome profiling and uncovers intratumoral heterogeneity in antigen presentation.MHC1-TIP可实现单管多模态免疫肽组分析,并揭示抗原呈递中的肿瘤内异质性。
bioRxiv. 2025 Jul 21:2025.07.17.664894. doi: 10.1101/2025.07.17.664894.
3
Targeting infection-specific peptides in immunopeptidomics studies for vaccine target discovery.

本文引用的文献

1
Direct identification of clinically relevant neoepitopes presented on native human melanoma tissue by mass spectrometry.通过质谱直接鉴定天然人黑色素瘤组织上呈现的临床相关新表位。
Nat Commun. 2016 Nov 21;7:13404. doi: 10.1038/ncomms13404.
2
A machine learning strategy for predicting localization of post-translational modification sites in protein-protein interacting regions.一种用于预测蛋白质-蛋白质相互作用区域中翻译后修饰位点定位的机器学习策略。
BMC Bioinformatics. 2016 Aug 17;17(1):307. doi: 10.1186/s12859-016-1165-8.
3
Unsupervised HLA Peptidome Deconvolution Improves Ligand Prediction Accuracy and Predicts Cooperative Effects in Peptide-HLA Interactions.
在免疫肽组学研究中靶向感染特异性肽以发现疫苗靶点
J Exp Med. 2025 Oct 6;222(10). doi: 10.1084/jem.20250444. Epub 2025 Jul 21.
4
Deep Exploration of the Immunopeptidome of a Pancreatic Cancer Cell Line: Implications for Clinical Immunopeptidomics and Immunotherapy.胰腺癌细胞系免疫肽组的深入探索:对临床免疫肽组学和免疫治疗的意义
Mol Cell Proteomics. 2025 Jul 8;24(8):101030. doi: 10.1016/j.mcpro.2025.101030.
5
Evidence of focusing the MHC class I immunopeptidome by tapasin.通过TAP结合蛋白聚焦MHC I类免疫肽组的证据。
Front Immunol. 2025 May 8;16:1563789. doi: 10.3389/fimmu.2025.1563789. eCollection 2025.
6
Immunopeptidomics identified antigens for mRNA-lipid nanoparticle vaccines with alpha-galactosylceramide in multiple myeloma therapy.免疫肽组学鉴定了用于多发性骨髓瘤治疗中含α-半乳糖神经酰胺的mRNA-脂质纳米颗粒疫苗的抗原。
J Immunother Cancer. 2025 Apr 29;13(4):e010673. doi: 10.1136/jitc-2024-010673.
7
Investigative needle core biopsies support multimodal deep-data generation in glioblastoma.研究性针芯活检支持胶质母细胞瘤的多模态深度数据生成。
Nat Commun. 2025 Apr 28;16(1):3957. doi: 10.1038/s41467-025-58452-8.
8
Maximizing Immunopeptidomics-Based Bacterial Epitope Discovery by Multiple Search Engines and Rescoring.通过多搜索引擎和重新评分最大化基于免疫肽组学的细菌表位发现
J Proteome Res. 2025 Apr 4;24(4):2141-2151. doi: 10.1021/acs.jproteome.4c00864. Epub 2025 Mar 13.
9
PEPSeek-Mediated Identification of Novel Epitopes From Viral and Bacterial Pathogens and the Impact on Host Cell Immunopeptidomes.PEPSeek介导的病毒和细菌病原体新型表位鉴定及其对宿主细胞免疫肽组的影响
Mol Cell Proteomics. 2025 Apr;24(4):100937. doi: 10.1016/j.mcpro.2025.100937. Epub 2025 Mar 3.
10
Rational engineering of minimally immunogenic nucleases for gene therapy.用于基因治疗的最小免疫原性核酸酶的合理工程设计。
Nat Commun. 2025 Jan 2;16(1):105. doi: 10.1038/s41467-024-55522-1.
无监督的HLA肽组去卷积提高了配体预测准确性,并预测了肽-HLA相互作用中的协同效应。
J Immunol. 2016 Sep 15;197(6):2492-9. doi: 10.4049/jimmunol.1600808. Epub 2016 Aug 10.
4
NetMHCpan-3.0; improved prediction of binding to MHC class I molecules integrating information from multiple receptor and peptide length datasets.NetMHCpan-3.0;整合来自多个受体和肽长度数据集的信息,改进对与MHC I类分子结合的预测。
Genome Med. 2016 Mar 30;8(1):33. doi: 10.1186/s13073-016-0288-x.
5
High-sensitivity HLA class I peptidome analysis enables a precise definition of peptide motifs and the identification of peptides from cell lines and patients' sera.高灵敏度的HLA I类肽组分析能够精确定义肽基序,并从细胞系和患者血清中鉴定肽段。
Proteomics. 2016 May;16(10):1570-80. doi: 10.1002/pmic.201500445.
6
The Length Distribution of Class I-Restricted T Cell Epitopes Is Determined by Both Peptide Supply and MHC Allele-Specific Binding Preference.I类限制性T细胞表位的长度分布由肽供应和MHC等位基因特异性结合偏好共同决定。
J Immunol. 2016 Feb 15;196(4):1480-7. doi: 10.4049/jimmunol.1501721. Epub 2016 Jan 18.
7
Sampling From the Proteome to the Human Leukocyte Antigen-DR (HLA-DR) Ligandome Proceeds Via High Specificity.从蛋白质组到人类白细胞抗原-DR(HLA-DR)配体组的采样是通过高特异性进行的。
Mol Cell Proteomics. 2016 Apr;15(4):1412-23. doi: 10.1074/mcp.M115.055780. Epub 2016 Jan 13.
8
Analysis of Major Histocompatibility Complex (MHC) Immunopeptidomes Using Mass Spectrometry.使用质谱分析法分析主要组织相容性复合体(MHC)免疫肽组
Mol Cell Proteomics. 2015 Dec;14(12):3105-17. doi: 10.1074/mcp.O115.052431.
9
ELM 2016--data update and new functionality of the eukaryotic linear motif resource.ELM 2016——真核生物线性基序资源的数据更新与新功能
Nucleic Acids Res. 2016 Jan 4;44(D1):D294-300. doi: 10.1093/nar/gkv1291. Epub 2015 Nov 28.
10
Gapped sequence alignment using artificial neural networks: application to the MHC class I system.使用人工神经网络的缺口序列比对:在主要组织相容性复合体I类系统中的应用。
Bioinformatics. 2016 Feb 15;32(4):511-7. doi: 10.1093/bioinformatics/btv639. Epub 2015 Oct 29.