PANDORA：通过注释的分层集成分析蛋白质和肽组。

PANDORA: analysis of protein and peptide sets through the hierarchical integration of annotations.

机构信息

School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel.

出版信息

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W84-9. doi: 10.1093/nar/gkq320. Epub 2010 May 5.

DOI:10.1093/nar/gkq320

PMID:20444873

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2896089/

Abstract

Derivation of biological meaning from large sets of proteins or genes is a frequent task in genomic and proteomic studies. Such sets often arise from experimental methods including large-scale gene expression experiments and mass spectrometry (MS) proteomics. Large sets of genes or proteins are also the outcome of computational methods such as BLAST search and homology-based classifications. We have developed the PANDORA web server, which functions as a platform for the advanced biological analysis of sets of genes, proteins, or proteolytic peptides. First, the input set is mapped to a set of corresponding proteins. Then, an analysis of the protein set produces a graph-based hierarchy which highlights intrinsic relations amongst biological subsets, in light of their different annotations from multiple annotation resources. PANDORA integrates a large collection of annotation sources (GO, UniProt Keywords, InterPro, Enzyme, SCOP, CATH, Gene-3D, NCBI taxonomy and more) that comprise approximately 200,000 different annotation terms associated with approximately 3.2 million sequences from UniProtKB. Statistical enrichment based on a binomial approximation of the hypergeometric distribution and corrected for multiple hypothesis tests is calculated using several background sets, including major gene-expression DNA-chip platforms. Users can also visualize either standard or user-defined binary and quantitative properties alongside the proteins. PANDORA 4.2 is available at http://www.pandora.cs.huji.ac.il.

摘要

从大量蛋白质或基因中推导出生物学意义是基因组学和蛋白质组学研究中的常见任务。这些集合通常来自于实验方法，包括大规模基因表达实验和质谱（MS）蛋白质组学。大型基因或蛋白质集合也是 BLAST 搜索和基于同源性分类等计算方法的结果。我们开发了 PANDORA 网络服务器，它是用于对基因、蛋白质或蛋白水解肽集合进行高级生物学分析的平台。首先，将输入集合映射到一组相应的蛋白质。然后，对蛋白质集合的分析会生成基于图的层次结构，根据来自多个注释资源的不同注释，突出生物学子集之间的内在关系。PANDORA 集成了大量注释源（GO、UniProt Keywords、InterPro、Enzyme、SCOP、CATH、Gene-3D、NCBI 分类法等），其中包含大约 200,000 个不同的注释术语，这些术语与来自 UniProtKB 的大约 320 万个序列相关联。使用几种背景集（包括主要基因表达 DNA 芯片平台），基于超几何分布的二项式逼近并针对多重假设检验进行校正，计算基于泊松分布的统计富集。用户还可以与蛋白质一起可视化标准或用户定义的二进制和定量属性。PANDORA 4.2 可在 http://www.pandora.cs.huji.ac.il 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad55/2896089/2fbb96041ba6/gkq320f1.jpg

相似文献

PANDORA: analysis of protein and peptide sets through the hierarchical integration of annotations.

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W84-9. doi: 10.1093/nar/gkq320. Epub 2010 May 5.

PANDORA: keyword-based analysis of protein sets by integration of annotation sources.

Nucleic Acids Res. 2003 Oct 1;31(19):5617-26. doi: 10.1093/nar/gkg769.

Columba: an integrated database of proteins, structures, and annotations.

BMC Bioinformatics. 2005 Mar 31;6:81. doi: 10.1186/1471-2105-6-81.

Software tool for researching annotations of proteins: open-source protein annotation software with data visualization.

Anal Chem. 2009 Dec 1;81(23):9819-23. doi: 10.1021/ac901335x.

piNET: a versatile web platform for downstream analysis and visualization of proteomics data.

Nucleic Acids Res. 2020 Jul 2;48(W1):W85-W93. doi: 10.1093/nar/gkaa436.

GeneTools--application for functional annotation and statistical hypothesis testing.

BMC Bioinformatics. 2006 Oct 24;7:470. doi: 10.1186/1471-2105-7-470.

ProtoNet 6.0: organizing 10 million protein sequences in a compact hierarchical family tree.

Nucleic Acids Res. 2012 Jan;40(Database issue):D313-20. doi: 10.1093/nar/gkr1027. Epub 2011 Nov 25.

Analysis of the tryptic search space in UniProt databases.

Proteomics. 2015 Jan;15(1):48-57. doi: 10.1002/pmic.201400227. Epub 2014 Dec 3.

The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D262-6. doi: 10.1093/nar/gkh021.

ISPIDER Central: an integrated database web-server for proteomics.

Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W485-90. doi: 10.1093/nar/gkn196. Epub 2008 Apr 25.

引用本文的文献

off-target profiling for enhanced drug safety assessment.

Acta Pharm Sin B. 2024 Jul;14(7):2927-2941. doi: 10.1016/j.apsb.2024.03.002. Epub 2024 Mar 6.

ProtoBug: functional families from the complete proteomes of insects.

Database (Oxford). 2015 Apr 24;2015:bau122. doi: 10.1093/database/bau122. Print 2015.

miRror-Suite: decoding coordinated regulation by microRNAs.

Database (Oxford). 2014 Jun 6;2014. doi: 10.1093/database/bau043. Print 2014.

NeuroPID: a classifier of neuropeptide precursors.

Nucleic Acids Res. 2014 Jul;42(Web Server issue):W182-6. doi: 10.1093/nar/gku363. Epub 2014 May 3.

Functional inference by ProtoNet family tree: the uncharacterized proteome of Daphnia pulex.

BMC Bioinformatics. 2013;14 Suppl 3(Suppl 3):S11. doi: 10.1186/1471-2105-14-S3-S11. Epub 2013 Feb 28.

ProtoNet 6.0: organizing 10 million protein sequences in a compact hierarchical family tree.

Nucleic Acids Res. 2012 Jan;40(Database issue):D313-20. doi: 10.1093/nar/gkr1027. Epub 2011 Nov 25.

本文引用的文献

VisANT 3.5: multi-scale network visualization, analysis and inference based on the gene ontology.

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W115-21. doi: 10.1093/nar/gkp406. Epub 2009 May 21.

Protein function annotation by homology-based inference.

Genome Biol. 2009 Feb 2;10(2):207. doi: 10.1186/gb-2009-10-2-207.

The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists.

Genome Biol. 2007;8(9):R183. doi: 10.1186/gb-2007-8-9-r183.

Functional annotation prediction: all for one and one for all.

Protein Sci. 2006 Jun;15(6):1557-62. doi: 10.1110/ps.062185706. Epub 2006 May 2.

Mining sequence annotation databanks for association patterns.

Bioinformatics. 2005 Nov 1;21 Suppl 3:iii49-57. doi: 10.1093/bioinformatics/bti1206.

A proteomic survey of rat cerebral cortical synaptosomes.

Proteomics. 2005 May;5(8):2177-201. doi: 10.1002/pmic.200401102.

ProtoNet 4.0: a hierarchical classification of one million protein sequences.

Nucleic Acids Res. 2005 Jan 1;33(Database issue):D216-8. doi: 10.1093/nar/gki007.

GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies.

BMC Bioinformatics. 2004 Feb 18;5:16. doi: 10.1186/1471-2105-5-16.

The Gene Ontology (GO) database and informatics resource.

Nucleic Acids Res. 2004 Jan 1;32(Database issue):D258-61. doi: 10.1093/nar/gkh036.

Identifying biological themes within lists of genes with EASE.

Genome Biol. 2003;4(10):R70. doi: 10.1186/gb-2003-4-10-r70. Epub 2003 Sep 11.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

PANDORA：通过注释的分层集成分析蛋白质和肽组。

PANDORA: analysis of protein and peptide sets through the hierarchical integration of annotations.

机构信息

School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel.

出版信息

Nucleic Acids Res. 2010 Jul;38(Web Server issue):W84-9. doi: 10.1093/nar/gkq320. Epub 2010 May 5.

DOI:10.1093/nar/gkq320

PMID:20444873

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2896089/

Abstract

摘要

PANDORA：通过注释的分层集成分析蛋白质和肽组。

PANDORA: analysis of protein and peptide sets through the hierarchical integration of annotations.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

PANDORA：通过注释的分层集成分析蛋白质和肽组。

PANDORA: analysis of protein and peptide sets through the hierarchical integration of annotations.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献