Statistical analysis of genomic protein family and domain controlled annotations for functional investigation of classified gene lists.

Authors

Masseroli Marco, Bellistri Elisa, Franceschini Andrea, Pinciroli Francesco

Affiliation

Dipartimento di Elettronica e Informazione, Politecnico di Milano, piazza Leonardo da Vinci 32, 20133 Milano, Italy.

Publication

BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S14. doi: 10.1186/1471-2105-8-S1-S14.

DOI:10.1186/1471-2105-8-S1-S14
PMID:17430558
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1885843/
Abstract

BACKGROUND

The growing body of protein family and domain based annotations constitutes important information for understanding protein functions and gaining insight into relations among the genes that encode them. To enable analysis of gene proteomic annotations, we implemented novel modules within GFINDer, a Web system we previously developed that dynamically aggregates functional and phenotypic annotations of user-uploaded gene lists and supports their statistical analysis and mining.

RESULTS

Exploiting protein information in the Pfam and InterPro databanks, we developed and added to GFINDer original modules specifically devoted to the exploration and analysis of functional signatures of gene protein products. They allow annotating numerous user-classified nucleotide sequence identifiers with controlled information on related protein families, domains and functional sites, classifying them according to such protein annotation categories, and statistically analyzing the obtained classifications. In particular, when the uploaded nucleotide sequence identifiers are subdivided into classes, the Statistics Protein Families&Domains module estimates the relevance of Pfam or InterPro controlled annotations for the uploaded genes by highlighting protein signatures that are significantly over-represented within user-defined classes of genes. In addition, the Logistic Regression module identifies the protein functional signatures that best explain the considered gene classification.
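The over-representation idea described above can be illustrated with a minimal sketch. The abstract does not say which statistical test GFINDer applies, so the one-sided hypergeometric test used here is an assumption, and the function names, gene identifiers and toy annotations are hypothetical illustration data rather than GFINDer's API or results.

```python
from math import comb


def hypergeom_enrichment_p(k, n, K, N):
    """Return P(X >= k) for X ~ Hypergeometric(N, K, n).

    N: total uploaded genes, K: genes carrying the annotation,
    n: genes in the class of interest, k: annotated genes in that class.
    """
    upper = min(n, K)
    total = comb(N, n)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


def enriched_signatures(gene_class, gene_annots, target_class, alpha=0.05):
    """List annotations over-represented in target_class (assumed test, no
    multiple-testing correction)."""
    genes = list(gene_class)
    N = len(genes)
    in_class = {g for g in genes if gene_class[g] == target_class}
    n = len(in_class)
    all_terms = set().union(*(gene_annots.get(g, set()) for g in genes))
    results = []
    for term in sorted(all_terms):
        carriers = {g for g in genes if term in gene_annots.get(g, set())}
        K = len(carriers)
        k = len(carriers & in_class)
        p = hypergeom_enrichment_p(k, n, K, N)
        if p <= alpha:
            results.append((term, k, K, p))
    return results


# Hypothetical toy input: 10 genes, 4 of them in the user-defined class "up".
gene_class = {f"g{i}": ("up" if i <= 4 else "down") for i in range(1, 11)}
gene_annots = {f"g{i}": set() for i in range(1, 11)}
for g in ("g1", "g2", "g3", "g4"):
    gene_annots[g].add("PF00069")  # carried only by the "up" genes
for g in ("g1", "g2", "g5", "g6", "g7"):
    gene_annots[g].add("PF00001")  # spread across both classes

hits = enriched_signatures(gene_class, gene_annots, "up")
for term, k, K, p in hits:
    print(f"{term}: {k}/{K} annotated genes in class, p = {p:.5f}")
```

In a real analysis many annotation terms are tested at once, so the raw p-values would normally be adjusted for multiple testing before any signature is reported as significant.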

CONCLUSION

The novel GFINDer modules provide genomic protein family and domain analyses that support better functional interpretation of gene classes, for instance those defined through statistical and clustering analyses of gene expression results from microarray experiments. They can hence help in understanding fundamental biological processes and complex cellular mechanisms influenced by protein domain composition, and contribute to unveiling new biomedical knowledge about the encoding genes.
