• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估用于分析细菌基因表达数据的基因集的一致性。

Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.

机构信息

Department of Mathematics, Statistics and Computer Science, Dordt College, Sioux Center, IA 51250, USA.

出版信息

BMC Bioinformatics. 2012 Aug 8;13:193. doi: 10.1186/1471-2105-13-193.

DOI:10.1186/1471-2105-13-193
PMID:22873695
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3462729/
Abstract

BACKGROUND

Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed.

RESULTS

We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size.

CONCLUSIONS

Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

摘要

背景

为了得出有意义的生物学结论,对全基因组表达数据进行统计分析需要基因的功能信息。基因本体论(GO)和京都基因与基因组百科全书(KEGG)是功能分组基因集的常用来源。对于细菌,SEED 和 MicrobesOnline 提供了替代的、互补的基因集来源。迄今为止,尚未对这些资源获得的数据进行全面评估。

结果

我们定义了一系列与基因表达数据最常见的统计分析类别直接相关的基因集一致性指标,然后对 17 种不同细菌的 3581 个 Affymetrix®基因表达数组进行了全面分析。我们发现,无论基因集的大小如何,来自 GO 和 KEGG 的基因集的一致性都低于来自 SEED 和 MicrobesOnline 的基因集。

结论

尽管在细菌基因表达数据分析中广泛使用 GO 和 KEGG 基因集,但 SEED 和 MicrobesOnline 为各种统计分析提供了更一致的基因集。在细菌基因表达数据分析中增加对 SEED 和 MicrobesOnline 基因集的使用可能会提高统计能力和表达数据的实用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff54/3462729/f9b8f27240ec/1471-2105-13-193-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff54/3462729/f9b8f27240ec/1471-2105-13-193-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff54/3462729/f9b8f27240ec/1471-2105-13-193-1.jpg

相似文献

1
Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.评估用于分析细菌基因表达数据的基因集的一致性。
BMC Bioinformatics. 2012 Aug 8;13:193. doi: 10.1186/1471-2105-13-193.
2
MicrobesOnline: an integrated portal for comparative and functional genomics.微生物在线:一个用于比较和功能基因组学的综合门户。
Nucleic Acids Res. 2010 Jan;38(Database issue):D396-400. doi: 10.1093/nar/gkp919. Epub 2009 Nov 11.
3
Assessment of gene set analysis methods based on microarray data.基于微阵列数据的基因集分析方法评估。
Gene. 2014 Jan 25;534(2):383-9. doi: 10.1016/j.gene.2013.08.063. Epub 2013 Sep 3.
4
SEGS: search for enriched gene sets in microarray data.SEGS:在微阵列数据中搜索富集的基因集。
J Biomed Inform. 2008 Aug;41(4):588-601. doi: 10.1016/j.jbi.2007.12.001. Epub 2007 Dec 15.
5
Microarrays for microbiologists.微生物学家专用微阵列芯片
Microbiology (Reading). 2001 Jun;147(Pt 6):1403-1414. doi: 10.1099/00221287-147-6-1403.
6
Reproducibility enhancement and differential expression of non predefined functional gene sets in human genome.人类基因组中未预先定义的功能基因集的可重复性增强及差异表达
BMC Genomics. 2014 Dec 24;15(1):1181. doi: 10.1186/1471-2164-15-1181.
7
Consistency analysis of redundant probe sets on affymetrix three-prime expression arrays and applications to differential mRNA processing.Affymetrix 3' 表达阵列上冗余探针集的一致性分析及其在差异mRNA加工中的应用
PLoS One. 2009;4(1):e4229. doi: 10.1371/journal.pone.0004229. Epub 2009 Jan 23.
8
Genome-wide analysis of endosperm-specific genes in rice.水稻胚乳特异性基因的全基因组分析。
Gene. 2013 Nov 10;530(2):236-47. doi: 10.1016/j.gene.2013.07.088. Epub 2013 Aug 13.
9
Methods for evaluating gene expression from Affymetrix microarray datasets.评估来自Affymetrix微阵列数据集的基因表达的方法。
BMC Bioinformatics. 2008 Jun 17;9:284. doi: 10.1186/1471-2105-9-284.
10
SCOREM: statistical consolidation of redundant expression measures.SCOREM:冗余表达测量的统计合并。
Nucleic Acids Res. 2012 Mar;40(6):e46. doi: 10.1093/nar/gkr1270. Epub 2011 Dec 30.

引用本文的文献

1
Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation.计算和应用原子调控子以理解基因表达与调控。
Front Microbiol. 2016 Nov 24;7:1819. doi: 10.3389/fmicb.2016.01819. eCollection 2016.
2
IMPROVED PERFORMANCE OF GENE SET ANALYSIS ON GENOME-WIDE TRANSCRIPTOMICS DATA WHEN USING GENE ACTIVITY STATE ESTIMATES.使用基因活性状态估计时全基因组转录组学数据上基因集分析性能的提升
Pac Symp Biocomput. 2017;22:449-460. doi: 10.1142/9789813207813_0042.
3
A Bayesian Framework for the Classification of Microbial Gene Activity States.

本文引用的文献

1
On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report.利用基因本体论注释评估直系同源物和旁系同源物之间的功能相似性:简短报告。
PLoS Comput Biol. 2012;8(2):e1002386. doi: 10.1371/journal.pcbi.1002386. Epub 2012 Feb 16.
2
Finding disease similarity based on implicit semantic similarity.基于隐语义相似性的疾病相似性发现。
J Biomed Inform. 2012 Apr;45(2):363-71. doi: 10.1016/j.jbi.2011.11.017. Epub 2011 Dec 7.
3
SEAS: a system for SEED-based pathway enrichment analysis.SEAS:基于 SEED 的通路富集分析系统。
一种用于微生物基因活性状态分类的贝叶斯框架。
Front Microbiol. 2016 Aug 9;7:1191. doi: 10.3389/fmicb.2016.01191. eCollection 2016.
4
Novel gene sets improve set-level classification of prokaryotic gene expression data.新型基因集改进了原核生物基因表达数据的集水平分类。
BMC Bioinformatics. 2015 Oct 28;16:348. doi: 10.1186/s12859-015-0786-7.
5
Cautions about the reliability of pairwise gene correlations based on expression data.基于表达数据的成对基因相关性可靠性的注意事项。
Front Microbiol. 2015 Jun 26;6:650. doi: 10.3389/fmicb.2015.00650. eCollection 2015.
6
The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST).SEED 与利用子系统技术进行快速微生物基因组注释(RAST)。
Nucleic Acids Res. 2014 Jan;42(Database issue):D206-14. doi: 10.1093/nar/gkt1226. Epub 2013 Nov 29.
PLoS One. 2011;6(7):e22556. doi: 10.1371/journal.pone.0022556. Epub 2011 Jul 22.
4
A SATS algorithm for jointly identifying multiple differentially expressed gene sets.一种联合识别多个差异表达基因集的 SATS 算法。
Stat Med. 2011 Jul 20;30(16):2028-39. doi: 10.1002/sim.4235. Epub 2011 Apr 7.
5
Functional integration of a metabolic network model and expression data without arbitrary thresholding.代谢网络模型和表达数据的功能集成,无需任意阈值处理。
Bioinformatics. 2011 Feb 15;27(4):541-7. doi: 10.1093/bioinformatics/btq702. Epub 2010 Dec 20.
6
Probabilistic integrative modeling of genome-scale metabolic and regulatory networks in Escherichia coli and Mycobacterium tuberculosis.大肠杆菌和结核分枝杆菌全基因组代谢和调控网络的概率综合建模。
Proc Natl Acad Sci U S A. 2010 Oct 12;107(41):17845-50. doi: 10.1073/pnas.1005139107. Epub 2010 Sep 27.
7
Integrative inference of gene-regulatory networks in Escherichia coli using information theoretic concepts and sequence analysis.利用信息论概念和序列分析对大肠杆菌基因调控网络进行整合推断。
BMC Syst Biol. 2010 Aug 18;4:116. doi: 10.1186/1752-0509-4-116.
8
SICAGO: Semi-supervised cluster analysis using semantic distance between gene pairs in Gene Ontology.SICAGO:基于基因本体论中基因对语义距离的半监督聚类分析。
Bioinformatics. 2010 May 15;26(10):1384-5. doi: 10.1093/bioinformatics/btq133. Epub 2010 Mar 26.
9
Semantic similarity in biomedical ontologies.生物医学本体中的语义相似性。
PLoS Comput Biol. 2009 Jul;5(7):e1000443. doi: 10.1371/journal.pcbi.1000443. Epub 2009 Jul 31.
10
CLEAN: CLustering Enrichment ANalysis.CLEAN:聚类富集分析。
BMC Bioinformatics. 2009 Jul 29;10:234. doi: 10.1186/1471-2105-10-234.