
Investigating the effect of paralogs on microarray gene-set analysis.

Affiliation

Computational Biology Group, Department of Clinical Laboratory Sciences, University of Cape Town, Cape Town, South Africa.

Publication information

BMC Bioinformatics. 2011 Jan 24;12:29. doi: 10.1186/1471-2105-12-29.

DOI:10.1186/1471-2105-12-29
PMID:21261946
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3037853/
Abstract

BACKGROUND

In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research.
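To make the idea of ranking gene sets concrete, here is a minimal sketch of one simple GSA flavour, a hypergeometric over-representation test. It is an illustration only and is not claimed to be any of the GSA approaches used in the paper; the function names `hypergeom_sf` and `rank_gene_sets` and the toy gene identifiers are hypothetical.

```python
from math import comb


def hypergeom_sf(k, population, successes, draws):
    """Return P(X >= k) for a hypergeometric variable X.

    population: number of genes in the universe
    successes:  number of universe genes that belong to the gene set
    draws:      number of differentially expressed (DE) genes
    """
    upper = min(successes, draws)
    total = comb(population, draws)
    tail = sum(
        comb(successes, i) * comb(population - successes, draws - i)
        for i in range(k, upper + 1)
    )
    return tail / total


def rank_gene_sets(gene_sets, de_genes, universe):
    """Rank gene sets by over-representation of DE genes (smallest p first)."""
    universe = set(universe)
    de = set(de_genes) & universe
    results = []
    for name, members in gene_sets.items():
        members = set(members) & universe
        overlap = len(members & de)
        p_value = hypergeom_sf(overlap, len(universe), len(members), len(de))
        results.append((name, overlap, p_value))
    return sorted(results, key=lambda row: row[2])


# Toy data: 20 placeholder genes, 5 of them called differentially expressed.
universe = [f"g{i}" for i in range(20)]
de_genes = ["g0", "g1", "g2", "g3", "g4"]
gene_sets = {
    "set_A": ["g0", "g1", "g2", "g3"],      # all four members are DE
    "set_B": ["g10", "g11", "g12", "g13"],  # no member is DE
}
ranked = rank_gene_sets(gene_sets, de_genes, universe)
```

Note that this test treats the genes in a set as independent draws. Co-expressed paralogs within a set violate that assumption, which is the kind of confounding the authors suspect.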

RESULTS

We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene (http://www.cbio.uct.ac.za/indygene), a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs.
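The reduction step described above can be sketched as follows. The abstract does not say how Indygene chooses which member of a paralogous pair to keep, so this sketch assumes a simple greedy rule: walk the supplied list in order and drop any gene that is a paralog, above the threshold, of a gene already kept. The function name `reduce_paralogs`, the input format, the gene names, and the similarity values are all hypothetical.

```python
def reduce_paralogs(genes, paralog_pairs, threshold):
    """Return a sublist of `genes` with no paralog pair above `threshold`.

    genes:         gene identifiers, in priority order (earlier genes win)
    paralog_pairs: iterable of (gene_a, gene_b, similarity) tuples
    threshold:     pairs with similarity strictly above this value count
                   as paralogy relationships to be removed
    """
    # Adjacency map restricted to pairs above the threshold.
    neighbours = {}
    for gene_a, gene_b, similarity in paralog_pairs:
        if similarity > threshold:
            neighbours.setdefault(gene_a, set()).add(gene_b)
            neighbours.setdefault(gene_b, set()).add(gene_a)

    kept = []
    kept_set = set()
    for gene in genes:
        if gene in kept_set:
            continue  # ignore repeated identifiers in the input
        if neighbours.get(gene, set()) & kept_set:
            continue  # a paralog of this gene has already been kept
        kept.append(gene)
        kept_set.add(gene)
    return kept


# Placeholder genes and made-up similarity scores, for illustration only.
genes = ["A1", "A2", "B1", "C1"]
pairs = [("A1", "A2", 0.99), ("A1", "B1", 0.43)]

strict = reduce_paralogs(genes, pairs, threshold=0.9)
loose = reduce_paralogs(genes, pairs, threshold=0.4)
```

With the stricter threshold only the near-identical pair is collapsed, while the looser threshold also removes the more distant paralog. The greedy rule depends on input order, so ordering the list by a statistic of interest before reduction is one possible design choice.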

CONCLUSIONS

The Indygene tool efficiently removes paralogy relationships from a given dataset, and we found that such a reduction, performed prior to GSA, can generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches applied to the reanalysis of previously published microarray datasets, and it suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.

Figures

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2620/3037853/57fa779669d6/1471-2105-12-29-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2620/3037853/491b13f85186/1471-2105-12-29-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2620/3037853/c40aa093c3e7/1471-2105-12-29-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2620/3037853/ed527eb3264e/1471-2105-12-29-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2620/3037853/561ba45b4afc/1471-2105-12-29-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2620/3037853/69cc900c65d8/1471-2105-12-29-6.jpg
