• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

统计表达模式检验(STEPath):一种在个体和荟萃分析研究中整合基因表达数据和基因组信息的新策略。

Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies.

机构信息

CRIBI Biotechnology Centre, Department of Biology, University of Padova, via U, Bassi 58/B, 35121 Padova, Italy.

出版信息

BMC Bioinformatics. 2011 Apr 11;12:92. doi: 10.1186/1471-2105-12-92.

DOI:10.1186/1471-2105-12-92
PMID:21481242
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3094239/
Abstract

BACKGROUND

In the last decades, microarray technology has spread, leading to a dramatic increase of publicly available datasets. The first statistical tools developed were focused on the identification of significant differentially expressed genes. Later, researchers moved toward the systematic integration of gene expression profiles with additional biological information, such as chromosomal location, ontological annotations or sequence features. The analysis of gene expression linked to physical location of genes on chromosomes allows the identification of transcriptionally imbalanced regions, while, Gene Set Analysis focuses on the detection of coordinated changes in transcriptional levels among sets of biologically related genes. In this field, meta-analysis offers the possibility to compare different studies, addressing the same biological question to fully exploit public gene expression datasets.

RESULTS

We describe STEPath, a method that starts from gene expression profiles and integrates the analysis of imbalanced region as an a priori step before performing gene set analysis. The application of STEPath in individual studies produced gene set scores weighted by chromosomal activation. As a final step, we propose a way to compare these scores across different studies (meta-analysis) on related biological issues. One complication with meta-analysis is batch effects, which occur because molecular measurements are affected by laboratory conditions, reagent lots and personnel differences. Major problems occur when batch effects are correlated with an outcome of interest and lead to incorrect conclusions. We evaluated the power of combining chromosome mapping and gene set enrichment analysis, performing the analysis on a dataset of leukaemia (example of individual study) and on a dataset of skeletal muscle diseases (meta-analysis approach). In leukaemia, we identified the Hox gene set, a gene set closely related to the pathology that other algorithms of gene set analysis do not identify, while the meta-analysis approach on muscular disease discriminates between related pathologies and correlates similar ones from different studies.

CONCLUSIONS

STEPath is a new method that integrates gene expression profiles, genomic co-expressed regions and the information about the biological function of genes. The usage of the STEPath-computed gene set scores overcomes batch effects in the meta-analysis approaches allowing the direct comparison of different pathologies and different studies on a gene set activation level.

摘要

背景

在过去的几十年中,微阵列技术得到了广泛应用,导致可公开获得的数据集数量急剧增加。最初开发的统计工具主要集中在识别显著差异表达的基因上。后来,研究人员转向系统地将基因表达谱与其他生物学信息(如染色体位置、本体注释或序列特征)集成。对基因表达与基因在染色体上的物理位置的分析可以识别转录失衡区域,而基因集分析则侧重于检测生物相关基因集之间转录水平的协调变化。在这个领域中,荟萃分析提供了比较不同研究的可能性,从而可以充分利用公共基因表达数据集来解决相同的生物学问题。

结果

我们描述了 STEPath,这是一种从基因表达谱开始的方法,在进行基因集分析之前,将不平衡区域的分析作为一个先验步骤。STEPath 在单个研究中的应用产生了加权染色体激活的基因集得分。作为最后一步,我们提出了一种在相关生物学问题上比较不同研究(荟萃分析)中这些得分的方法。荟萃分析的一个复杂问题是批次效应,这是由于分子测量受到实验室条件、试剂批次和人员差异的影响而产生的。当批次效应与感兴趣的结果相关并导致错误结论时,就会出现主要问题。我们评估了结合染色体作图和基因集富集分析的能力,在白血病数据集(单个研究的分析)和骨骼肌疾病数据集(荟萃分析方法)上进行了分析。在白血病中,我们确定了 Hox 基因集,这是一个与病理学密切相关的基因集,而其他基因集分析算法则无法识别;而肌肉疾病的荟萃分析方法则可以区分相关的病理学,并将来自不同研究的相似病理学进行关联。

结论

STEPath 是一种新的方法,它集成了基因表达谱、基因组共表达区域以及基因生物学功能的信息。使用 STEPath 计算的基因集得分可以克服荟萃分析方法中的批次效应,从而可以在基因集激活水平上直接比较不同的病理学和不同的研究。

相似文献

1
Statistical Test of Expression Pattern (STEPath): a new strategy to integrate gene expression data with genomic information in individual and meta-analysis studies.统计表达模式检验(STEPath):一种在个体和荟萃分析研究中整合基因表达数据和基因组信息的新策略。
BMC Bioinformatics. 2011 Apr 11;12:92. doi: 10.1186/1471-2105-12-92.
2
Improving gene set analysis of microarray data by SAM-GS.通过SAM-GS改进微阵列数据的基因集分析
BMC Bioinformatics. 2007 Jul 5;8:242. doi: 10.1186/1471-2105-8-242.
3
Cross-species and cross-platform gene expression studies with the Bioconductor-compliant R package 'annotationTools'.使用符合Bioconductor标准的R包“annotationTools”进行跨物种和跨平台的基因表达研究。
BMC Bioinformatics. 2008 Jan 17;9:26. doi: 10.1186/1471-2105-9-26.
4
A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies.一种强大的贝叶斯元分析方法,用于整合多个基因集富集研究。
Bioinformatics. 2013 Apr 1;29(7):862-9. doi: 10.1093/bioinformatics/btt068. Epub 2013 Feb 15.
5
Two independent gene signatures in pediatric t(4;11) acute lymphoblastic leukemia patients.小儿t(4;11)急性淋巴细胞白血病患者中的两个独立基因特征。
Eur J Haematol. 2009 Nov;83(5):406-19. doi: 10.1111/j.1600-0609.2009.01305.x. Epub 2009 Jun 25.
6
Meta-analysis for pathway enrichment analysis when combining multiple genomic studies.多组学研究整合的通路富集分析的元分析
Bioinformatics. 2010 May 15;26(10):1316-23. doi: 10.1093/bioinformatics/btq148. Epub 2010 Apr 21.
7
Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach.蛋白质翻译与细胞外基质基因集与乳腺癌转移的关联:使用个体患者数据方法对多个公开可用数据集进行分析所发现的结果
PLoS One. 2015 Jun 16;10(6):e0129610. doi: 10.1371/journal.pone.0129610. eCollection 2015.
8
MAID : an effect size based model for microarray data integration across laboratories and platforms.MAID:一种基于效应量的模型,用于跨实验室和平台整合微阵列数据。
BMC Bioinformatics. 2008 Jul 10;9:305. doi: 10.1186/1471-2105-9-305.
9
GEM-TREND: a web tool for gene expression data mining toward relevant network discovery.GEM-TREND:一个用于挖掘基因表达数据以发现相关网络的网络工具。
BMC Genomics. 2009 Sep 3;10:411. doi: 10.1186/1471-2164-10-411.
10
SiPaGene: A new repository for instant online retrieval, sharing and meta-analyses of GeneChip expression data.SiPaGene:一个用于即时在线检索、共享和对基因芯片表达数据进行荟萃分析的新数据库。
BMC Genomics. 2009 Mar 5;10:98. doi: 10.1186/1471-2164-10-98.

引用本文的文献

1
Altered gene transcription in human cells treated with Ludox® silica nanoparticles.用Ludox®二氧化硅纳米颗粒处理的人类细胞中的基因转录改变。
Int J Environ Res Public Health. 2014 Aug 28;11(9):8867-90. doi: 10.3390/ijerph110908867.
2
Systems biology approach to the dissection of the complexity of regulatory networks in the S. scrofa cardiocirculatory system.系统生物学方法解析猪心循环系统调控网络的复杂性。
Int J Mol Sci. 2013 Nov 21;14(11):23160-87. doi: 10.3390/ijms141123160.
3
Analyzing illumina gene expression microarray data from different tissues: methodological aspects of data analysis in the metaxpress consortium.

本文引用的文献

1
Genomics tools for unraveling chromosome architecture.用于解析染色体结构的基因组学工具。
Nat Biotechnol. 2010 Oct;28(10):1089-95. doi: 10.1038/nbt.1680.
2
Dynamic organization of chromatin assembly and transcription factories in living cells.活细胞中染色质组装和转录工厂的动态组织
Methods Cell Biol. 2010;98:57-78. doi: 10.1016/S0091-679X(10)98003-5.
3
Meta-analysis for pathway enrichment analysis when combining multiple genomic studies.多组学研究整合的通路富集分析的元分析
分析来自不同组织的 Illumina 基因表达微阵列数据:Metaxpress 联盟中数据分析的方法学方面。
PLoS One. 2012;7(12):e50938. doi: 10.1371/journal.pone.0050938. Epub 2012 Dec 7.
4
Integrative analysis of neuroblastoma and pheochromocytoma genomics data.神经母细胞瘤和嗜铬细胞瘤基因组学数据的综合分析。
BMC Med Genomics. 2012 Oct 29;5:48. doi: 10.1186/1755-8794-5-48.
Bioinformatics. 2010 May 15;26(10):1316-23. doi: 10.1093/bioinformatics/btq148. Epub 2010 Apr 21.
4
Meta-analysis of adrenocortical tumour genomics data: novel pathogenic pathways revealed.肾上腺皮质肿瘤基因组学数据的荟萃分析:揭示新的发病机制途径。
Oncogene. 2010 May 27;29(21):3163-72. doi: 10.1038/onc.2010.80. Epub 2010 Mar 22.
5
Impact of probe annotation on the integration of miRNA-mRNA expression profiles for miRNA target detection.探针注释对 miRNA-mRNA 表达谱整合进行 miRNA 靶标检测的影响。
Nucleic Acids Res. 2010 Apr;38(7):e97. doi: 10.1093/nar/gkp1239. Epub 2010 Jan 13.
6
KEGG for representation and analysis of molecular networks involving diseases and drugs.KEGG 用于表示和分析涉及疾病和药物的分子网络。
Nucleic Acids Res. 2010 Jan;38(Database issue):D355-60. doi: 10.1093/nar/gkp896. Epub 2009 Oct 30.
7
MLL rearrangements in pediatric acute lymphoblastic and myeloblastic leukemias: MLL specific and lineage specific signatures.小儿急性淋巴细胞白血病和急性髓细胞白血病中的MLL重排:MLL特异性和谱系特异性特征
BMC Med Genomics. 2009 Jun 23;2:36. doi: 10.1186/1755-8794-2-36.
8
Mitochondrial abnormalities, energy deficit and oxidative stress are features of calpain 3 deficiency in skeletal muscle.线粒体异常、能量缺乏和氧化应激是骨骼肌中钙蛋白酶3缺乏的特征。
Hum Mol Genet. 2009 Sep 1;18(17):3194-205. doi: 10.1093/hmg/ddp257. Epub 2009 May 29.
9
Ripples from neighbouring transcription.来自相邻转录的涟漪效应。
Nat Cell Biol. 2008 Sep;10(9):1106-13. doi: 10.1038/ncb1771.
10
Meta-analysis of expression signatures of muscle atrophy: gene interaction networks in early and late stages.肌肉萎缩表达特征的荟萃分析:早期和晚期的基因相互作用网络
BMC Genomics. 2008 Dec 23;9:630. doi: 10.1186/1471-2164-9-630.