• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高通量处理和归一化单色微阵列用于转录组元分析。

High-throughput processing and normalization of one-color microarrays for transcriptional meta-analyses.

机构信息

Arthritis and Clinical Immunology Research Program, Oklahoma Medical Research Foundation 825 NE 13th Street, Oklahoma City, Oklahoma 73104-5005, USA.

出版信息

BMC Bioinformatics. 2011 Oct 18;12 Suppl 10(Suppl 10):S2. doi: 10.1186/1471-2105-12-S10-S2.

DOI:10.1186/1471-2105-12-S10-S2
PMID:22166002
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3236842/
Abstract

BACKGROUND

Microarray experiments are becoming increasingly common in biomedical research, as is their deposition in publicly accessible repositories, such as Gene Expression Omnibus (GEO). As such, there has been a surge in interest to use this microarray data for meta-analytic approaches, whether to increase sample size for a more powerful analysis of a specific disease (e.g. lung cancer) or to re-examine experiments for reasons different than those examined in the initial, publishing study that generated them. For the average biomedical researcher, there are a number of practical barriers to conducting such meta-analyses such as manually aggregating, filtering and formatting the data. Methods to automatically process large repositories of microarray data into a standardized, directly comparable format will enable easier and more reliable access to microarray data to conduct meta-analyses.

METHODS

We present a straightforward, simple but robust against potential outliers method for automatic quality control and pre-processing of tens of thousands of single-channel microarray data files. GEO GDS files are quality checked by comparing parametric distributions and quantile normalized to enable direct comparison of expression level for subsequent meta-analyses.

RESULTS

13,000 human 1-color experiments were processed to create a single gene expression matrix that subsets can be extracted from to conduct meta-analyses. Interestingly, we found that when conducting a global meta-analysis of gene-gene co-expression patterns across all 13,000 experiments to predict gene function, normalization had minimal improvement over using the raw data.

CONCLUSIONS

Normalization of microarray data appears to be of minimal importance on analyses based on co-expression patterns when the sample size is on the order of thousands microarray datasets. Smaller subsets, however, are more prone to aberrations and artefacts, and effective means of automating normalization procedures not only empowers meta-analytic approaches, but aids in reproducibility by providing a standard way of approaching the problem.Data availability: matrix containing normalized expression of 20,813 genes across 13,000 experiments is available for download at . Source code for GDS files pre-processing is available from the authors upon request.

摘要

背景

微阵列实验在生物医学研究中变得越来越普遍,并且它们被存入了公共可访问的存储库中,例如基因表达综合(GEO)。因此,人们对使用这些微阵列数据进行荟萃分析的兴趣激增,无论是为了增加特定疾病(例如肺癌)的样本量以进行更强大的分析,还是出于与最初生成它们的出版研究不同的原因来重新检查实验。对于普通的生物医学研究人员来说,进行此类荟萃分析存在许多实际障碍,例如手动汇总、过滤和格式化数据。将大量微阵列数据自动处理为标准化、可直接比较的格式的方法将使人们更容易、更可靠地访问微阵列数据以进行荟萃分析。

方法

我们提出了一种简单、直接但稳健的方法,用于自动质量控制和预处理数万个单通道微阵列数据文件。通过比较参数分布和分位数归一化来检查 GEO GDS 文件,以实现后续荟萃分析中表达水平的直接比较。

结果

处理了 13000 个人类 1 色实验,创建了一个可从中提取子集以进行荟萃分析的单个基因表达矩阵。有趣的是,我们发现,当对所有 13000 个实验进行全局基因-基因共表达模式荟萃分析以预测基因功能时,与使用原始数据相比,归一化的改进很小。

结论

当样本量为数千个微阵列数据集时,基于共表达模式的分析中,微阵列数据的归一化似乎不重要。然而,较小的子集更容易出现异常和伪影,自动归一化程序的有效方法不仅增强了荟萃分析方法,而且通过提供一种处理问题的标准方法,有助于可重复性。数据可用性:可从以下网址下载包含 13000 个实验中 20813 个基因的归一化表达的矩阵:。有关 GDS 文件预处理的源代码可应要求向作者索取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/0099f26695dc/1471-2105-12-S10-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/709c6b55e447/1471-2105-12-S10-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/a1e9412ba5cc/1471-2105-12-S10-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/00a5c78f6cd8/1471-2105-12-S10-S2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/0099f26695dc/1471-2105-12-S10-S2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/709c6b55e447/1471-2105-12-S10-S2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/a1e9412ba5cc/1471-2105-12-S10-S2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/00a5c78f6cd8/1471-2105-12-S10-S2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/585b/3236842/0099f26695dc/1471-2105-12-S10-S2-4.jpg

相似文献

1
High-throughput processing and normalization of one-color microarrays for transcriptional meta-analyses.高通量处理和归一化单色微阵列用于转录组元分析。
BMC Bioinformatics. 2011 Oct 18;12 Suppl 10(Suppl 10):S2. doi: 10.1186/1471-2105-12-S10-S2.
2
ArrayWiki: an enabling technology for sharing public microarray data repositories and meta-analyses.ArrayWiki:一种用于共享公共微阵列数据存储库和荟萃分析的支持技术。
BMC Bioinformatics. 2008 May 28;9 Suppl 6(Suppl 6):S18. doi: 10.1186/1471-2105-9-S6-S18.
3
Microarray meta-analysis database (M(2)DB): a uniformly pre-processed, quality controlled, and manually curated human clinical microarray database.微阵列荟萃分析数据库 (M(2)DB):一个统一预处理、质量控制和人工管理的人类临床微阵列数据库。
BMC Bioinformatics. 2010 Aug 10;11:421. doi: 10.1186/1471-2105-11-421.
4
MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data.MAAMD:一种标准化 Affymetrix 微阵列数据分析和比较的工作流程。
BMC Bioinformatics. 2014 Mar 12;15:69. doi: 10.1186/1471-2105-15-69.
5
Microarray retriever: a web-based tool for searching and large scale retrieval of public microarray data.微阵列检索工具:一种用于搜索和大规模检索公共微阵列数据的基于网络的工具。
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W327-31. doi: 10.1093/nar/gkn213. Epub 2008 May 7.
6
Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.微分析器:Affymetrix 微阵列数据的自动预处理。
Comput Methods Programs Biomed. 2013 Aug;111(2):402-9. doi: 10.1016/j.cmpb.2013.04.006. Epub 2013 May 31.
7
The effect of oligonucleotide microarray data pre-processing on the analysis of patient-cohort studies.寡核苷酸微阵列数据预处理对患者队列研究分析的影响。
BMC Bioinformatics. 2006 Mar 2;7:105. doi: 10.1186/1471-2105-7-105.
8
MAID : an effect size based model for microarray data integration across laboratories and platforms.MAID:一种基于效应量的模型,用于跨实验室和平台整合微阵列数据。
BMC Bioinformatics. 2008 Jul 10;9:305. doi: 10.1186/1471-2105-9-305.
9
massiR: a method for predicting the sex of samples in gene expression microarray datasets.massiR:一种用于预测基因表达微阵列数据集样本性别的方法。
Bioinformatics. 2014 Jul 15;30(14):2084-5. doi: 10.1093/bioinformatics/btu161. Epub 2014 Mar 22.
10
Assessment of a novel multi-array normalization method based on spike-in control probes suitable for microRNA datasets with global decreases in expression.基于适用于表达整体下降的微小RNA数据集的掺入对照探针的新型多阵列标准化方法的评估。
BMC Res Notes. 2014 May 17;7:302. doi: 10.1186/1756-0500-7-302.

引用本文的文献

1
Data-driven analysis and druggability assessment methods to accelerate the identification of novel cancer targets.数据驱动分析和药物可及性评估方法以加速新型癌症靶点的识别。
Comput Struct Biotechnol J. 2022 Nov 24;21:46-57. doi: 10.1016/j.csbj.2022.11.042. eCollection 2023.
2
Gene Expression Meta-Analysis of Seven Candidate Gene Sets for Diabetes Traits Following a GWAS Pathway Study.全基因组关联研究(GWAS)通路研究后七个糖尿病性状候选基因集的基因表达荟萃分析
Front Genet. 2018 Feb 16;9:52. doi: 10.3389/fgene.2018.00052. eCollection 2018.
3
Prediction and Analysis of Key Genes in Glioblastoma Based on Bioinformatics.

本文引用的文献

1
Predicting gene ontology from a global meta-analysis of 1-color microarray experiments.从全球 1 色微阵列实验的荟萃分析中预测基因本体。
BMC Bioinformatics. 2011 Oct 18;12 Suppl 10(Suppl 10):S14. doi: 10.1186/1471-2105-12-S10-S14.
2
Genome-wide functional annotation by integrating multiple microarray datasets using meta-analysis.通过整合多个微阵列数据集并使用荟萃分析进行全基因组功能注释。
Int J Data Min Bioinform. 2010;4(4):357-76. doi: 10.1504/ijdmb.2010.034194.
3
Microarray meta-analysis database (M(2)DB): a uniformly pre-processed, quality controlled, and manually curated human clinical microarray database.
基于生物信息学的胶质母细胞瘤关键基因预测与分析
Biomed Res Int. 2017;2017:7653101. doi: 10.1155/2017/7653101. Epub 2017 Jan 16.
4
Comparative study of joint analysis of microarray gene expression data in survival prediction and risk assessment of breast cancer patients.微阵列基因表达数据联合分析在乳腺癌患者生存预测和风险评估中的比较研究。
Brief Bioinform. 2016 Sep;17(5):771-85. doi: 10.1093/bib/bbv092. Epub 2015 Oct 26.
5
Genetic Variation in the Platelet Endothelial Aggregation Receptor 1 Gene Results in Endothelial Dysfunction.血小板内皮聚集受体1基因的遗传变异导致内皮功能障碍。
PLoS One. 2015 Sep 25;10(9):e0138795. doi: 10.1371/journal.pone.0138795. eCollection 2015.
6
Ribosomal and immune transcripts associate with relapse in acquired ADAMTS13-deficient thrombotic thrombocytopenic purpura.核糖体和免疫转录本与获得性ADAMTS13缺陷型血栓性血小板减少性紫癜的复发相关。
PLoS One. 2015 Feb 11;10(2):e0117614. doi: 10.1371/journal.pone.0117614. eCollection 2015.
7
Systematic classification of non-coding RNAs by epigenomic similarity.基于表观遗传相似性的非编码 RNA 系统分类。
BMC Bioinformatics. 2013;14 Suppl 14(Suppl 14):S2. doi: 10.1186/1471-2105-14-S14-S2. Epub 2013 Oct 9.
8
Epigenomic elements enriched in the promoters of autoimmunity susceptibility genes.在自身免疫易感性基因启动子中富集的表观基因组元件。
Epigenetics. 2014 Feb;9(2):276-85. doi: 10.4161/epi.27021. Epub 2013 Nov 8.
9
Experimental validation of 5 in-silico predicted glioma biomarkers.5 种计算机预测脑胶质瘤生物标志物的实验验证。
Neuro Oncol. 2013 Dec;15(12):1625-34. doi: 10.1093/neuonc/not124. Epub 2013 Oct 24.
10
ELTD1, a potential new biomarker for gliomas.ELTD1,一种潜在的新的神经胶质瘤生物标志物。
Neurosurgery. 2013 Jan;72(1):77-90; discussion 91. doi: 10.1227/NEU.0b013e318276b29d.
微阵列荟萃分析数据库 (M(2)DB):一个统一预处理、质量控制和人工管理的人类临床微阵列数据库。
BMC Bioinformatics. 2010 Aug 10;11:421. doi: 10.1186/1471-2105-11-421.
4
Comparison study of microarray meta-analysis methods.微阵列荟萃分析方法的比较研究。
BMC Bioinformatics. 2010 Aug 3;11:408. doi: 10.1186/1471-2105-11-408.
5
The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models.《基因芯片质量控制(MAQC)-II 研究:基于基因芯片的预测模型的开发和验证的常见实践》。
Nat Biotechnol. 2010 Aug;28(8):827-38. doi: 10.1038/nbt.1665. Epub 2010 Jul 30.
6
Internal standard-based analysis of microarray data. Part 1: analysis of differential gene expressions.基于内标的微阵列数据分析。第1部分:差异基因表达分析。
Nucleic Acids Res. 2009 Oct;37(19):6323-39. doi: 10.1093/nar/gkp706. Epub 2009 Aug 31.
7
A global meta-analysis of microarray expression data to predict unknown gene functions and estimate the literature-data divide.一项用于预测未知基因功能并评估文献数据差异的微阵列表达数据的全球荟萃分析。
Bioinformatics. 2009 Jul 1;25(13):1694-701. doi: 10.1093/bioinformatics/btp290. Epub 2009 May 15.
8
Implementation of GenePattern within the Stanford Microarray Database.基因模式在斯坦福微阵列数据库中的实现。
Nucleic Acids Res. 2009 Jan;37(Database issue):D898-901. doi: 10.1093/nar/gkn786. Epub 2008 Oct 25.
9
A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments.微阵列实验中检测差异表达基因的荟萃分析方法比较。
Bioinformatics. 2008 Feb 1;24(3):374-82. doi: 10.1093/bioinformatics/btm620. Epub 2008 Jan 18.
10
CellMontage: similar expression profile search server.细胞蒙太奇:相似表达谱搜索服务器。
Bioinformatics. 2007 Nov 15;23(22):3103-4. doi: 10.1093/bioinformatics/btm462. Epub 2007 Sep 25.