Suppr超能文献

微阵列上的探针级伪影的检测和校正。

Detection and correction of probe-level artefacts on microarrays.

机构信息

Institute for Informatics, Ludwig-Maximilians-Universität München, Munich, Germany.

出版信息

BMC Bioinformatics. 2012 May 30;13:114. doi: 10.1186/1471-2105-13-114.

Abstract

BACKGROUND

A recent large-scale analysis of Gene Expression Omnibus (GEO) data found frequent evidence for spatial defects in a substantial fraction of Affymetrix microarrays in the GEO. Nevertheless, in contrast to quality assessment, artefact detection is not widely used in standard gene expression analysis pipelines. Furthermore, although approaches have been proposed to detect diverse types of spatial noise on arrays, the correction of these artefacts is mostly left to either summarization methods or the corresponding arrays are completely discarded.

RESULTS

We show that state-of-the-art robust summarization procedures are vulnerable to artefacts on arrays and cannot appropriately correct for these. To address this problem, we present a simple approach to detect artefacts with high recall and precision, which we further improve by taking into account the spatial layout of arrays. Finally, we propose two correction methods for these artefacts that either substitute values of defective probes using probeset information or filter corrupted probes. We show that our approach can identify and correct defective probe measurements appropriately and outperforms existing tools.

CONCLUSIONS

While summarization is insufficient to correct for defective probes, this problem can be addressed in a straightforward way by the methods we present for identification and correction of defective probes. As these methods output CEL files with corrected probe values that serve as input to standard normalization and summarization procedures, they can be easily integrated into existing microarray analysis pipelines as an additional pre-processing step. An R package is freely available from http://www.bio.ifi.lmu.de/artefact-correction.

摘要

背景

最近对基因表达综合数据库(GEO)数据的大规模分析发现,GEO 中相当一部分 Affymetrix 微阵列存在频繁的空间缺陷证据。然而,与质量评估不同,伪影检测在标准基因表达分析流程中并未得到广泛应用。此外,尽管已经提出了多种方法来检测阵列上的各种类型的空间噪声,但这些伪影的校正大多留给汇总方法或完全丢弃相应的阵列。

结果

我们表明,最先进的稳健汇总程序容易受到阵列上的伪影的影响,并且无法适当纠正这些伪影。为了解决这个问题,我们提出了一种简单的方法来检测具有高召回率和精度的伪影,我们进一步通过考虑阵列的空间布局来改进这些方法。最后,我们提出了两种用于这些伪影的校正方法,要么使用探针组信息替代有缺陷探针的值,要么过滤掉有缺陷的探针。我们表明,我们的方法可以适当地识别和纠正有缺陷的探针测量值,并优于现有工具。

结论

虽然汇总不足以纠正有缺陷的探针,但可以通过我们提出的用于识别和纠正有缺陷探针的方法来直接解决这个问题。由于这些方法输出带有校正后探针值的 CEL 文件,可作为标准归一化和汇总程序的输入,因此它们可以作为附加的预处理步骤,轻松集成到现有的微阵列分析流程中。一个 R 包可从 http://www.bio.ifi.lmu.de/artefact-correction 免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c227/3534149/100530fa2aaa/1471-2105-13-114-1.jpg

相似文献

1
Detection and correction of probe-level artefacts on microarrays.
BMC Bioinformatics. 2012 May 30;13:114. doi: 10.1186/1471-2105-13-114.
3
Algorithm-driven artifacts in median polish summarization of microarray data.
BMC Bioinformatics. 2010 Nov 11;11:553. doi: 10.1186/1471-2105-11-553.
4
mu-CS: an extension of the TM4 platform to manage Affymetrix binary data.
BMC Bioinformatics. 2010 Jun 10;11:315. doi: 10.1186/1471-2105-11-315.
6
A probe-treatment-reference (PTR) model for the analysis of oligonucleotide expression microarrays.
BMC Bioinformatics. 2008 Apr 14;9:194. doi: 10.1186/1471-2105-9-194.
7
Software note: using probe secondary structure information to enhance Affymetrix GeneChip background estimates.
Comput Biol Chem. 2007 Apr;31(2):92-8. doi: 10.1016/j.compbiolchem.2007.02.008. Epub 2007 Feb 20.
9
"Harshlighting" small blemishes on microarrays.
BMC Bioinformatics. 2005 Mar 22;6:65. doi: 10.1186/1471-2105-6-65.
10
Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.
Comput Methods Programs Biomed. 2013 Aug;111(2):402-9. doi: 10.1016/j.cmpb.2013.04.006. Epub 2013 May 31.

本文引用的文献

1
caCORRECT2: Improving the accuracy and reliability of microarray data in the presence of artifacts.
BMC Bioinformatics. 2011 Sep 29;12:383. doi: 10.1186/1471-2105-12-383.
2
Exon array data analysis using Affymetrix power tools and R statistical software.
Brief Bioinform. 2011 Nov;12(6):634-44. doi: 10.1093/bib/bbq086. Epub 2011 Apr 15.
3
NCBI GEO: archive for functional genomics data sets--10 years on.
Nucleic Acids Res. 2011 Jan;39(Database issue):D1005-10. doi: 10.1093/nar/gkq1184. Epub 2010 Nov 21.
4
A survey of spatial defects in Homo Sapiens Affymetrix GeneChips.
IEEE/ACM Trans Comput Biol Bioinform. 2010 Oct-Dec;7(4):647-53. doi: 10.1109/TCBB.2008.108.
5
MicroRNA-10a regulation of proinflammatory phenotype in athero-susceptible endothelium in vivo and in vitro.
Proc Natl Acad Sci U S A. 2010 Jul 27;107(30):13450-5. doi: 10.1073/pnas.1002120107. Epub 2010 Jul 12.
6
Exon-level microarray analyses identify alternative splicing programs in breast cancer.
Mol Cancer Res. 2010 Jul;8(7):961-74. doi: 10.1158/1541-7786.MCR-09-0528. Epub 2010 Jul 6.
7
Alternative splicing regulates mouse embryonic stem cell pluripotency and differentiation.
Proc Natl Acad Sci U S A. 2010 Jun 8;107(23):10514-9. doi: 10.1073/pnas.0912260107. Epub 2010 May 24.
10
Splicing factor and exon profiling across human tissues.
Nucleic Acids Res. 2010 May;38(9):2825-38. doi: 10.1093/nar/gkq008. Epub 2010 Jan 27.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验