• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

affyPara - 用于Affymetrix微阵列数据并行预处理算法的Bioconductor软件包。

affyPara-a Bioconductor Package for Parallelized Preprocessing Algorithms of Affymetrix Microarray Data.

作者信息

Schmidberger Markus, Vicedo Esmeralda, Mansmann Ulrich

机构信息

Division of Biometrics and Bioinformatics, IBE, University of Munich, 81377 Munich, Germany. Email:

出版信息

Bioinform Biol Insights. 2009 Jul 22;3:83-7. doi: 10.4137/bbi.s3060.

DOI:10.4137/bbi.s3060
PMID:20140068
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2808179/
Abstract

Microarray data repositories as well as large clinical applications of gene expression allow to analyse several hundreds of microarrays at one time. The preprocessing of large amounts of microarrays is still a challenge. The algorithms are limited by the available computer hardware. For example, building classification or prognostic rules from large microarray sets will be very time consuming. Here, preprocessing has to be a part of the cross-validation and resampling strategy which is necessary to estimate the rule's prediction quality honestly.This paper proposes the new Bioconductor package affyPara for parallelized preprocessing of Affymetrix microarray data. Partition of data can be applied on arrays and parallelization of algorithms is a straightforward consequence. The partition of data and distribution to several nodes solves the main memory problems and accelerates preprocessing by up to the factor 20 for 200 or more arrays.affyPara is a free and open source package, under GPL license, available form the Bioconductor project at www.bioconductor.org. A user guide and examples are provided with the package.

摘要

微阵列数据存储库以及基因表达的大型临床应用使得能够一次性分析数百个微阵列。对大量微阵列进行预处理仍然是一项挑战。算法受到可用计算机硬件的限制。例如,从大型微阵列集构建分类或预后规则将非常耗时。在此,预处理必须成为交叉验证和重采样策略的一部分,这对于诚实地评估规则的预测质量是必要的。本文提出了新的Bioconductor软件包affyPara,用于对Affymetrix微阵列数据进行并行预处理。数据分区可应用于阵列,算法并行化是直接的结果。数据分区和分布到多个节点解决了主内存问题,并将200个或更多阵列的预处理加速了20倍。affyPara是一个免费的开源软件包,遵循GPL许可,可从www.bioconductor.org的Bioconductor项目获得。该软件包提供了用户指南和示例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5802/2808179/c699e7509214/bbi-2009-083f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5802/2808179/c699e7509214/bbi-2009-083f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5802/2808179/c699e7509214/bbi-2009-083f1.jpg

相似文献

1
affyPara-a Bioconductor Package for Parallelized Preprocessing Algorithms of Affymetrix Microarray Data.affyPara - 用于Affymetrix微阵列数据并行预处理算法的Bioconductor软件包。
Bioinform Biol Insights. 2009 Jul 22;3:83-7. doi: 10.4137/bbi.s3060.
2
mAPKL: R/ Bioconductor package for detecting gene exemplars and revealing their characteristics.mAPKL:用于检测基因范例并揭示其特征的R/Bioconductor软件包。
BMC Bioinformatics. 2015 Sep 15;16(1):291. doi: 10.1186/s12859-015-0719-5.
3
Framework for Parallel Preprocessing of Microarray Data Using Hadoop.使用Hadoop进行微阵列数据并行预处理的框架
Adv Bioinformatics. 2018 Mar 29;2018:9391635. doi: 10.1155/2018/9391635. eCollection 2018.
4
Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.微分析器:Affymetrix 微阵列数据的自动预处理。
Comput Methods Programs Biomed. 2013 Aug;111(2):402-9. doi: 10.1016/j.cmpb.2013.04.006. Epub 2013 May 31.
5
A framework for oligonucleotide microarray preprocessing.寡核苷酸微阵列预处理框架。
Bioinformatics. 2010 Oct 1;26(19):2363-7. doi: 10.1093/bioinformatics/btq431. Epub 2010 Aug 5.
6
arrayQualityMetrics--a bioconductor package for quality assessment of microarray data.arrayQualityMetrics - 一个用于微阵列数据质量评估的生物导体软件包。
Bioinformatics. 2009 Feb 1;25(3):415-6. doi: 10.1093/bioinformatics/btn647. Epub 2008 Dec 23.
7
lumi: a pipeline for processing Illumina microarray.Lumi:一个用于处理Illumina微阵列的流程。
Bioinformatics. 2008 Jul 1;24(13):1547-8. doi: 10.1093/bioinformatics/btn224. Epub 2008 May 8.
8
arrayMagic: two-colour cDNA microarray quality control and preprocessing.阵列魔法:双色cDNA微阵列质量控制与预处理
Bioinformatics. 2005 Feb 15;21(4):554-6. doi: 10.1093/bioinformatics/bti052. Epub 2004 Sep 28.
9
oneChannelGUI: a graphical interface to Bioconductor tools, designed for life scientists who are not familiar with R language.单通道图形用户界面:一个用于生物导体工具的图形界面,专为不熟悉R语言的生命科学家设计。
Bioinformatics. 2007 Dec 15;23(24):3406-8. doi: 10.1093/bioinformatics/btm469. Epub 2007 Sep 17.
10
mu-CS: an extension of the TM4 platform to manage Affymetrix binary data.mu-CS:TM4 平台的一个扩展,用于管理 Affymetrix 二进制数据。
BMC Bioinformatics. 2010 Jun 10;11:315. doi: 10.1186/1471-2105-11-315.

引用本文的文献

1
From Genes to Metabolites: HSP90B1's Role in Alzheimer's Disease and Potential for Therapeutic Intervention.从基因到代谢物:HSP90B1在阿尔茨海默病中的作用及治疗干预潜力
Neuromolecular Med. 2025 Jan 6;27(1):6. doi: 10.1007/s12017-024-08822-0.
2
A fully scalable online pre-processing algorithm for short oligonucleotide microarray atlases.一种用于短寡核苷酸微阵列图谱的完全可扩展的在线预处理算法。
Nucleic Acids Res. 2013 May 1;41(10):e110. doi: 10.1093/nar/gkt229. Epub 2013 Apr 5.
3
Conceptual aspects of large meta-analyses with publicly available microarray data: a case study in oncology.

本文引用的文献

1
Current status of gene expression profiling in the diagnosis and management of acute leukaemia.基因表达谱分析在急性白血病诊断与治疗中的现状
Br J Haematol. 2009 Jun;145(5):555-68. doi: 10.1111/j.1365-2141.2009.07656.x. Epub 2009 Mar 16.
2
Microarray based diagnosis profits from better documentation of gene expression signatures.基于微阵列的诊断受益于基因表达特征的更好记录。
PLoS Comput Biol. 2008 Feb;4(2):e22. doi: 10.1371/journal.pcbi.0040022.
3
A compendium to ensure computational reproducibility in high-dimensional classification tasks.
利用公开可用的微阵列数据进行大型荟萃分析的概念性问题:肿瘤学案例研究
Bioinform Biol Insights. 2011 Jan 23;5:13-39. doi: 10.4137/BBI.S5537.
4
AnyExpress: integrated toolkit for analysis of cross-platform gene expression data using a fast interval matching algorithm.AnyExpress:一个集成的工具包,用于使用快速区间匹配算法分析跨平台基因表达数据。
BMC Bioinformatics. 2011 Mar 17;12:75. doi: 10.1186/1471-2105-12-75.
确保高维分类任务中计算可重复性的纲要。
Stat Appl Genet Mol Biol. 2004;3:Article37. doi: 10.2202/1544-6115.1078. Epub 2004 Dec 19.
4
Bioconductor: open software development for computational biology and bioinformatics.生物导体:用于计算生物学和生物信息学的开源软件开发。
Genome Biol. 2004;5(10):R80. doi: 10.1186/gb-2004-5-10-r80. Epub 2004 Sep 15.
5
Exploration, normalization, and summaries of high density oligonucleotide array probe level data.高密度寡核苷酸阵列探针水平数据的探索、标准化及汇总
Biostatistics. 2003 Apr;4(2):249-64. doi: 10.1093/biostatistics/4.2.249.
6
Variance stabilization applied to microarray data calibration and to the quantification of differential expression.方差稳定化应用于微阵列数据校准和差异表达定量分析。
Bioinformatics. 2002;18 Suppl 1:S96-104. doi: 10.1093/bioinformatics/18.suppl_1.s96.