• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个基于网络且支持网格计算的dChip版本,用于分析大量基因表达数据。

A Web-based and Grid-enabled dChip version for the analysis of large sets of gene expression data.

作者信息

Corradi Luca, Fato Marco, Porro Ivan, Scaglione Silvia, Torterolo Livia

机构信息

Computer Science, Systems, and Communication Department, University of Genova, Viale Causa 12, Genova, Italy.

出版信息

BMC Bioinformatics. 2008 Nov 13;9:480. doi: 10.1186/1471-2105-9-480.

DOI:10.1186/1471-2105-9-480
PMID:19014540
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2596147/
Abstract

BACKGROUND

Microarray techniques are one of the main methods used to investigate thousands of gene expression profiles for enlightening complex biological processes responsible for serious diseases, with a great scientific impact and a wide application area. Several standalone applications had been developed in order to analyze microarray data. Two of the most known free analysis software packages are the R-based Bioconductor and dChip. The part of dChip software concerning the calculation and the analysis of gene expression has been modified to permit its execution on both cluster environments (supercomputers) and Grid infrastructures (distributed computing).This work is not aimed at replacing existing tools, but it provides researchers with a method to analyze large datasets without any hardware or software constraints.

RESULTS

An application able to perform the computation and the analysis of gene expression on large datasets has been developed using algorithms provided by dChip. Different tests have been carried out in order to validate the results and to compare the performances obtained on different infrastructures. Validation tests have been performed using a small dataset related to the comparison of HUVEC (Human Umbilical Vein Endothelial Cells) and Fibroblasts, derived from same donors, treated with IFN-alpha.Moreover performance tests have been executed just to compare performances on different environments using a large dataset including about 1000 samples related to Breast Cancer patients.

CONCLUSION

A Grid-enabled software application for the analysis of large Microarray datasets has been proposed. DChip software has been ported on Linux platform and modified, using appropriate parallelization strategies, to permit its execution on both cluster environments and Grid infrastructures. The added value provided by the use of Grid technologies is the possibility to exploit both computational and data Grid infrastructures to analyze large datasets of distributed data. The software has been validated and performances on cluster and Grid environments have been compared obtaining quite good scalability results.

摘要

背景

微阵列技术是用于研究数千个基因表达谱以阐明导致严重疾病的复杂生物过程的主要方法之一,具有重大的科学影响和广泛的应用领域。为了分析微阵列数据,已经开发了几个独立的应用程序。两个最著名的免费分析软件包是基于R的Bioconductor和dChip。dChip软件中有关基因表达计算和分析的部分已经过修改,以允许其在集群环境(超级计算机)和网格基础设施(分布式计算)上运行。这项工作并非旨在取代现有工具,而是为研究人员提供一种在没有任何硬件或软件限制的情况下分析大型数据集的方法。

结果

利用dChip提供的算法开发了一个能够对大型数据集进行基因表达计算和分析的应用程序。为了验证结果并比较在不同基础设施上获得的性能,进行了不同的测试。使用与来自相同供体的人脐静脉内皮细胞(HUVEC)和成纤维细胞比较相关的小数据集进行了验证测试,这些细胞用α干扰素处理。此外,使用包含约1000个与乳腺癌患者相关样本的大型数据集进行了性能测试,只是为了比较不同环境下的性能。

结论

提出了一种用于分析大型微阵列数据集的支持网格的软件应用程序。dChip软件已移植到Linux平台并使用适当的并行化策略进行了修改,以允许其在集群环境和网格基础设施上运行。使用网格技术提供的附加值是能够利用计算和数据网格基础设施来分析分布式数据的大型数据集。该软件已经过验证,并比较了在集群和网格环境下的性能,获得了相当好的可扩展性结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/257ef410e0d3/1471-2105-9-480-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/055ef2d4522f/1471-2105-9-480-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/3c1059851fb0/1471-2105-9-480-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/52118f9c60fd/1471-2105-9-480-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/92fc86561150/1471-2105-9-480-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/9399ad5fc489/1471-2105-9-480-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/257ef410e0d3/1471-2105-9-480-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/055ef2d4522f/1471-2105-9-480-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/3c1059851fb0/1471-2105-9-480-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/52118f9c60fd/1471-2105-9-480-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/92fc86561150/1471-2105-9-480-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/9399ad5fc489/1471-2105-9-480-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62f8/2596147/257ef410e0d3/1471-2105-9-480-6.jpg

相似文献

1
A Web-based and Grid-enabled dChip version for the analysis of large sets of gene expression data.一个基于网络且支持网格计算的dChip版本,用于分析大量基因表达数据。
BMC Bioinformatics. 2008 Nov 13;9:480. doi: 10.1186/1471-2105-9-480.
2
A Grid-based solution for management and analysis of microarrays in distributed experiments.一种用于分布式实验中微阵列管理与分析的基于网格的解决方案。
BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S7. doi: 10.1186/1471-2105-8-S1-S7.
3
EMAAS: an extensible grid-based rich internet application for microarray data analysis and management.EMAAS:一种用于微阵列数据分析与管理的基于网格的可扩展富互联网应用程序。
BMC Bioinformatics. 2008 Nov 25;9:493. doi: 10.1186/1471-2105-9-493.
4
GLAD: a system for developing and deploying large-scale bioinformatics grid.GLAD:一个用于开发和部署大规模生物信息学网格的系统。
Bioinformatics. 2005 Mar;21(6):794-802. doi: 10.1093/bioinformatics/bti034. Epub 2004 Sep 23.
5
ArrayQuest: a web resource for the analysis of DNA microarray data.ArrayQuest:一个用于分析DNA微阵列数据的网络资源。
BMC Bioinformatics. 2005 Dec 1;6:287. doi: 10.1186/1471-2105-6-287.
6
A bioinformatics knowledge discovery in text application for grid computing.一种用于网格计算的文本应用中的生物信息学知识发现。
BMC Bioinformatics. 2009 Jun 16;10 Suppl 6(Suppl 6):S23. doi: 10.1186/1471-2105-10-S6-S23.
7
Automating dChip: toward reproducible sharing of microarray data analysis.自动化dChip:迈向可重复共享的微阵列数据分析
BMC Bioinformatics. 2008 May 8;9:231. doi: 10.1186/1471-2105-9-231.
8
myGrid: personalised bioinformatics on the information grid.我的网格:信息网格上的个性化生物信息学
Bioinformatics. 2003;19 Suppl 1:i302-4. doi: 10.1093/bioinformatics/btg1041.
9
Talisman--rapid application development for the grid.Talisman——面向网格的快速应用程序开发。
Bioinformatics. 2003;19 Suppl 1:i212-4. doi: 10.1093/bioinformatics/btg1028.
10
The dChip survival analysis module for microarray data.dChip 微阵列数据分析模块
BMC Bioinformatics. 2011 Mar 9;12:72. doi: 10.1186/1471-2105-12-72.

引用本文的文献

1
A digital repository with an extensible data model for biobanking and genomic analysis management.一个具有可扩展数据模型的数字存储库,用于生物样本库和基因组分析管理。
BMC Genomics. 2014;15 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2164-15-S3-S3. Epub 2014 May 6.
2
mu-CS: an extension of the TM4 platform to manage Affymetrix binary data.mu-CS:TM4 平台的一个扩展,用于管理 Affymetrix 二进制数据。
BMC Bioinformatics. 2010 Jun 10;11:315. doi: 10.1186/1471-2105-11-315.
3
Survival Online: a web-based service for the analysis of correlations between gene expression and clinical and follow-up data.

本文引用的文献

1
A Grid-based solution for management and analysis of microarrays in distributed experiments.一种用于分布式实验中微阵列管理与分析的基于网格的解决方案。
BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S7. doi: 10.1186/1471-2105-8-S1-S7.
2
How to decide? Different methods of calculating gene expression from short oligonucleotide array data will give different results.如何做出决定?从短寡核苷酸阵列数据计算基因表达的不同方法会得出不同的结果。
BMC Bioinformatics. 2006 Mar 15;7:137. doi: 10.1186/1471-2105-7-137.
3
Evaluation of methods for oligonucleotide array data via quantitative real-time PCR.
生存分析在线:一个基于网络的服务,用于分析基因表达与临床和随访数据之间的相关性。
BMC Bioinformatics. 2009 Oct 15;10 Suppl 12(Suppl 12):S10. doi: 10.1186/1471-2105-10-S12-S10.
通过定量实时PCR评估寡核苷酸阵列数据的方法
BMC Bioinformatics. 2006 Jan 17;7:23. doi: 10.1186/1471-2105-7-23.
4
Molecular mechanisms of action of angiopreventive anti-oxidants on endothelial cells: microarray gene expression analyses.血管预防抗氧化剂作用于内皮细胞的分子机制:基因芯片基因表达分析
Mutat Res. 2005 Dec 11;591(1-2):198-211. doi: 10.1016/j.mrfmmm.2005.04.014. Epub 2005 Aug 5.
5
Bioconductor: open software development for computational biology and bioinformatics.生物导体:用于计算生物学和生物信息学的开源软件开发。
Genome Biol. 2004;5(10):R80. doi: 10.1186/gb-2004-5-10-r80. Epub 2004 Sep 15.
6
Exploration, normalization, and summaries of high density oligonucleotide array probe level data.高密度寡核苷酸阵列探针水平数据的探索、标准化及汇总
Biostatistics. 2003 Apr;4(2):249-64. doi: 10.1093/biostatistics/4.2.249.
7
Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application.基于模型的寡核苷酸阵列分析:模型验证、设计问题及标准误差应用
Genome Biol. 2001;2(8):RESEARCH0032. doi: 10.1186/gb-2001-2-8-research0032. Epub 2001 Aug 3.
8
Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection.基于模型的寡核苷酸阵列分析:表达指数计算与异常值检测。
Proc Natl Acad Sci U S A. 2001 Jan 2;98(1):31-6. doi: 10.1073/pnas.98.1.31.