Suppr超能文献

从 GEO 读取基因表达数据:一个 R 包,用于方便从基因表达综合数据库(GEO)读取数据。

geneExpressionFromGEO: An R Package to Facilitate Data Reading from Gene Expression Omnibus (GEO).

机构信息

University of Toronto, Toronto, ON, Canada.

出版信息

Methods Mol Biol. 2022;2401:187-194. doi: 10.1007/978-1-0716-1839-4_12.

Abstract

Gene expression profiling is a useful way to measure the activity of genes in molecular biology and, because of its effectiveness, researchers have released thousands of gene expression datasets publicly in online databases and repositories, such as Gene Expression Omnibus (GEO). To read and analyze gene expression data, the computational biology community has developed several tools and platforms, including Bioconductor, an R open-source platform of software packages that can be used to analyze these data. Despite the usefulness of Bioconductor and of its packages, it is still difficult to read gene expression data from GEO, and to assign gene symbols to the probesets of datasets. To alleviate this problem, we introduce here a new R software package, geneExpressionFromGEO, which provides to the users the possibility to easily download gene expression data from GEO and to easily associate gene symbols to probesets. In this short chapter, we describe the assets of our software package, and we report an example of its usage. We believe that geneExpressionFromGEO can be very useful for the R community of bioinformaticians working on gene expression data.

摘要

基因表达谱是一种测量分子生物学中基因活性的有用方法,由于其有效性,研究人员已经在在线数据库和存储库(如基因表达综合数据库(GEO))中公开了数千个基因表达数据集。为了读取和分析基因表达数据,计算生物学界已经开发了几种工具和平台,包括 Bioconductor,这是一个 R 开源平台的软件包,可以用于分析这些数据。尽管 Bioconductor 和它的软件包非常有用,但仍然很难从 GEO 读取基因表达数据,并为数据集的探针集分配基因符号。为了解决这个问题,我们在这里介绍一个新的 R 软件包,geneExpressionFromGEO,它为用户提供了从 GEO 轻松下载基因表达数据并轻松将基因符号与探针集关联的可能性。在这一小节中,我们描述了我们软件包的资产,并报告了它的一个使用示例。我们相信 geneExpressionFromGEO 对于从事基因表达数据分析的 R 社区的生物信息学家非常有用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验