Ritchie Matthew E, Phipson Belinda, Wu Di, Hu Yifang, Law Charity W, Shi Wei, Smyth Gordon K
Molecular Medicine Division, The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3052, Australia Department of Mathematics and Statistics, The University of Melbourne, Parkville, Victoria 3010, Australia.
Murdoch Childrens Research Institute, Royal Children's Hospital, 50 Flemington Road, Parkville, Victoria 3052, Australia.
Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.
limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
limma是一个R/Bioconductor软件包,它为分析基因表达实验数据提供了一个集成解决方案。它具有丰富的功能,可处理复杂的实验设计并通过信息借用克服小样本量问题。在过去十年中,limma一直是通过对微阵列和高通量PCR数据进行差异表达分析来发现基因的热门选择。该软件包在读取、标准化和探索此类数据方面具有特别强大的功能。最近,limma的功能在两个重要方向上得到了显著扩展。首先,该软件包现在可以对RNA测序(RNA-seq)数据进行差异表达和差异剪接分析。以前仅限于微阵列数据的所有下游分析工具现在也可用于RNA-seq。这些功能使用户能够使用非常相似的流程分析RNA-seq和微阵列数据。其次,该软件包现在能够以多种方式超越传统的基因水平表达分析,从共调控的基因集或更高阶的表达特征方面分析表达谱。这为基因表达差异的生物学解释提供了更多可能性。本文回顾了limma软件包的理念和设计,总结了新的和历史的功能,重点介绍了最近的增强功能和以前未描述过的功能。