Integrative Biology Center of Excellence, Pfizer Worldwide Research and Development, Cambridge, Massachusetts 02139, USA.
Early Clinical Development, Pfizer Worldwide Research and Development, Cambridge, Massachusetts 02139, USA.
RNA. 2020 Aug;26(8):903-909. doi: 10.1261/rna.074922.120. Epub 2020 Apr 13.
In recent years, RNA-sequencing (RNA-seq) has emerged as a powerful technology for transcriptome profiling. For a given gene, the number of mapped reads is not only dependent on its expression level and gene length, but also the sequencing depth. To normalize these dependencies, RPKM (reads per kilobase of transcript per million reads mapped) and TPM (transcripts per million) are used to measure gene or transcript expression levels. A common misconception is that RPKM and TPM values are already normalized, and thus should be comparable across samples or RNA-seq projects. However, RPKM and TPM represent the relative abundance of a transcript among a population of sequenced transcripts, and therefore depend on the composition of the RNA population in a sample. Quite often, it is reasonable to assume that total RNA concentration and distributions are very close across compared samples. Nevertheless, the sequenced RNA repertoires may differ significantly under different experimental conditions and/or across sequencing protocols; thus, the proportion of gene expression is not directly comparable in such cases. In this review, we illustrate typical scenarios in which RPKM and TPM are misused, unintentionally, and hope to raise scientists' awareness of this issue when comparing them across samples or different sequencing protocols.
近年来,RNA 测序(RNA-seq)已成为一种强大的转录组分析技术。对于给定的基因,映射读数的数量不仅取决于其表达水平和基因长度,还取决于测序深度。为了标准化这些依赖关系,使用 RPKM(每百万映射读段中每千碱基转录物的读段)和 TPM(每百万转录物的读段)来测量基因或转录物的表达水平。一个常见的误解是,RPKM 和 TPM 值已经过标准化,因此应该在样本或 RNA-seq 项目之间具有可比性。然而,RPKM 和 TPM 代表测序转录本群体中特定转录本的相对丰度,因此取决于样本中 RNA 群体的组成。通常,在比较样本中,总 RNA 浓度和分布非常接近。然而,在不同的实验条件下和/或不同的测序方案下,测序 RNA 库可能会有很大差异;因此,在这种情况下,基因表达的比例不能直接比较。在这篇综述中,我们说明了 RPKM 和 TPM 被误用的典型情况,希望在比较样本或不同测序方案时引起科学家对这一问题的关注。