Vukmirovic Milica, Herazo-Maya Jose D, Blackmon John, Skodric-Trifunovic Vesna, Jovanovic Dragana, Pavlovic Sonja, Stojsic Jelena, Zeljkovic Vesna, Yan Xiting, Homer Robert, Stefanovic Branko, Kaminski Naftali
Section of Pulmonary, Critical Care and Sleep Medicine, Yale University School of Medicine, New Haven, CT, USA.
Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, FL, USA.
BMC Pulm Med. 2017 Jan 12;17(1):15. doi: 10.1186/s12890-016-0356-4.
Idiopathic Pulmonary Fibrosis (IPF) is a lethal lung disease of unknown etiology. A major limitation in transcriptomic profiling of lung tissue in IPF has been a dependence on snap-frozen fresh tissues (FF). In this project we sought to determine whether genome scale transcript profiling using RNA Sequencing (RNA-Seq) could be applied to archived Formalin-Fixed Paraffin-Embedded (FFPE) IPF tissues.
We isolated total RNA from 7 IPF and 5 control FFPE lung tissues and performed 50 base pair paired-end sequencing on Illumina 2000 HiSeq. TopHat2 was used to map sequencing reads to the human genome. On average ~62 million reads (53.4% of ~116 million reads) were mapped per sample. 4,131 genes were differentially expressed between IPF and controls (1,920 increased and 2,211 decreased (FDR < 0.05). We compared our results to differentially expressed genes calculated from a previously published dataset generated from FF tissues analyzed on Agilent microarrays (GSE47460). The overlap of differentially expressed genes was very high (760 increased and 1,413 decreased, FDR < 0.05). Only 92 differentially expressed genes changed in opposite directions. Pathway enrichment analysis performed using MetaCore confirmed numerous IPF relevant genes and pathways including extracellular remodeling, TGF-beta, and WNT. Gene network analysis of MMP7, a highly differentially expressed gene in both datasets, revealed the same canonical pathways and gene network candidates in RNA-Seq and microarray data. For validation by NanoString nCounter® we selected 35 genes that had a fold change of 2 in at least one dataset (10 discordant, 10 significantly differentially expressed in one dataset only and 15 concordant genes). High concordance of fold change and FDR was observed for each type of the samples (FF vs FFPE) with both microarrays (r = 0.92) and RNA-Seq (r = 0.90) and the number of discordant genes was reduced to four.
Our results demonstrate that RNA sequencing of RNA obtained from archived FFPE lung tissues is feasible. The results obtained from FFPE tissue are highly comparable to FF tissues. The ability to perform RNA-Seq on archived FFPE IPF tissues should greatly enhance the availability of tissue biopsies for research in IPF.
特发性肺纤维化(IPF)是一种病因不明的致命性肺部疾病。IPF肺组织转录组分析的一个主要限制是依赖于速冻新鲜组织(FF)。在本项目中,我们试图确定使用RNA测序(RNA-Seq)进行基因组规模转录谱分析是否可应用于存档的福尔马林固定石蜡包埋(FFPE)IPF组织。
我们从7个IPF和5个对照FFPE肺组织中分离出总RNA,并在Illumina 2000 HiSeq上进行了50碱基对的双末端测序。使用TopHat2将测序读数映射到人类基因组。每个样本平均映射了约6200万个读数(约1.16亿个读数的53.4%)。IPF与对照之间有4131个基因差异表达(1920个增加,2211个减少(FDR<0.05)。我们将我们的结果与从先前发表的数据集计算出的差异表达基因进行了比较,该数据集由在安捷伦微阵列上分析的FF组织生成(GSE47460)。差异表达基因的重叠率非常高(760个增加,1413个减少,FDR<0.05)。只有92个差异表达基因的变化方向相反。使用MetaCore进行的通路富集分析证实了许多与IPF相关的基因和通路,包括细胞外重塑、TGF-β和WNT。对两个数据集中高度差异表达的基因MMP7进行基因网络分析,揭示了RNA-Seq和微阵列数据中相同的经典通路和基因网络候选物。为了通过NanoString nCounter®进行验证,我们选择了35个在至少一个数据集中变化倍数为2的基因(10个不一致,10个仅在一个数据集中显著差异表达,15个一致基因)。在微阵列(r=0.92)和RNA-Seq(r=0.90)中,每种类型的样本(FF与FFPE)的变化倍数和FDR都具有高度一致性,不一致基因的数量减少到了4个。
我们的结果表明,从存档的FFPE肺组织中获得的RNA进行RNA测序是可行的。从FFPE组织获得的结果与FF组织高度可比。对存档的FFPE IPF组织进行RNA-Seq的能力应大大提高IPF研究中组织活检的可用性。