The Thoracic Surgery Oncology Laboratory and Division of Thoracic Surgery, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
BMC Med Genet. 2009 Dec 31;10:149. doi: 10.1186/1471-2350-10-149.
Analyses of Expressed Sequence Tags (ESTs) databases suggest that most human genes have multiple alternative splice variants. The alternative splicing of pre-mRNA is tightly regulated during development and in different tissue types. Changes in splicing patterns have been described in disease states. Recently, we used whole-transcriptome shotgun pryrosequencing to characterize 4 malignant pleural mesothelioma (MPM) tumors, 1 lung adenocarcinoma and 1 normal lung. We hypothesized that alternative splicing profiles might be detected in the sequencing data for the expressed genes in these samples.
We developed a software pipeline to map the transcriptome read sequences of the 4 MPM samples and 1 normal lung sample onto known exon junction sequences in the comprehensive AceView database of expressed sequences and to count how many reads map to each junction. 13,274,187 transcriptome reads generated by the Roche/454 sequencing platform for 5 samples were compared with 151,486 exon junctions from the AceView database. The exon junction expression index (EJEI) was calculated for each exon junction in each sample to measure the differential expression of alternative splicing events. Top ten exon junctions with the largest EJEI difference between the 4 mesothelioma and the normal lung sample were then examined for differential expression using Quantitative Real Time PCR (qRT-PCR) in the 5 sequenced samples. Two of the differentially expressed exon junctions (ACTG2.aAug05 and CDK4.aAug05) were further examined with qRT-PCR in additional 18 MPM and 18 normal lung specimens.
We found 70,953 exon junctions covered by at least one sequence read in at least one of the 5 samples. All 10 identified most differentially expressed exon junctions were validated as present by RT-PCR, and 8 were differentially expressed exactly as predicted by the sequence analysis. The differential expression of the AceView exon junctions for the ACTG2 and CDK4 genes were also observed to be statistically significant in an additional 18 MPM and 18 normal lung samples examined using qRT-PCR. The differential expression of these two junctions was shown to successfully classify these mesothelioma and normal lung specimens with high sensitivity (89% and 78%, respectively).
Whole-transcriptome shotgun sequencing, combined with a downstream bioinformatics pipeline, provides powerful tools for the identification of differentially expressed exon junctions resulting from alternative splice variants. The alternatively spliced genes discovered in the study could serve as useful diagnostic markers as well as potential therapeutic targets for MPM.
分析表达序列标签 (EST) 数据库表明,大多数人类基因都有多个选择性剪接变体。前体 mRNA 的选择性剪接在发育过程中和不同组织类型中受到严格调控。在疾病状态下已经描述了剪接模式的变化。最近,我们使用全转录组 shotgun 焦磷酸测序来描述 4 个恶性胸膜间皮瘤 (MPM) 肿瘤、1 个肺腺癌和 1 个正常肺。我们假设在这些样本的表达基因的测序数据中可能检测到选择性剪接谱。
我们开发了一个软件管道,将 4 个 MPM 样本和 1 个正常肺样本的转录组读序列映射到综合 AceView 表达序列数据库中的已知外显子连接序列,并计算有多少读序列映射到每个连接。罗氏/454 测序平台为 5 个样本生成的 13274187 个转录组读序列与 AceView 数据库中的 151486 个外显子连接进行了比较。为了测量选择性剪接事件的差异表达,我们计算了每个样本中每个外显子连接的外显子连接表达指数 (EJEI)。然后,在 4 个间皮瘤和正常肺样本之间,对 EJEI 差异最大的前 10 个外显子连接进行差异表达分析,使用 5 个测序样本中的定量实时 PCR (qRT-PCR)。在另外 18 个 MPM 和 18 个正常肺标本中,进一步用 qRT-PCR 检查了两个差异表达的外显子连接 (ACTG2.aAug05 和 CDK4.aAug05)。
我们发现至少有一个序列读取至少在 5 个样本中的一个样本中覆盖了 70953 个外显子连接。通过 RT-PCR 验证了所有 10 个鉴定出的最具差异表达的外显子连接均存在,并且 8 个外显子连接的差异表达完全如序列分析所预测的那样。在使用 qRT-PCR 进一步检查的另外 18 个 MPM 和 18 个正常肺样本中,ACTG2 和 CDK4 基因的 AceView 外显子连接的差异表达也被证明具有统计学意义。这两个连接的差异表达成功地以高灵敏度(分别为 89%和 78%)对这些间皮瘤和正常肺标本进行了分类。
全转录组 shotgun 测序,结合下游生物信息学管道,为鉴定由于选择性剪接变体而导致的差异表达外显子连接提供了强大的工具。该研究中发现的选择性剪接基因可以作为有用的诊断标记物,也可以作为 MPM 的潜在治疗靶点。