Laboratoire de biologie et de génétique du cancer, Département de Biopathologie, Centre François Baclesse, Caen, 14000, France.
Cancer and Brain Genomics, FHU G4 Genomics, Inserm U1245, Normandie University, Rouen, 76183, France.
BMC Genomics. 2024 Sep 30;25(1):909. doi: 10.1186/s12864-024-10741-0.
Solving the structure of mRNA transcripts is a major challenge for both research and molecular diagnostic purposes. Current approaches based on short-read RNA sequencing and RT-PCR techniques cannot fully explore the complexity of transcript structure. The emergence of third-generation long-read sequencing addresses this problem by solving this sequence directly. However, genes with low expression levels are difficult to study with the whole transcriptome sequencing approach. To fix this technical limitation, we propose a novel method to capture transcripts of a gene panel using a targeted enrichment approach suitable for Pacific Biosciences and Oxford Nanopore Technologies platforms.
We designed a set of probes to capture transcripts of a panel of genes involved in hereditary breast and ovarian cancer syndrome. We present SOSTAR (iSofOrmS annoTAtoR), a versatile pipeline to assemble, quantify and annotate isoforms from long read sequencing using a new tool specially designed for this application. The significant enrichment of transcripts by our capture protocol, together with the SOSTAR annotation, allowed the identification of 1,231 unique transcripts within the gene panel from the eight patients sequenced. The structure of these transcripts was annotated with a resolution of one base relative to a reference transcript. All major alternative splicing events of the BRCA1 and BRCA2 genes described in the literature were found. Complex splicing events such as pseudoexons were correctly annotated. SOSTAR enabled the identification of abnormal transcripts in the positive controls. In addition, a case of unexplained inheritance in a family with a history of breast and ovarian cancer was solved by identifying an SVA retrotransposon in intron 13 of the BRCA1 gene.
We have validated a new protocol for the enrichment of transcripts of interest using probes adapted to the ONT and PacBio platforms. This protocol allows a complete description of the alternative structures of transcripts, the estimation of their expression and the identification of aberrant transcripts in a single experiment. This proof-of-concept opens new possibilities for RNA structure exploration in both research and molecular diagnostics.
解决 mRNA 转录本的结构是研究和分子诊断的主要挑战。目前基于短读长 RNA 测序和 RT-PCR 技术的方法无法充分探索转录本结构的复杂性。第三代长读长测序的出现通过直接解决该序列来解决这个问题。然而,低表达水平的基因很难通过全转录组测序方法进行研究。为了解决这个技术限制,我们提出了一种使用适用于 Pacific Biosciences 和 Oxford Nanopore Technologies 平台的靶向富集方法来捕获基因panel 转录本的新方法。
我们设计了一组探针来捕获遗传性乳腺癌和卵巢癌综合征相关基因panel 的转录本。我们提出了 SOSTAR(iSofOrmS annoTAtoR),这是一种灵活的流水线,使用专门为此应用设计的新工具从长读长测序中组装、定量和注释异构体。我们的捕获方案显著富集了转录本,加上 SOSTAR 注释,从测序的 8 位患者中鉴定出基因panel 中 1231 个独特的转录本。这些转录本的结构以相对于参考转录本的一个碱基的分辨率进行注释。文献中描述的 BRCA1 和 BRCA2 基因的所有主要选择性剪接事件都被发现。正确注释了复杂的剪接事件,如假外显子。SOSTAR 能够在阳性对照中识别异常转录本。此外,通过在 BRCA1 基因的内含子 13 中鉴定 SVA 反转录转座子,解决了一个家族中具有乳腺癌和卵巢癌病史的未解释遗传病例。
我们已经验证了一种使用适应 ONT 和 PacBio 平台的探针富集感兴趣转录本的新方案。该方案允许在单个实验中完整描述转录本的替代结构、估计其表达并识别异常转录本。这一概念验证为研究和分子诊断中的 RNA 结构探索开辟了新的可能性。