Nygard Ann-Britt, Cirera Susanna, Gilchrist Michael J, Gorodkin Jan, Jørgensen Claus B, Fredholm Merete
University of Copenhagen, Faculty of Life Sciences, Department of Basic Animal and Veterinary Sciences, Division of Genetics and Bioinformatics, Groennegaardsvej 3, 1870 Frederiksberg C, Denmark.
BMC Res Notes. 2010 May 5;3:123. doi: 10.1186/1756-0500-3-123.
Since at least half of the genes in mammalian genomes are subjected to alternative splicing, alternative pre-mRNA splicing plays an important contribution to the complexity of the mammalian proteome. Expressed sequence tags (ESTs) provide evidence of a great number of possible alternative isoforms. With the EST resource for the domestic pig now containing more than one million porcine ESTs, it is possible to identify alternative splice forms of the individual transcripts in this species from the EST data with some confidence.
The pig EST data generated by the Sino-Danish Pig Genome project has been assembled with publicly available ESTs and made available in the PigEST database. Using the Distiller package 2,515 EST clusters with candidate alternative isoforms were identified in the EST data with high confidence. In agreement with general observations in human and mouse, we find putative splice variants in about 30% of the contigs with more than 50 ESTs. Based on the criteria that a minimum of two EST sequences confirmed each splice event, a list of 100 genes with the most distinct tissue-specific alternative splice events was generated from the list of candidates. To confirm the tissue specificity of the splice events, 10 genes with functional annotation were randomly selected from which 16 individual splice events were chosen for experimental verification by quantitative PCR (qPCR). Six genes were shown to have tissue specific alternatively spliced transcripts with expression patterns matching those of the EST data. The remaining four genes had tissue-restricted expression of alternative spliced transcripts. Five out of the 16 splice events that were experimentally verified were found to be putative pig specific.
In accordance with human and rodent studies we estimate that approximately 30% of the porcine genes undergo alternative splicing. We found a good correlation between EST predicted tissue-specificity and experimentally validated splice events in different porcine tissue. This study indicates that a cluster size of around 50 ESTs is optimal for in silico detection of alternative splicing. Although based on a limited number of splice events, the study supports the notion that alternative splicing could have an important impact on species differentiation since 31% of the splice events studied appears to be species specific.
由于哺乳动物基因组中至少一半的基因会发生可变剪接,因此可变前体mRNA剪接对哺乳动物蛋白质组的复杂性有重要贡献。表达序列标签(EST)为大量可能的可变异构体提供了证据。目前,家猪的EST资源包含超过一百万个猪EST,因此有信心从EST数据中识别该物种中单个转录本的可变剪接形式。
中丹猪基因组计划产生的猪EST数据已与公开可用的EST进行组装,并在PigEST数据库中提供。使用Distiller软件包,在EST数据中高置信度地鉴定出2515个具有候选可变异构体的EST簇。与在人类和小鼠中的一般观察结果一致,我们在约30%的具有超过50个EST的重叠群中发现了推定的剪接变体。根据每个剪接事件至少有两个EST序列确认的标准,从候选列表中生成了100个具有最明显组织特异性可变剪接事件的基因列表。为了确认剪接事件的组织特异性,从具有功能注释的基因中随机选择10个,从中选择16个单独的剪接事件通过定量PCR(qPCR)进行实验验证。六个基因显示具有组织特异性可变剪接转录本,其表达模式与EST数据匹配。其余四个基因具有可变剪接转录本的组织限制性表达。在实验验证的16个剪接事件中,有5个被发现是推定的猪特异性剪接事件。
与人类和啮齿动物研究一致,我们估计约30%的猪基因会发生可变剪接。我们发现EST预测的组织特异性与不同猪组织中实验验证的剪接事件之间有良好的相关性。这项研究表明,约50个EST的簇大小最适合于通过计算机检测可变剪接。尽管基于有限数量的剪接事件,但该研究支持可变剪接可能对物种分化有重要影响的观点,因为所研究的剪接事件中有31%似乎是物种特异性的。