Suppr超能文献

将可变剪接检测整合到基因预测中。

Integrating alternative splicing detection into gene prediction.

作者信息

Foissac Sylvain, Schiex Thomas

机构信息

Unité de Biométrie et Intelligence Artificielle, INRA, 31326 Castanet Tolosan, France.

出版信息

BMC Bioinformatics. 2005 Feb 10;6:25. doi: 10.1186/1471-2105-6-25.

Abstract

BACKGROUND

Alternative splicing (AS) is now considered as a major actor in transcriptome/proteome diversity and it cannot be neglected in the annotation process of a new genome. Despite considerable progresses in term of accuracy in computational gene prediction, the ability to reliably predict AS variants when there is local experimental evidence of it remains an open challenge for gene finders.

RESULTS

We have used a new integrative approach that allows to incorporate AS detection into ab initio gene prediction. This method relies on the analysis of genomically aligned transcript sequences (ESTs and/or cDNAs), and has been implemented in the dynamic programming algorithm of the graph-based gene finder EuGENE. Given a genomic sequence and a set of aligned transcripts, this new version identifies the set of transcripts carrying evidence of alternative splicing events, and provides, in addition to the classical optimal gene prediction, alternative optimal predictions (among those which are consistent with the AS events detected). This allows for multiple annotations of a single gene in a way such that each predicted variant is supported by a transcript evidence (but not necessarily with a full-length coverage).

CONCLUSIONS

This automatic combination of experimental data analysis and ab initio gene finding offers an ideal integration of alternatively spliced gene prediction inside a single annotation pipeline.

摘要

背景

可变剪接(AS)如今被视为转录组/蛋白质组多样性的一个主要因素,在新基因组的注释过程中不容忽视。尽管在计算基因预测的准确性方面取得了相当大的进展,但当存在局部实验证据时,可靠预测AS变体的能力对基因发现工具来说仍是一个悬而未决的挑战。

结果

我们采用了一种新的综合方法,能够将AS检测纳入从头基因预测中。该方法依赖于对基因组比对的转录序列(EST和/或cDNA)进行分析,并已在基于图的基因发现工具EuGENE的动态规划算法中实现。给定一个基因组序列和一组比对的转录本,这个新版本会识别出携带可变剪接事件证据的转录本集合,并且除了经典的最优基因预测外,还会提供替代的最优预测(在与检测到的AS事件一致的那些预测中)。这允许以一种每个预测变体都有转录本证据支持(但不一定是全长覆盖)的方式对单个基因进行多种注释。

结论

这种实验数据分析与从头基因发现的自动结合,在单个注释流程中为可变剪接基因预测提供了理想的整合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a9ff/550657/550125bf4ab9/1471-2105-6-25-1.jpg

相似文献

1
Integrating alternative splicing detection into gene prediction.
BMC Bioinformatics. 2005 Feb 10;6:25. doi: 10.1186/1471-2105-6-25.
3
The Alternative Splicing Gallery (ASG): bridging the gap between genome and transcriptome.
Nucleic Acids Res. 2004 Aug 3;32(13):3977-83. doi: 10.1093/nar/gkh731. Print 2004.
4
JIGSAW: integration of multiple sources of evidence for gene prediction.
Bioinformatics. 2005 Sep 15;21(18):3596-603. doi: 10.1093/bioinformatics/bti609. Epub 2005 Aug 2.
5
ASmodeler: gene modeling of alternative splicing from genomic alignment of mRNA, EST and protein sequences.
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W181-6. doi: 10.1093/nar/gkh404.
6
Characterization of 954 bovine full-CDS cDNA sequences.
BMC Genomics. 2005 Nov 23;6:166. doi: 10.1186/1471-2164-6-166.
8
ECgene: genome annotation for alternative splicing.
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D75-9. doi: 10.1093/nar/gki118.
9
ASPIC: a web resource for alternative splicing prediction and transcript isoforms characterization.
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W440-3. doi: 10.1093/nar/gkl324.

引用本文的文献

1
Genome-wide comparative analyses of GATA transcription factors among seven Populus genomes.
Sci Rep. 2021 Aug 16;11(1):16578. doi: 10.1038/s41598-021-95940-5.
3
EuGène-maize: a web site for maize gene prediction.
Bioinformatics. 2010 May 1;26(9):1254-5. doi: 10.1093/bioinformatics/btq123. Epub 2010 Apr 16.
4
mGene: accurate SVM-based gene finding with an application to nematode genomes.
Genome Res. 2009 Nov;19(11):2133-43. doi: 10.1101/gr.090597.108. Epub 2009 Jun 29.
5
nGASP--the nematode genome annotation assessment project.
BMC Bioinformatics. 2008 Dec 19;9:549. doi: 10.1186/1471-2105-9-549.
6
Genomix: a method for combining gene-finders' predictions, which uses evolutionary conservation of sequence and intron-exon structure.
Bioinformatics. 2007 Jun 15;23(12):1468-75. doi: 10.1093/bioinformatics/btm133. Epub 2007 May 5.
8
Identification of alternative 5'/3' splice sites based on the mechanism of splice site competition.
Nucleic Acids Res. 2006;34(21):6305-13. doi: 10.1093/nar/gkl900. Epub 2006 Nov 10.
9
Using several pair-wise informant sequences for de novo prediction of alternatively spliced transcripts.
Genome Biol. 2006;7 Suppl 1(Suppl 1):S8.1-9. doi: 10.1186/gb-2006-7-s1-s8. Epub 2006 Aug 7.

本文引用的文献

1
ESTGenes: alternative splicing from ESTs in Ensembl.
Genome Res. 2004 May;14(5):976-87. doi: 10.1101/gr.1862204.
2
The Ensembl automatic gene annotation system.
Genome Res. 2004 May;14(5):942-50. doi: 10.1101/gr.1858004.
3
Gene structure prediction from consensus spliced alignment of multiple ESTs matching the same genomic locus.
Bioinformatics. 2004 May 1;20(7):1157-69. doi: 10.1093/bioinformatics/bth058. Epub 2004 Feb 5.
4
Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays.
Science. 2003 Dec 19;302(5653):2141-4. doi: 10.1126/science.1090100.
5
PlantGDB, plant genome database and analysis tools.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D354-9. doi: 10.1093/nar/gkh046.
6
EASED: Extended Alternatively Spliced EST Database.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D70-4. doi: 10.1093/nar/gkh136.
7
ASD: the Alternative Splicing Database.
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D64-9. doi: 10.1093/nar/gkh030.
8
HMM sampling and applications to gene finding and alternative splicing.
Bioinformatics. 2003 Oct;19 Suppl 2:ii36-41. doi: 10.1093/bioinformatics/btg1057.
9
Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies.
Nucleic Acids Res. 2003 Oct 1;31(19):5654-66. doi: 10.1093/nar/gkg770.
10
EUGENE'HOM: A generic similarity-based gene finder using multiple homologous sequences.
Nucleic Acids Res. 2003 Jul 1;31(13):3742-5. doi: 10.1093/nar/gkg586.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验