Suppr超能文献

基因组合分析:一种整合基因预测结果的方法,该方法利用序列和内含子-外显子结构的进化保守性。

Genomix: a method for combining gene-finders' predictions, which uses evolutionary conservation of sequence and intron-exon structure.

作者信息

Coghlan Avril, Durbin Richard

机构信息

Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.

出版信息

Bioinformatics. 2007 Jun 15;23(12):1468-75. doi: 10.1093/bioinformatics/btm133. Epub 2007 May 5.

Abstract

MOTIVATION

Correct gene predictions are crucial for most analyses of genomes. However, in the absence of transcript data, gene prediction is still challenging. One way to improve gene-finding accuracy in such genomes is to combine the exons predicted by several gene-finders, so that gene-finders that make uncorrelated errors can correct each other.

RESULTS

We present a method for combining gene-finders called Genomix. Genomix selects the predicted exons that are best conserved within and/or between species in terms of sequence and intron-exon structure, and combines them into a gene structure. Genomix was used to combine predictions from four gene-finders for Caenorhabditis elegans, by selecting the predicted exons that are best conserved with C.briggsae and C.remanei. On a set of approximately 1500 confirmed C.elegans genes, Genomix increased the exon-level specificity by 10.1% and sensitivity by 2.7% compared to the best input gene-finder.

AVAILABILITY

Scripts and Supplementary Material can be found at http://www.sanger.ac.uk/Software/analysis/genomix

摘要

动机

正确的基因预测对于大多数基因组分析至关重要。然而,在缺乏转录本数据的情况下,基因预测仍然具有挑战性。提高此类基因组中基因发现准确性的一种方法是将多个基因预测工具预测的外显子进行组合,这样做出不相关错误的基因预测工具可以相互校正。

结果

我们提出了一种名为Genomix的基因预测工具组合方法。Genomix根据序列和内含子-外显子结构选择在物种内部和/或物种之间保守性最佳的预测外显子,并将它们组合成一个基因结构。通过选择与秀丽隐杆线虫、布氏秀丽线虫和雷曼氏秀丽线虫保守性最佳的预测外显子,Genomix被用于组合来自四种基因预测工具对秀丽隐杆线虫的预测。在一组约1500个已确认的秀丽隐杆线虫基因上,与最佳的输入基因预测工具相比,Genomix将外显子水平的特异性提高了10.1%,敏感性提高了2.7%。

可用性

脚本和补充材料可在http://www.sanger.ac.uk/Software/analysis/genomix获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6e9a/2880447/b5a4072b6e30/ukmss-27365-f0001.jpg

相似文献

2
nGASP--the nematode genome annotation assessment project.线虫基因组注释评估项目(nGASP)
BMC Bioinformatics. 2008 Dec 19;9:549. doi: 10.1186/1471-2105-9-549.
7
RNA-seq analysis of the C. briggsae transcriptome.秀丽隐杆线虫转录组的 RNA-seq 分析。
Genome Res. 2012 Aug;22(8):1567-80. doi: 10.1101/gr.134601.111. Epub 2012 Jul 6.
8
RASE: recognition of alternatively spliced exons in C.elegans.RASE:秀丽隐杆线虫中可变剪接外显子的识别
Bioinformatics. 2005 Jun;21 Suppl 1:i369-77. doi: 10.1093/bioinformatics/bti1053.
10
ExonHunter: a comprehensive approach to gene finding.外显子猎手:一种全面的基因发现方法。
Bioinformatics. 2005 Jun;21 Suppl 1:i57-65. doi: 10.1093/bioinformatics/bti1040.

本文引用的文献

1
Creating a honey bee consensus gene set.创建一个蜜蜂共有基因集。
Genome Biol. 2007;8(1):R13. doi: 10.1186/gb-2007-8-1-r13.
3
EGASP: the human ENCODE Genome Annotation Assessment Project.EGASP:人类ENCODE基因组注释评估项目。
Genome Biol. 2006;7 Suppl 1(Suppl 1):S2.1-31. doi: 10.1186/gb-2006-7-s1-s2. Epub 2006 Aug 7.
6
WormBase: better software, richer content.WormBase:更优质的软件,更丰富的内容。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D475-8. doi: 10.1093/nar/gkj061.
10
Gene finding in novel genomes.新基因组中的基因发现。
BMC Bioinformatics. 2004 May 14;5:59. doi: 10.1186/1471-2105-5-59.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验