利用蛋白质多重序列比对进行新型混合基因预测方法。

A novel hybrid gene prediction method employing protein multiple sequence alignments.

机构信息

Institute of Computer Science, University of Göttingen, Goldschmidtstrasse 7, Greifswald, Germany.

出版信息

Bioinformatics. 2011 Mar 15;27(6):757-63. doi: 10.1093/bioinformatics/btr010. Epub 2011 Jan 6.

DOI:10.1093/bioinformatics/btr010

PMID:21216780

Abstract

MOTIVATION

As improved DNA sequencing techniques have increased enormously the speed of producing new eukaryotic genome assemblies, the further development of automated gene prediction methods continues to be essential. While the classification of proteins into families is a task heavily relying on correct gene predictions, it can at the same time provide a source of additional information for the prediction, complementary to those presently used.

RESULTS

We extended the gene prediction software AUGUSTUS by a method that employs block profiles generated from multiple sequence alignments as a protein signature to improve the accuracy of the prediction. Equipped with profiles modelling human dynein heavy chain (DHC) proteins and other families, AUGUSTUS was run on the genomic sequences known to contain members of these families. Compared with AUGUSTUS' ab initio version, the rate of genes predicted with high accuracy showed a dramatic increase.

AVAILABILITY

The AUGUSTUS project web page is located at http://augustus.gobics.de, with the executable program as well as the source code available for download.

摘要

动机

随着改进的 DNA 测序技术极大地提高了产生新真核基因组组装的速度，自动化基因预测方法的进一步发展仍然是必不可少的。虽然蛋白质的分类主要依赖于正确的基因预测，但它同时可以为预测提供额外的信息来源，与目前使用的信息来源互补。

结果

我们通过一种方法扩展了基因预测软件 AUGUSTUS，该方法使用来自多序列比对的块谱作为蛋白质特征，以提高预测的准确性。配备了模拟人类动力蛋白重链 (DHC) 蛋白和其他家族的谱，AUGUSTUS 被用于已知包含这些家族成员的基因组序列上。与 AUGUSTUS 的从头开始版本相比，高精度预测的基因率显示出了显著的增加。

可用性

AUGUSTUS 项目网页位于 http://augustus.gobics.de，可执行程序以及源代码均可下载。

相似文献

A novel hybrid gene prediction method employing protein multiple sequence alignments.利用蛋白质多重序列比对进行新型混合基因预测方法。

Bioinformatics. 2011 Mar 15;27(6):757-63. doi: 10.1093/bioinformatics/btr010. Epub 2011 Jan 6.

AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome.EGASP中的AUGUSTUS：利用EST、蛋白质和基因组比对改进人类基因组中的基因预测

Genome Biol. 2006;7 Suppl 1(Suppl 1):S11.1-8. doi: 10.1186/gb-2006-7-s1-s11. Epub 2006 Aug 7.

PROMALS web server for accurate multiple protein sequence alignments.用于精确多蛋白序列比对的PROMALS网络服务器。

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W649-52. doi: 10.1093/nar/gkm227. Epub 2007 Apr 22.

Gene prediction with a hidden Markov model and a new intron submodel.基于隐马尔可夫模型和新型内含子子模型的基因预测

Bioinformatics. 2003 Oct;19 Suppl 2:ii215-25. doi: 10.1093/bioinformatics/btg1080.

transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.transAlign：利用氨基酸促进蛋白质编码DNA序列的多重比对。

BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

AUGUSTUS: ab initio prediction of alternative transcripts.奥古斯塔斯：可变转录本的从头预测。

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W435-9. doi: 10.1093/nar/gkl200.

AUGUSTUS: a web server for gene finding in eukaryotes.奥古斯塔斯：用于真核生物基因发现的网络服务器。

Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W309-12. doi: 10.1093/nar/gkh379.

Concerted action of the new Genomic Peptide Finder and AUGUSTUS allows for automated proteogenomic annotation of the Chlamydomonas reinhardtii genome.新型基因组肽发现器与 AUGUSTUS 的协同作用可实现莱茵衣藻基因组的自动化蛋白基因组注释。

Proteomics. 2011 May;11(9):1814-23. doi: 10.1002/pmic.201000621. Epub 2011 Mar 22.

DOMAC: an accurate, hybrid protein domain prediction server.DOMAC：一个准确的混合蛋白质结构域预测服务器。

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W354-6. doi: 10.1093/nar/gkm390. Epub 2007 Jun 6.

引用本文的文献

Chromosome-scale genome assembly and gene annotation of the hydrothermal vent annelid Alvinella pompejana yield insight into animal evolution in extreme environments.热液喷口环节动物庞贝蠕虫的染色体水平基因组组装和基因注释为极端环境中的动物进化提供了见解。

BMC Biol. 2025 Sep 2;23(1):274. doi: 10.1186/s12915-025-02369-7.

Enhancing local meiotic crossovers in Arabidopsis and maize through juxtaposition of heterozygous and homozygous regions.通过杂合区域和纯合区域并置增强拟南芥和玉米中的局部减数分裂交叉。

Nat Plants. 2025 Sep 2. doi: 10.1038/s41477-025-02085-8.

Analysis wheat wild relatives Thinopyrum intermedium and Roegneria kamoji genomes reveal different polyploid evolution paths.对小麦野生近缘种中间偃麦草和鹅观草基因组的分析揭示了不同的多倍体进化路径。

Nat Commun. 2025 Aug 18;16(1):7693. doi: 10.1038/s41467-025-63007-y.

Genomic selection for growth and wood properties in multi-generation hybrid populations of ..多代杂交群体中生长和木材特性的基因组选择

Hortic Res. 2025 Jun 25;12(9):uhaf165. doi: 10.1093/hr/uhaf165. eCollection 2025 Sep.

Phased genome assemblies and pangenome graphs of human populations of Japan and Saudi Arabia.日本和沙特阿拉伯人群的阶段性基因组组装和泛基因组图谱。

Sci Data. 2025 Aug 12;12(1):1316. doi: 10.1038/s41597-025-05652-y.

Genome sequences of six clinical isolates of exhibiting different degrees and temporal regulation of biofilm formation.六个临床分离株的基因组序列，这些分离株表现出不同程度的生物膜形成及其时间调控。

Microbiol Resour Announc. 2025 Sep 11;14(9):e0130024. doi: 10.1128/mra.01300-24. Epub 2025 Aug 11.

Analysis of metabolomics and transcriptomics data to assess interactions in microalgal co-culture of Skeletonema marinoi and Prymnesium parvum.分析代谢组学和转录组学数据以评估海洋骨条藻和微小原甲藻微藻共培养中的相互作用。

PLoS One. 2025 Jul 28;20(7):e0329115. doi: 10.1371/journal.pone.0329115. eCollection 2025.

Starship giant transposons dominate plastic genomic regions in a fungal plant pathogen and drive virulence evolution.星舰巨型转座子在一种真菌植物病原体中主导可塑性基因组区域并推动毒力进化。

Nat Commun. 2025 Jul 24;16(1):6806. doi: 10.1038/s41467-025-61986-6.

Molecular basis for the biosynthesis of the siderophore coprogen in the cheese-ripening fungus Penicillium roqueforti.干酪成熟真菌罗克福青霉中铁载体粪产碱菌素生物合成的分子基础。

Biol Res. 2025 Jul 21;58(1):51. doi: 10.1186/s40659-025-00633-2.

Long-read microbial genome assembly, gene prediction and functional annotation: a service of the MIRRI ERIC Italian node.长读长微生物基因组组装、基因预测和功能注释：MIRRI ERIC意大利节点的一项服务。

Front Bioinform. 2025 Jun 30;5:1632189. doi: 10.3389/fbinf.2025.1632189. eCollection 2025.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用蛋白质多重序列比对进行新型混合基因预测方法。

A novel hybrid gene prediction method employing protein multiple sequence alignments.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献