• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用自组织映射进行基因预测:多个基因模型的自动生成

Gene prediction using the Self-Organizing Map: automatic generation of multiple gene models.

作者信息

Mahony Shaun, McInerney James O, Smith Terry J, Golden Aaron

机构信息

National Centre for Biomedical Engineering Science, NUI, Galway, Galway, Ireland.

出版信息

BMC Bioinformatics. 2004 Mar 5;5:23. doi: 10.1186/1471-2105-5-23.

DOI:10.1186/1471-2105-5-23
PMID:15070404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC385221/
Abstract

BACKGROUND

Many current gene prediction methods use only one model to represent protein-coding regions in a genome, and so are less likely to predict the location of genes that have an atypical sequence composition. It is likely that future improvements in gene finding will involve the development of methods that can adequately deal with intra-genomic compositional variation.

RESULTS

This work explores a new approach to gene-prediction, based on the Self-Organizing Map, which has the ability to automatically identify multiple gene models within a genome. The current implementation, named RescueNet, uses relative synonymous codon usage as the indicator of protein-coding potential.

CONCLUSIONS

While its raw accuracy rate can be less than other methods, RescueNet consistently identifies some genes that other methods do not, and should therefore be of interest to gene-prediction software developers and genome annotation teams alike. RescueNet is recommended for use in conjunction with, or as a complement to, other gene prediction methods.

摘要

背景

许多当前的基因预测方法仅使用一种模型来表示基因组中的蛋白质编码区域,因此不太可能预测具有非典型序列组成的基因的位置。未来基因发现的改进可能会涉及开发能够充分处理基因组内组成变异的方法。

结果

这项工作探索了一种基于自组织映射的新的基因预测方法,该方法能够自动识别基因组内的多个基因模型。当前的实现版本名为RescueNet,它使用相对同义密码子使用情况作为蛋白质编码潜力的指标。

结论

虽然其原始准确率可能低于其他方法,但RescueNet始终能识别出其他方法无法识别的一些基因,因此基因预测软件开发人员和基因组注释团队都应该对它感兴趣。建议将RescueNet与其他基因预测方法结合使用或作为其补充。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e884/385221/a35d5cdf3159/1471-2105-5-23-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e884/385221/0f9d2d5cd8d4/1471-2105-5-23-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e884/385221/a35d5cdf3159/1471-2105-5-23-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e884/385221/0f9d2d5cd8d4/1471-2105-5-23-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e884/385221/a35d5cdf3159/1471-2105-5-23-2.jpg

相似文献

1
Gene prediction using the Self-Organizing Map: automatic generation of multiple gene models.使用自组织映射进行基因预测:多个基因模型的自动生成
BMC Bioinformatics. 2004 Mar 5;5:23. doi: 10.1186/1471-2105-5-23.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
JIGSAW: integration of multiple sources of evidence for gene prediction.拼图:用于基因预测的多源证据整合
Bioinformatics. 2005 Sep 15;21(18):3596-603. doi: 10.1093/bioinformatics/bti609. Epub 2005 Aug 2.
4
Re-annotation of genome microbial coding-sequences: finding new genes and inaccurately annotated genes.基因组微生物编码序列的重新注释:发现新基因和注释不准确的基因。
BMC Bioinformatics. 2002;3:5. doi: 10.1186/1471-2105-3-5. Epub 2002 Feb 5.
5
SIGI: score-based identification of genomic islands.SIGI:基于分数的基因组岛识别
BMC Bioinformatics. 2004 Mar 3;5:22. doi: 10.1186/1471-2105-5-22.
6
Quantitative relationship between synonymous codon usage bias and GC composition across unicellular genomes.单细胞基因组中同义密码子使用偏好与GC含量之间的定量关系。
BMC Evol Biol. 2004 Jun 28;4:19. doi: 10.1186/1471-2148-4-19.
7
Temperature adaptation of synonymous codon usage in different functional categories of genes: a comparative study between homologous genes of Methanococcus jannaschii and Methanococcus maripaludis.不同功能类别的基因中同义密码子使用的温度适应性:詹氏甲烷球菌和沼泽甲烷球菌同源基因之间的比较研究
FEBS Lett. 2006 Jul 10;580(16):3895-9. doi: 10.1016/j.febslet.2006.06.014. Epub 2006 Jun 16.
8
Identification of circular codes in bacterial genomes and their use in a factorization method for retrieving the reading frames of genes.细菌基因组中环状编码的鉴定及其在检索基因阅读框的分解方法中的应用。
Comput Biol Chem. 2006 Apr;30(2):87-101. doi: 10.1016/j.compbiolchem.2005.11.001. Epub 2006 Jan 24.
9
GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.GeneMarkS:一种用于预测微生物基因组中基因起始位点的自训练方法。对在调控区域中寻找序列基序的启示。
Nucleic Acids Res. 2001 Jun 15;29(12):2607-18. doi: 10.1093/nar/29.12.2607.
10
nWayComp: a genome-wide sequence comparison tool for multiple strains/species of phylogenetically related microorganisms.nWayComp:一种用于系统发育相关微生物的多个菌株/物种的全基因组序列比较工具。
In Silico Biol. 2007;7(2):195-200.

引用本文的文献

1
MiRNATIP: a SOM-based miRNA-target interactions predictor.MiRNATIP:一种基于自组织映射的微小RNA-靶标相互作用预测工具
BMC Bioinformatics. 2016 Sep 22;17(Suppl 11):321. doi: 10.1186/s12859-016-1171-x.
2
Classification and clustering analysis of pyruvate dehydrogenase enzyme based on their physicochemical properties.基于丙酮酸脱氢酶理化性质的分类与聚类分析
Bioinformation. 2010 Apr 30;4(10):456-62. doi: 10.6026/97320630004456.
3
Genome signatures, self-organizing maps and higher order phylogenies: a parametric analysis.基因组特征、自组织映射和高阶系统发育:参数分析。

本文引用的文献

1
Informatics for unveiling hidden genome signatures.用于揭示隐藏基因组特征的信息学。
Genome Res. 2003 Apr;13(4):693-702. doi: 10.1101/gr.634603.
2
Evolution of synonymous codon usage in metazoans.后生动物中同义密码子使用情况的演变。
Curr Opin Genet Dev. 2002 Dec;12(6):640-9. doi: 10.1016/s0959-437x(02)00353-2.
3
Current methods of gene prediction, their strengths and weaknesses.当前的基因预测方法、其优势与不足。
Evol Bioinform Online. 2007 Sep 17;3:211-36.
4
GISMO--gene identification using a support vector machine for ORF classification.GISMO——使用支持向量机进行开放阅读框分类的基因识别方法
Nucleic Acids Res. 2007;35(2):540-9. doi: 10.1093/nar/gkl1083. Epub 2006 Dec 14.
Nucleic Acids Res. 2002 Oct 1;30(19):4103-17. doi: 10.1093/nar/gkf543.
4
Improving gene recognition accuracy by combining predictions from two gene-finding programs.通过结合两个基因发现程序的预测结果提高基因识别准确性。
Bioinformatics. 2002 Aug;18(8):1034-45. doi: 10.1093/bioinformatics/18.8.1034.
5
Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2).模式放线菌天蓝色链霉菌A3(2)的全基因组序列
Nature. 2002 May 9;417(6885):141-7. doi: 10.1038/417141a.
6
Genome sequence of the plant pathogen Ralstonia solanacearum.植物病原菌青枯雷尔氏菌的基因组序列。
Nature. 2002 Jan 31;415(6871):497-502. doi: 10.1038/415497a.
7
Analysis of codon usage diversity of bacterial genes with a self-organizing map (SOM): characterization of horizontally transferred genes with emphasis on the E. coli O157 genome.利用自组织映射(SOM)分析细菌基因的密码子使用多样性:重点关注大肠杆菌O157基因组的水平转移基因的特征
Gene. 2001 Oct 3;276(1-2):89-99. doi: 10.1016/s0378-1119(01)00673-4.
8
Genome sequence of Yersinia pestis, the causative agent of plague.鼠疫病原体——鼠疫耶尔森菌的基因组序列。
Nature. 2001 Oct 4;413(6855):523-7. doi: 10.1038/35097083.
9
Selection at the amino acid level can influence synonymous codon usage: implications for the study of codon adaptation in plastid genes.氨基酸水平上的选择会影响同义密码子的使用:对质体基因密码子适应性研究的启示。
Genetics. 2001 Sep;159(1):347-58. doi: 10.1093/genetics/159.1.347.
10
Intragenomic base content variation is a potential source of biases when searching for horizontally transferred genes.在寻找水平转移基因时,基因组内碱基含量变异是偏差的一个潜在来源。
Mol Biol Evol. 2001 Sep;18(9):1838-40. doi: 10.1093/oxfordjournals.molbev.a003972.