King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center, Thuwal 23955-6900, Saudi Arabia.
Bioinformatics. 2013 Jan 1;29(1):117-8. doi: 10.1093/bioinformatics/bts638. Epub 2012 Oct 30.
In higher eukaryotes, the identification of translation initiation sites (TISs) has been focused on finding these signals in cDNA or mRNA sequences. Using Arabidopsis thaliana (A.t.) information, we developed a prediction tool for signals within genomic sequences of plants that correspond to TISs. Our tool requires only genome sequence, not expressed sequences. Its sensitivity/specificity is for A.t. (90.75%/92.2%), for Vitis vinifera (66.8%/94.4%) and for Populus trichocarpa (81.6%/94.4%), which suggests that our tool can be used in annotation of different plant genomes. We provide a list of features used in our model. Further study of these features may improve our understanding of mechanisms of the translation initiation.
Our tool is implemented as an artificial neural network. It is available as a web-based tool and, together with the source code, the list of features, and data used for model development, is accessible at http://cbrc.kaust.edu.sa/dts.
在高等真核生物中,翻译起始位点(TIS)的鉴定主要集中在寻找 cDNA 或 mRNA 序列中的这些信号。利用拟南芥(A.t.)的信息,我们开发了一种针对植物基因组中与 TIS 相对应的信号的预测工具。我们的工具仅需要基因组序列,而不需要表达序列。它对 A.t.(90.75%/92.2%)、葡萄(Vitis vinifera)(66.8%/94.4%)和杨属(Populus trichocarpa)(81.6%/94.4%)的灵敏度/特异性,这表明我们的工具可用于注释不同的植物基因组。我们提供了模型中使用的特征列表。进一步研究这些特征可能有助于我们理解翻译起始的机制。
我们的工具是作为人工神经网络实现的。它作为一个基于网络的工具提供,以及源代码、特征列表和用于模型开发的数据可在 http://cbrc.kaust.edu.sa/dts 访问。