Popadin Konstantin Yu, Mamirova Leila A, Kondrashov Fyodor A
Institute for Information Transmission Problems RAS, Bolshoi Karetny pereulok 19, Moscow.
BMC Bioinformatics. 2007 Nov 14;8:441. doi: 10.1186/1471-2105-8-441.
Mitochondrial tRNAs have been the subject of study for structural biologists interested in their secondary structure characteristics, evolutionary biologists have researched patterns of compensatory and structural evolution and medical studies have been directed towards understanding the basis of human disease. However, an up to date, manually curated database of mitochondrially encoded tRNAs from higher animals is currently not available.
We obtained the complete mitochondrial sequence for 277 tetrapod species from GenBank and re-annotated all of the tRNAs based on a multiple alignment of each tRNA gene and secondary structure prediction made independently for each tRNA. The mitochondrial (mt) tRNA sequences and the secondary structure based multiple alignments are freely available as Supplemental Information online.
We compiled a manually curated database of mitochondrially encoded tRNAs from tetrapods with completely sequenced genomes. In the course of our work, we reannotated more than 10% of all tetrapod mt-tRNAs and subsequently predicted the secondary structures of 6060 mitochondrial tRNAs. This carefully constructed database can be utilized to enhance our knowledge in several different fields including the evolution of mt-tRNA secondary structure and prediction of pathogenic mt-tRNA mutations. In addition, researchers reporting novel mitochondrial genome sequences should check their tRNA gene annotations against our database to ensure a higher level of fidelity of their annotation.
线粒体tRNA一直是结构生物学家研究的对象,他们关注其二级结构特征;进化生物学家研究了补偿性和结构进化模式;医学研究则致力于理解人类疾病的基础。然而,目前尚无一个最新的、经过人工整理的高等动物线粒体编码tRNA数据库。
我们从GenBank获取了277种四足动物的完整线粒体序列,并基于每个tRNA基因的多重比对以及为每个tRNA独立进行的二级结构预测,对所有tRNA进行了重新注释。线粒体(mt)tRNA序列和基于二级结构的多重比对可作为在线补充信息免费获取。
我们编制了一个经过人工整理的数据库,其中包含来自基因组已完全测序的四足动物的线粒体编码tRNA。在我们的工作过程中,我们对所有四足动物mt-tRNA中的10%以上进行了重新注释,随后预测了6060个线粒体tRNA的二级结构。这个精心构建的数据库可用于增进我们在几个不同领域的知识,包括mt-tRNA二级结构的进化以及致病性mt-tRNA突变的预测。此外,报告新线粒体基因组序列的研究人员应根据我们的数据库检查其tRNA基因注释,以确保注释具有更高的准确性。