Kannan Sivakumar, Burger Gertraud
Robert Cedergren Research Center for Bioinformatics and Genomics, Département de Biochimie, Université de Montréal, 2900 Boulevard Edouard-Montpetit, Montréal, Québec, H3T 1J4, Canada.
BMC Genomics. 2008 Oct 2;9:455. doi: 10.1186/1471-2164-9-455.
In a previous study, we conducted a large-scale similarity-free function prediction of mitochondrion-encoded hypothetical proteins, by which the hypothetical gene murf1 (maxicircle unidentified reading frame 1) was assigned as nad2, encoding subunit 2 of NADH dehydrogenase (Complex I of the respiratory chain). This hypothetical gene occurs in the mitochondrial genome of kinetoplastids, a group of unicellular eukaryotes including the causative agents of African sleeping sickness and leishmaniasis. In the present study, we test this assignment by using bioinformatics methods that are highly sensitive in identifying remote homologs and confront the prediction with available biological knowledge.
Comparison of MURF1 profile Hidden Markov Model (HMM) against function-known profile HMMs in Pfam, Panther and TIGR shows that MURF1 is a Complex I protein, but without specifying the exact subunit. Therefore, we constructed profile HMMs for each individual subunit, using all available sequences clustered at various identity thresholds. HMM-HMM comparison of these individual NADH subunits against MURF1 clearly identifies this hypothetical protein as NAD2. Further, we collected the relevant experimental information about kinetoplastids, which provides additional evidence in support of this prediction.
Our in silico analyses provide convincing evidence for MURF1 being a highly divergent member of NAD2.
在之前的一项研究中,我们对线粒体编码的假设蛋白进行了大规模的无相似性功能预测,据此将假设基因murf1(大环未知阅读框1)指定为nad2,其编码NADH脱氢酶(呼吸链复合体I)的亚基2。该假设基因存在于动基体的线粒体基因组中,动基体是一类单细胞真核生物,包括非洲昏睡病和利什曼病的病原体。在本研究中,我们使用在识别远源同源物方面高度敏感的生物信息学方法来检验这一指定,并将该预测与现有的生物学知识进行对照。
将MURF1概况隐马尔可夫模型(HMM)与Pfam、Panther和TIGR中功能已知的概况HMM进行比较,结果表明MURF1是一种复合体I蛋白,但未明确具体的亚基。因此,我们使用在不同同一性阈值下聚类的所有可用序列,为每个单独的亚基构建了概况HMM。将这些单独的NADH亚基的HMM与MURF1进行HMM-HMM比较,明确将这种假设蛋白鉴定为NAD2。此外,我们收集了有关动基体的相关实验信息,为这一预测提供了额外的证据。
我们的计算机模拟分析为MURF1是NAD2的一个高度分化的成员提供了令人信服的证据。