Homma Keiichi, Fukuchi Satoshi, Nishikawa Ken, Sakamoto Shigetaka, Sugawara Hideaki
Center for Information Biology-DNA Data Bank of Japan, National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka, Japan.
Mol Biosyst. 2012 Jan;8(1):247-55. doi: 10.1039/c1mb05208j. Epub 2011 Aug 24.
Proteins in general consist not only of globular structural domains (SDs), but also of intrinsically disordered regions (IDRs), i.e. those that do not assume unique three-dimensional structures by themselves. Although IDRs are especially prevalent in eukaryotic proteins, the functions are mostly unknown. To elucidate the functions of IDRs, we first divided eukaryotic proteins into subcellular localizations, identified IDRs by the DICHOT system that accurately divides entire proteins into SDs and IDRs, and examined charge and hydropathy characteristics. On average, mitochondrial proteins have IDRs more positively charged than SDs. Comparison of mitochondrial proteins with orthologous prokaryotic proteins showed that mitochondrial proteins tend to have segments attached at both N and C termini, high fractions of which are IDRs. Segments added to the N-terminus of mitochondrial proteins contain not only signal sequences but also mature proteins and exhibit a positive charge gradient, with the magnitude increasing toward the N-terminus. This finding is consistent with the notion that positively charged residues are added to the N-terminus of proteobacterial proteins so that the extended proteins can be chromosomally encoded and efficiently transported to mitochondria after translation. By contrast, nuclear proteins generally have positively charged SDs and negatively charged IDRs. Among nuclear proteins, DNA-binding proteins have enhanced charge tendencies. We propose that SDs in nuclear proteins tend to be positively charged because of the need to bind to negatively charged nucleotides, while IDRs tend to be negatively charged to interact with other proteins or other regions of the same proteins to avoid premature proteasomal degradation.
一般来说,蛋白质不仅由球状结构域(SDs)组成,还包含内在无序区域(IDRs),即那些自身不具有独特三维结构的区域。尽管IDRs在真核生物蛋白质中特别普遍,但其功能大多未知。为了阐明IDRs的功能,我们首先将真核生物蛋白质按亚细胞定位进行分类,通过DICHOT系统识别IDRs,该系统能准确地将整个蛋白质分为SDs和IDRs,并研究其电荷和亲水性特征。平均而言,线粒体蛋白质的IDRs比SDs带更多正电荷。将线粒体蛋白质与其原核生物直系同源蛋白质进行比较发现,线粒体蛋白质倾向于在N端和C端都有附加片段,其中很大一部分是IDRs。添加到线粒体蛋白质N端的片段不仅包含信号序列,还包含成熟蛋白质,并呈现出正电荷梯度,其强度向N端增加。这一发现与以下观点一致,即带正电荷的残基被添加到变形菌蛋白质的N端,以便延伸后的蛋白质能够在染色体上编码,并在翻译后有效地转运到线粒体。相比之下,核蛋白质通常具有带正电荷的SDs和带负电荷的IDRs。在核蛋白质中,DNA结合蛋白具有增强的电荷倾向。我们认为,核蛋白质中的SDs倾向于带正电荷是因为需要与带负电荷的核苷酸结合,而IDRs倾向于带负电荷以便与其他蛋白质或同一蛋白质的其他区域相互作用,从而避免过早被蛋白酶体降解。