Ganne Akshatha, Balasubramaniam Meenakshisundaram, Ayyadevara Srinivas, Shmookler Reis Robert J
Bioinformatics Program, University of Arkansas for Medical Sciences and University of Arkansas at Little Rock, Little Rock, AR, United States.
Department of Geriatrics, University of Arkansas for Medical Sciences, Little Rock, AR, United States.
Front Aging Neurosci. 2022 Aug 3;14:938117. doi: 10.3389/fnagi.2022.938117. eCollection 2022.
Protein structure is determined by the amino acid sequence and a variety of post-translational modifications, and provides the basis for physiological properties. Not all proteins in the proteome attain a stable conformation; roughly one third of human proteins are unstructured or contain intrinsically disordered regions exceeding 40% of their length. Proteins comprising or containing extensive unstructured regions are termed intrinsically disordered proteins (IDPs). IDPs are known to be overrepresented in protein aggregates of diverse neurodegenerative diseases. We evaluated the importance of disordered proteins in the nematode , by RNAi-mediated knockdown of IDPs in disease-model strains that mimic aggregation associated with neurodegenerative pathologies. Not all disordered proteins are sequestered into aggregates, and most of the tested aggregate-protein IDPs contribute to important physiological functions such as stress resistance or reproduction. Despite decades of research, we still do not understand what properties of a disordered protein determine its entry into aggregates. We have employed machine-learning models to identify factors that predict whether a disordered protein is found in sarkosyl-insoluble aggregates isolated from neurodegenerative-disease brains (both AD and PD). Machine-learning predictions, coupled with principal component analysis (PCA), enabled us to identify the physiochemical properties that determine whether a disordered protein will be enriched in neuropathic aggregates.
蛋白质结构由氨基酸序列和多种翻译后修饰决定,并为其生理特性提供基础。蛋白质组中的并非所有蛋白质都能形成稳定的构象;大约三分之一的人类蛋白质是无结构的,或者含有长度超过其40%的内在无序区域。包含或含有广泛无结构区域的蛋白质被称为内在无序蛋白质(IDP)。已知IDP在多种神经退行性疾病的蛋白质聚集体中过度表达。我们通过RNA干扰介导的疾病模型菌株中IDP的敲低,评估了线虫中无序蛋白质的重要性,这些疾病模型菌株模拟了与神经退行性病变相关的聚集。并非所有无序蛋白质都会被隔离到聚集体中,并且大多数测试的聚集蛋白IDP对诸如抗应激或繁殖等重要生理功能有贡献。尽管经过了数十年的研究,我们仍然不了解无序蛋白质的哪些特性决定其进入聚集体。我们采用机器学习模型来识别预测无序蛋白质是否存在于从神经退行性疾病大脑(阿尔茨海默病和帕金森病)分离的 Sarkosyl 不溶性聚集体中的因素。机器学习预测与主成分分析(PCA)相结合,使我们能够识别决定无序蛋白质是否会在神经病变聚集体中富集的物理化学性质。