Bug D S, Moiseev I S, Porozov Yu B, Petukhova N V
Bioinformatics Research Center, Pavlov First Saint Petersburg Medical State University, St. Petersburg, Russia.
R. M. Gorbacheva Scientific Research Institute of Pediatric Hematology and Transplantation, Pavlov First Saint Petersburg State Medical University, St. Petersburg, Russia.
Front Mol Biosci. 2024 Oct 3;11:1441180. doi: 10.3389/fmolb.2024.1441180. eCollection 2024.
The Dicer protein is an indispensable player in such fundamental cell pathways as miRNA biogenesis and regulation of protein expression in a cell. Most recently, both germline and somatic mutations in have been identified in diverse types of cancers, which suggests Dicer mutations can lead to cancer progression. In addition to well-known hotspot mutations in RNAase III domains, is characterized by a wide spectrum of variants in all the functional domains; most are of uncertain significance and unstated clinical effects. Moreover, various new somatic mutations continuously appear in cancer genome sequencing. The latest contemporary methods of variant effect prediction utilize machine learning algorithms on bulk data, yielding suboptimal correlation with biological data. Consequently, such analysis should be conducted based on the functional and structural characteristics of each protein, using a well-grounded targeted dataset rather than relying on large amounts of unsupervised data. Domains are the functional and evolutionary units of a protein; the analysis of the whole protein should be based on separate and independent examinations of each domain by their evolutionary reconstruction. Dicer represents a hallmark example of a multidomain protein, and we confirmed the phylogenetic multidomain approach being beneficial for the clinical effect prediction of Dicer variants. Because Dicer was suggested to have a putative role in hematological malignancies, we examined variants of occurring outside the well-known hotspots of the RNase III domain in this type of cancer using phylogenetic reconstruction of individual domain history. Examined substitutions might disrupt the Dicer function, which was demonstrated by molecular dynamic simulation, where distinct structural alterations were observed for each mutation. Our approach can be utilized to study other multidomain proteins and to improve clinical effect evaluation.
Dicer蛋白是细胞中miRNA生物合成和蛋白质表达调控等基本细胞通路中不可或缺的参与者。最近,在多种类型的癌症中都发现了种系和体细胞突变,这表明Dicer突变可能导致癌症进展。除了RNA酶III结构域中众所周知的热点突变外,其在所有功能结构域中都具有广泛的变异;大多数变异的意义不确定且临床影响不明确。此外,各种新的体细胞突变在癌症基因组测序中不断出现。最新的变异效应预测方法利用机器学习算法处理大量数据,与生物学数据的相关性欠佳。因此,此类分析应基于每种蛋白质的功能和结构特征,使用有充分依据的靶向数据集,而不是依赖大量无监督数据。结构域是蛋白质的功能和进化单位;对整个蛋白质的分析应基于通过进化重建对每个结构域进行单独和独立的检查。Dicer是多结构域蛋白的典型例子,我们证实系统发育多结构域方法有利于Dicer变异体的临床效应预测。由于Dicer被认为在血液系统恶性肿瘤中具有假定作用,我们利用单个结构域历史的系统发育重建,研究了这种癌症中RNA酶III结构域已知热点之外发生的Dicer变异体。经检测的替代可能会破坏Dicer功能,分子动力学模拟证明了这一点,其中每个突变都观察到了明显的结构改变。我们的方法可用于研究其他多结构域蛋白并改善临床效应评估。