Yu Ruiyang, Ye Xinghuo, Zhang Chenghua, Hu Hailong, Kang Yanlei, Li Zhong
School of Information Engineering, Huzhou University, Huzhou 313000, China.
Curr Issues Mol Biol. 2022 Dec 30;45(1):212-222. doi: 10.3390/cimb45010016.
Virus infestation can seriously harm the host plant's growth and development. Turnip yellows virus (TuYV) infestation of host plants can cause symptoms, such as yellowing and curling of leaves and root chlorosis. However, the regulatory mechanisms by which TuYV affects host growth and development are unclear. Hence, it is essential to mine small RNA (sRNA) and explore the regulation of sRNAs on plant hosts for disease control. In this study, we analyzed high-throughput data before and after TuYV infestation in Arabidopsis using combined genetics, statistics, and machine learning to identify 108 specifically expressed and critical functional sRNAs after TuYV infection. First, comparing the expression levels of sRNAs before and after infestation, 508 specific sRNAs were significantly up-regulated in Arabidopsis after infestation. In addition, the results show that AI models, including SVM, RF, XGBoost, and CNN using two-dimensional convolution, have robust classification features at the sequence level, with a prediction accuracy of about 96.8%. A comparison of specific sRNAs with genome sequences revealed that 247 matched precisely with the TuYV genome sequence but not with the Arabidopsis genome, suggesting that TuYV viruses may be their source. The 247 sRNAs predicted target genes and enrichment analysis, which identified 206 Arabidopsis genes involved in nine biological processes and three KEGG pathways associated with plant growth and viral stress tolerance, corresponding to 108 sRNAs. These findings provide a reference for studying sRNA-mediated interactions in pathogen infection and are essential for establishing a vital resource of regulation network for the virus infecting plants and deepening the understanding of TuYV virus infection patterns. However, further validation of these sRNAs is needed to gain a new understanding.
病毒侵染会严重损害寄主植物的生长发育。寄主植物感染芜菁黄化病毒(TuYV)会引发叶片黄化卷曲、根部褪绿等症状。然而,TuYV影响寄主生长发育的调控机制尚不清楚。因此,挖掘小RNA(sRNA)并探索其对植物寄主的调控作用对于病害防治至关重要。在本研究中,我们运用遗传学、统计学和机器学习相结合的方法,分析了拟南芥在TuYV侵染前后的高通量数据,以鉴定出TuYV感染后108个特异性表达且具有关键功能的sRNA。首先,比较侵染前后sRNA的表达水平,侵染后拟南芥中有508个特异性sRNA显著上调。此外,结果表明,包括支持向量机(SVM)、随机森林(RF)、极端梯度提升(XGBoost)以及使用二维卷积的卷积神经网络(CNN)在内的人工智能模型在序列水平上具有强大的分类特征,预测准确率约为96.8%。将特异性sRNA与基因组序列进行比较发现,247个sRNA与TuYV基因组序列精确匹配,但与拟南芥基因组不匹配,这表明TuYV病毒可能是它们的来源。对这247个sRNA预测的靶基因进行富集分析,鉴定出206个拟南芥基因参与了九个生物学过程和三条与植物生长及病毒胁迫耐受性相关的KEGG通路,对应108个sRNA。这些发现为研究病原体感染中sRNA介导的相互作用提供了参考,对于建立病毒侵染植物的重要调控网络资源以及深化对TuYV病毒感染模式的理解至关重要。然而,需要对这些sRNA进行进一步验证才能获得新的认识。