Suppr超能文献

RiceSNP-BST:一种用于预测水稻中与生物胁迫相关的 SNP 的深度学习框架。

RiceSNP-BST: a deep learning framework for predicting biotic stress-associated SNPs in rice.

机构信息

School of Information and Artificial Intelligence, Anhui Provincial Engineering Research Center for Beidou Precision Agriculture Information, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China.

Research Center for Biological Breeding Technology, Advance Academy, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China.

出版信息

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae599.

Abstract

Rice consistently faces significant threats from biotic stresses, such as fungi, bacteria, pests, and viruses. Consequently, accurately and rapidly identifying previously unknown single-nucleotide polymorphisms (SNPs) in the rice genome is a critical challenge for rice research and the development of resistant varieties. However, the limited availability of high-quality rice genotype data has hindered this research. Deep learning has transformed biological research by facilitating the prediction and analysis of SNPs in biological sequence data. Convolutional neural networks are especially effective in extracting structural and local features from DNA sequences, leading to significant advancements in genomics. Nevertheless, the expanding catalog of genome-wide association studies provides valuable biological insights for rice research. Expanding on this idea, we introduce RiceSNP-BST, an automatic architecture search framework designed to predict SNPs associated with rice biotic stress traits (BST-associated SNPs) by integrating multidimensional features. Notably, the model successfully innovates the datasets, offering more precision than state-of-the-art methods while demonstrating good performance on an independent test set and cross-species datasets. Additionally, we extracted features from the original DNA sequences and employed causal inference to enhance the biological interpretability of the model. This study highlights the potential of RiceSNP-BST in advancing genome prediction in rice. Furthermore, a user-friendly web server for RiceSNP-BST (http://rice-snp-bst.aielab.cc) has been developed to support broader genome research.

摘要

水稻一直面临着生物胁迫的重大威胁,如真菌、细菌、害虫和病毒。因此,准确快速地识别水稻基因组中以前未知的单核苷酸多态性(SNP)是水稻研究和抗性品种开发的关键挑战。然而,高质量水稻基因型数据的有限可用性阻碍了这一研究。深度学习通过促进生物序列数据中 SNP 的预测和分析,改变了生物学研究。卷积神经网络特别擅长从 DNA 序列中提取结构和局部特征,从而在基因组学方面取得了重大进展。然而,不断扩展的全基因组关联研究目录为水稻研究提供了有价值的生物学见解。在此基础上,我们引入了 RiceSNP-BST,这是一个自动架构搜索框架,旨在通过整合多维特征来预测与水稻生物胁迫性状相关的 SNP(BST 相关 SNP)。值得注意的是,该模型成功地创新了数据集,提供了比最先进方法更高的精度,同时在独立测试集和跨物种数据集上表现良好。此外,我们从原始 DNA 序列中提取特征,并采用因果推断来增强模型的生物学可解释性。这项研究强调了 RiceSNP-BST 在推进水稻基因组预测方面的潜力。此外,我们还开发了一个易于使用的 RiceSNP-BST 网络服务器(http://rice-snp-bst.aielab.cc),以支持更广泛的基因组研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0f5/11576077/c4a0641abdbc/bbae599f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验