• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

植物矿:一种用于检测水稻基因组核心 SNP 的机器学习框架。

PlantMine: A Machine-Learning Framework to Detect Core SNPs in Rice Genomics.

机构信息

School of Biological Engineering, Sichuan University of Science & Engineering, Yibin 644000, China.

National Agriculture Science Data Center, Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China.

出版信息

Genes (Basel). 2024 May 9;15(5):603. doi: 10.3390/genes15050603.

DOI:10.3390/genes15050603
PMID:38790232
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11120712/
Abstract

As a fundamental global staple crop, rice plays a pivotal role in human nutrition and agricultural production systems. However, its complex genetic architecture and extensive trait variability pose challenges for breeders and researchers in optimizing yield and quality. Particularly to expedite breeding methods like genomic selection, isolating core SNPs related to target traits from genome-wide data reduces irrelevant mutation noise, enhancing computational precision and efficiency. Thus, exploring efficient computational approaches to mine core SNPs is of great importance. This study introduces PlantMine, an innovative computational framework that integrates feature selection and machine learning techniques to effectively identify core SNPs critical for the improvement of rice traits. Utilizing the dataset from the 3000 Rice Genomes Project, we applied different algorithms for analysis. The findings underscore the effectiveness of combining feature selection with machine learning in accurately identifying core SNPs, offering a promising avenue to expedite rice breeding efforts and improve crop productivity and resilience to stress.

摘要

作为全球主要的粮食作物之一,水稻在人类营养和农业生产系统中起着至关重要的作用。然而,其复杂的遗传结构和广泛的性状变异性给育种家和研究人员在优化产量和品质方面带来了挑战。特别是为了加速基因组选择等育种方法,从全基因组数据中分离与目标性状相关的核心 SNPs 可以减少不相关的突变噪声,提高计算精度和效率。因此,探索有效的计算方法来挖掘核心 SNPs 非常重要。本研究介绍了 PlantMine,这是一种创新的计算框架,它集成了特征选择和机器学习技术,可有效地识别对水稻性状改良至关重要的核心 SNPs。我们利用 3000 份水稻基因组计划的数据集应用了不同的算法进行分析。研究结果强调了在准确识别核心 SNPs 方面结合特征选择和机器学习的有效性,为加速水稻育种工作提供了有前景的途径,提高了作物的生产力和对胁迫的适应能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/bca65e49b937/genes-15-00603-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/abec31fbeec3/genes-15-00603-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/fc6a26ba4b84/genes-15-00603-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/4888cce1dc54/genes-15-00603-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/bca65e49b937/genes-15-00603-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/abec31fbeec3/genes-15-00603-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/fc6a26ba4b84/genes-15-00603-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/4888cce1dc54/genes-15-00603-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/476f/11120712/bca65e49b937/genes-15-00603-g004.jpg

相似文献

1
PlantMine: A Machine-Learning Framework to Detect Core SNPs in Rice Genomics.植物矿:一种用于检测水稻基因组核心 SNP 的机器学习框架。
Genes (Basel). 2024 May 9;15(5):603. doi: 10.3390/genes15050603.
2
From gene banks to farmer's fields: using genomic selection to identify donors for a breeding program in rice to close the yield gap on smallholder farms.从基因库到农民田间:利用基因组选择为水稻育种计划鉴定供体,以缩小小农户农场的产量差距。
Theor Appl Genet. 2021 Oct;134(10):3397-3410. doi: 10.1007/s00122-021-03909-9. Epub 2021 Jul 15.
3
Genome-wide resequencing of KRICE_CORE reveals their potential for future breeding, as well as functional and evolutionary studies in the post-genomic era.对KRICE_CORE进行全基因组重测序揭示了它们在未来育种以及后基因组时代功能和进化研究方面的潜力。
BMC Genomics. 2016 May 26;17:408. doi: 10.1186/s12864-016-2734-y.
4
SSR marker-based genetic diversity analysis and SNP haplotyping of genes associating abiotic and biotic stress tolerance, rice growth and development and yield across 93 rice landraces.基于 SSR 标记的遗传多样性分析和 SNP 单倍型分析,这些基因与非生物和生物胁迫耐受性、水稻生长发育和产量有关,研究对象为 93 份水稻地方品种。
Mol Biol Rep. 2021 Aug;48(8):5943-5953. doi: 10.1007/s11033-021-06595-4. Epub 2021 Jul 28.
5
Evaluation of methods and marker Systems in Genomic Selection of oil palm (Elaeis guineensis Jacq.).油棕(Elaeis guineensis Jacq.)基因组选择中方法和标记系统的评估
BMC Genet. 2017 Dec 11;18(1):107. doi: 10.1186/s12863-017-0576-5.
6
Genome-Wide Association Study of Root System Development at Seedling Stage in Rice.水稻幼苗根系发育的全基因组关联研究。
Genes (Basel). 2020 Nov 25;11(12):1395. doi: 10.3390/genes11121395.
7
SR4R: An Integrative SNP Resource for Genomic Breeding and Population Research in Rice.SR4R:一个水稻基因组育种和群体研究的综合 SNP 资源。
Genomics Proteomics Bioinformatics. 2020 Apr;18(2):173-185. doi: 10.1016/j.gpb.2020.03.002. Epub 2020 Jun 30.
8
An improved 7K SNP array, the C7AIR, provides a wealth of validated SNP markers for rice breeding and genetics studies.改良后的 7K SNP 芯片 C7AIR 为水稻育种和遗传学研究提供了丰富的经验证的 SNP 标记。
PLoS One. 2020 May 14;15(5):e0232479. doi: 10.1371/journal.pone.0232479. eCollection 2020.
9
Predicting biomass of rice with intermediate traits: Modeling method combining crop growth models and genomic prediction models.利用中间性状预测水稻生物量:结合作物生长模型和基因组预测模型的建模方法。
PLoS One. 2020 Jun 19;15(6):e0233951. doi: 10.1371/journal.pone.0233951. eCollection 2020.
10
TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.TrG2P:一种基于迁移学习的工具,集成多性状数据,用于准确预测作物产量。
Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.

引用本文的文献

1
Application of machine learning and genomics for orphan crop improvement.机器学习与基因组学在小众作物改良中的应用。
Nat Commun. 2025 Jan 24;16(1):982. doi: 10.1038/s41467-025-56330-x.

本文引用的文献

1
Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data.基于机器学习的基因组预测:在合成数据和实际数据上,正则化回归、集成、基于实例和深度学习方法的性能比较。
BMC Genomics. 2024 Feb 7;25(1):152. doi: 10.1186/s12864-023-09933-x.
2
NRTPredictor: identifying rice root cell state in single-cell RNA-seq via ensemble learning.NRTPredictor:通过集成学习在单细胞RNA测序中识别水稻根细胞状态
Plant Methods. 2023 Nov 4;19(1):119. doi: 10.1186/s13007-023-01092-0.
3
Research on Plant Genomics and Breeding.
植物基因组学与育种研究。
Int J Mol Sci. 2023 Oct 18;24(20):15298. doi: 10.3390/ijms242015298.
4
Machine learning bridges omics sciences and plant breeding.机器学习架起了组学科学与植物育种之间的桥梁。
Trends Plant Sci. 2023 Feb;28(2):199-210. doi: 10.1016/j.tplants.2022.08.018. Epub 2022 Sep 21.
5
LightGBM: accelerated genomically designed crop breeding through ensemble learning.LightGBM:通过集成学习加速基因组设计的作物育种。
Genome Biol. 2021 Sep 20;22(1):271. doi: 10.1186/s13059-021-02492-y.
6
Predicting Cell Wall Lytic Enzymes Using Combined Features.利用组合特征预测细胞壁裂解酶
Front Bioeng Biotechnol. 2021 Jan 6;8:627335. doi: 10.3389/fbioe.2020.627335. eCollection 2020.
7
eHSCPr discriminating the cell identity involved in endothelial to hematopoietic transition.eHSCPr区分参与内皮向造血转变的细胞身份。
Bioinformatics. 2021 Aug 9;37(15):2157-2164. doi: 10.1093/bioinformatics/btab071.
8
Deep Learning for Predicting Complex Traits in Spring Wheat Breeding Program.深度学习在春小麦育种计划中预测复杂性状的应用
Front Plant Sci. 2021 Jan 5;11:613325. doi: 10.3389/fpls.2020.613325. eCollection 2020.
9
Machine learning: A powerful tool for gene function prediction in plants.机器学习:植物基因功能预测的强大工具。
Appl Plant Sci. 2020 Jul 28;8(7):e11376. doi: 10.1002/aps3.11376. eCollection 2020 Jul.
10
Design powerful predictor for mRNA subcellular location prediction in Homo sapiens.设计用于预测人类 mRNA 亚细胞定位的强大预测器。
Brief Bioinform. 2021 Jan 18;22(1):526-535. doi: 10.1093/bib/bbz177.