Suppr超能文献

利用表型、基因型和蛋白质组学综合分析鉴定向日葵杂种优势群的机器学习应用。

Application of machine learning for identification of heterotic groups in sunflower through combined approach of phenotyping, genotyping and protein profiling.

机构信息

Crop Science Institute, National Agricultural Research Centre, Islamabad, Pakistan.

Colorado Water Centre, Colorado State University, Fort Collins, CO, 80523, USA.

出版信息

Sci Rep. 2024 Mar 27;14(1):7333. doi: 10.1038/s41598-024-58049-z.

Abstract

Application of machine learning in plant breeding is a recent concept, that has to be optimized for precise utilization in the breeding program of high yielding crop plants. Identification and efficient utilization of heterotic grouping pattern aided with machine learning approaches is of utmost importance in hybrid cultivar breeding as it can save time and resources required to breed a new plant hybrid/variety. In the present study, 109 genotypes of sunflower were investigated at morphological, biochemical (SDS-PAGE) and molecular levels (through micro-satellites (SSR) markers) for heterotic grouping. All the three datasets were combined, scaled, and subjected to unsupervised machine learning algorithms, i.e., Hierarchical clustering, K-means clustering and hybrid clustering algorithm (hierarchical + K-means) for assessment of efficiency and resolution power of these algorithms in practical plant breeding for heterotic grouping identification. Following the application of machine learning unsupervised clustering approach, two major groups were identified in the studied sunflower germplasm, and further classification revealed six smaller classes in each major group through hierarchical and hybrid clustering approach. Due to high resolution, obtained in hierarchical clustering, classification achieved through this algorithm was further used for selection of potential parents. One genotype from each smaller group was selected based on the maximum seed yield potential and hybridized in a line  ×  tester mating design producing 36 F cross combinations. These Fs along with their parents were studied in open field conditions for validating the efficacy of identified heterotic groups in sunflowers genetic material under study. Data for 11 agronomic and qualitative traits were recorded. These 36 F combinations were tested for their combining ability (General/Specific), heterosis, genotypic and phenotypic correlation and path analysis. Results suggested that F hybrids performed better for all the traits under investigation than their respective parents. Findings of the study validated the use of machine learning approaches in practical plant breeding; however, more accurate and robust clustering algorithms need to be developed to handle the data noisiness of open field experiments.

摘要

机器学习在植物育种中的应用是一个较新的概念,必须针对高产作物的育种计划进行优化,以实现精确利用。利用机器学习方法识别和有效利用杂种优势分组模式对于杂交品种的培育至关重要,因为它可以节省培育新植物杂种/品种所需的时间和资源。本研究在形态学、生化(SDS-PAGE)和分子水平(通过微卫星(SSR)标记)对 109 个向日葵基因型进行了杂种优势分组研究。将所有三个数据集组合、缩放,并应用于无监督机器学习算法,即层次聚类、K-均值聚类和混合聚类算法(层次+K-均值),以评估这些算法在实际植物育种中用于杂种优势分组识别的效率和分辨率。在应用机器学习无监督聚类方法后,在研究的向日葵种质资源中确定了两个主要群体,通过层次聚类和混合聚类方法进一步分类,在每个主要群体中发现了六个较小的类。由于层次聚类获得的高分辨率,通过该算法实现的分类进一步用于选择潜在的亲本。根据最大种子产量潜力从每个较小的群体中选择一个基因型,并在杂交线 × 测验设计中杂交产生 36 个 F 杂交组合。这些 F 及其亲本在田间条件下进行研究,以验证在所研究的向日葵遗传材料中鉴定的杂种优势群体的有效性。记录了 11 个农艺和定性性状的数据。对这些 36 个 F 组合进行了配合力(一般/特殊)、杂种优势、基因型和表型相关性以及路径分析的测试。结果表明,与各自的亲本相比,F 杂种在所有研究性状中表现更好。该研究的结果验证了机器学习方法在实际植物育种中的应用;然而,需要开发更准确和稳健的聚类算法来处理田间试验数据的噪声。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bed/10973396/224f935553ac/41598_2024_58049_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验