• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习驱动识别影响陆地棉纤维产量和品质性状的关键环境因素

Machine Learning-Driven Identification of Key Environmental Factors Influencing Fiber Yield and Quality Traits in Upland Cotton.

作者信息

Souaibou Mohamadou, Yan Haoliang, Dai Panhong, Pan Jingtao, Li Yang, Shi Yuzhen, Gong Wankui, Shang Haihong, Gong Juwu, Yuan Youlu

机构信息

State Key Laboratory of Cotton Bio-Breeding and Integrated Utilization, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang 455000, China.

Zhengzhou Research Base, State Key Laboratory of Cotton Bio-Breeding and Integrated Utilization, School of Agricultural Sciences, Zhengzhou University, Zhengzhou 450001, China.

出版信息

Plants (Basel). 2025 Jul 4;14(13):2053. doi: 10.3390/plants14132053.

DOI:10.3390/plants14132053
PMID:40648062
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12252131/
Abstract

Understanding the influence of environmental factors on cotton performance is crucial for enhancing yield and fiber quality in the context of climate change. This study investigates genotype-by-environment (G×E) interactions in cotton, using data from 250 recombinant inbred lines (CCRI70 RILs) cultivated across 14 diverse environments in China's major cotton cultivation areas. Our findings reveal that environmental effects predominantly influenced yield-related traits (boll weight, lint percentage, and the seed index), contributing to 34.7% to 55.7% of their variance. In contrast fiber quality traits showed lower environmental sensitivity (12.3-27.0%), with notable phenotypic plasticity observed in the boll weight, lint percentage, and fiber micronaire. Employing six machine learning models, Random Forest demonstrated superior predictive ability (R = 0.40-0.72; predictive Pearson correlation = 0.63-0.86). Through SHAP-based interpretation and sliding-window regression, we identified key environmental drivers primarily active during mid-to-late growth stages. This approach effectively reduced the number of influential input variables to just 0.1-2.4% of the original dataset, spanning 2-9 critical time windows per trait. Incorporating these identified drivers significantly improved cross-environment predictions, enhancing Random Forest accuracy by 0.02-0.15. These results underscore the strong potential of machine learning to uncover critical temporal environmental factors underlying G×E interactions and to substantially improve predictive modeling in cotton breeding programs, ultimately contributing to more resilient and productive cotton cultivation.

摘要

了解环境因素对棉花性能的影响对于在气候变化背景下提高产量和纤维质量至关重要。本研究利用在中国主要棉花种植区14个不同环境中种植的250个重组自交系(CCRI70 RILs)的数据,调查了棉花中的基因型与环境互作(G×E)。我们的研究结果表明,环境效应主要影响与产量相关的性状(铃重、衣分和籽指),其变异的34.7%至55.7%归因于此。相比之下,纤维品质性状对环境的敏感性较低(12.3 - 27.0%),在铃重、衣分和纤维马克隆值中观察到显著的表型可塑性。采用六种机器学习模型,随机森林表现出卓越的预测能力(R = 0.40 - 0.72;预测皮尔逊相关系数 = 0.63 - 0.86)。通过基于SHAP的解释和滑动窗口回归,我们确定了主要在生长中后期起作用的关键环境驱动因素。这种方法有效地将有影响的输入变量数量减少到原始数据集的仅0.1 - 2.4%,每个性状跨越2 - 9个关键时间窗口。纳入这些已确定的驱动因素显著改善了跨环境预测,将随机森林的准确率提高了0.02 - 0.15。这些结果强调了机器学习在揭示G×E互作背后关键的时间环境因素以及大幅改进棉花育种计划中的预测建模方面的强大潜力,最终有助于实现更具适应性和高产的棉花种植。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/0a86048abf03/plants-14-02053-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/073f769b7c48/plants-14-02053-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/72f9ec88b3c0/plants-14-02053-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/1c78d5233022/plants-14-02053-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/17b67fd956c0/plants-14-02053-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/345af39ffc64/plants-14-02053-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/0a86048abf03/plants-14-02053-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/073f769b7c48/plants-14-02053-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/72f9ec88b3c0/plants-14-02053-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/1c78d5233022/plants-14-02053-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/17b67fd956c0/plants-14-02053-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/345af39ffc64/plants-14-02053-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d464/12252131/0a86048abf03/plants-14-02053-g006.jpg

相似文献

1
Machine Learning-Driven Identification of Key Environmental Factors Influencing Fiber Yield and Quality Traits in Upland Cotton.机器学习驱动识别影响陆地棉纤维产量和品质性状的关键环境因素
Plants (Basel). 2025 Jul 4;14(13):2053. doi: 10.3390/plants14132053.
2
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
3
Genotype-by-environment interaction for yearling weight of Nellore cattle in pasture and feedlot conditions using a "double" genomic reaction norm model.使用“双重”基因组反应规范模型,对草原和饲养场条件下内洛尔牛一岁体重的基因型与环境互作进行研究。
J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf169.
4
Genotype-by-gestational thermal environment interaction and its impact on the future performance of tropical composite beef cattle offspring.基因型-妊娠期热环境互作及其对热带杂交肉牛后代未来性能的影响。
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae200.
5
Simultaneous improvement of fiber yield and quality in upland cotton () by integration of auxin transport and synthesis.通过整合生长素运输与合成同时提高陆地棉的纤维产量和品质
Mol Breed. 2024 Sep 17;44(10):64. doi: 10.1007/s11032-024-01500-w. eCollection 2024 Oct.
6
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes: Methodology and Validation Study.用于评估、选择和解释2型糖尿病患者心血管疾病结局机器学习模型的责任框架:方法与验证研究
JMIR Med Inform. 2025 Jun 27;13:e66200. doi: 10.2196/66200.
7
GWAS and GS analysis revealed the selection and prediction efficiency for yield, plant morphological, and fiber quality in Gossypium barbadense.全基因组关联研究(GWAS)和基因组选择(GS)分析揭示了海岛棉产量、植株形态和纤维品质的选择及预测效率。
Theor Appl Genet. 2025 Jun 9;138(7):138. doi: 10.1007/s00122-025-04911-1.
8
Short-Season Direct-Seeded Cotton Cultivation Under Once-Only Irrigation Throughout the Growing Season: Investigating the Effects of Planting Density and Nitrogen Application.全生育期一次性灌溉条件下短季直播棉花栽培:探究种植密度和施氮量的影响
Plants (Basel). 2025 Jun 17;14(12):1864. doi: 10.3390/plants14121864.
9
Advanced Prediction of Heart Failure Risk in Elderly Diabetic and Hypertensive Patients Using Nine Machine Learning Models and Novel Composite Indices: Insights from NHANES 2003-2016.使用九种机器学习模型和新型复合指标对老年糖尿病和高血压患者心力衰竭风险的高级预测:来自2003 - 2016年美国国家健康与营养检查调查(NHANES)的见解
Eur J Prev Cardiol. 2025 Feb 27. doi: 10.1093/eurjpc/zwaf081.
10
Explainable AI-driven prediction of APE1 inhibitors: enhancing cancer therapy with machine learning models and feature importance analysis.可解释人工智能驱动的APE1抑制剂预测:利用机器学习模型和特征重要性分析增强癌症治疗
Mol Divers. 2025 Feb 21. doi: 10.1007/s11030-025-11133-6.

本文引用的文献

1
Machine learning algorithms translate big data into predictive breeding accuracy.机器学习算法将大数据转化为预测育种准确性。
Trends Plant Sci. 2025 Feb;30(2):167-184. doi: 10.1016/j.tplants.2024.09.011. Epub 2024 Oct 26.
2
Genetic interrogation of phenotypic plasticity informs genome-enabled breeding in cotton.遗传探究表型可塑性为棉花的基因组辅助育种提供信息。
J Genet Genomics. 2023 Dec;50(12):971-982. doi: 10.1016/j.jgg.2023.05.004. Epub 2023 May 19.
3
Yield prediction through integration of genetic, environment, and management data through deep learning.
通过深度学习整合遗传、环境和管理数据进行产量预测。
G3 (Bethesda). 2023 Apr 11;13(4). doi: 10.1093/g3journal/jkad006.
4
Machine learning for predicting phenotype from genotype and environment.用于从基因型和环境预测表型的机器学习。
Curr Opin Biotechnol. 2023 Feb;79:102853. doi: 10.1016/j.copbio.2022.102853. Epub 2022 Dec 2.
5
Gene-by-environment interactions in plants: Molecular mechanisms, environmental drivers, and adaptive plasticity.植物中的基因-环境互作:分子机制、环境驱动因素和适应性可塑性。
Plant Cell. 2023 Jan 2;35(1):109-124. doi: 10.1093/plcell/koac322.
6
Multi-environment Evaluations Across Ecological Regions Reveal That the Kernel Oil Content of Cottonseed Is Equally Determined by Genotype and Environment.多环境跨生态区评估表明,棉花仁的仁油含量同样由基因型和环境决定。
J Agric Food Chem. 2022 Mar 2;70(8):2529-2544. doi: 10.1021/acs.jafc.1c07082. Epub 2022 Feb 16.
7
Quantitative Trait Loci and Transcriptome Analysis Reveal Genetic Basis of Fiber Quality Traits in CCRI70 RIL Population of .数量性状基因座与转录组分析揭示了中棉所70重组自交系群体纤维品质性状的遗传基础。
Front Plant Sci. 2021 Dec 16;12:753755. doi: 10.3389/fpls.2021.753755. eCollection 2021.
8
Predicting phenotypes from genetic, environment, management, and historical data using CNNs.使用卷积神经网络从遗传、环境、管理和历史数据预测表型。
Theor Appl Genet. 2021 Dec;134(12):3997-4011. doi: 10.1007/s00122-021-03943-7. Epub 2021 Aug 27.
9
An integrated framework reinstating the environmental dimension for GWAS and genomic selection in crops.一种综合框架,为作物的 GWAS 和基因组选择重新引入环境维度。
Mol Plant. 2021 Jun 7;14(6):874-887. doi: 10.1016/j.molp.2021.03.010. Epub 2021 Mar 10.
10
Genomic mechanisms of climate adaptation in polyploid bioenergy switchgrass.多倍体生物能源柳枝稷的气候适应的基因组机制。
Nature. 2021 Feb;590(7846):438-444. doi: 10.1038/s41586-020-03127-1. Epub 2021 Jan 27.