• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过机器学习和差异基因相关性分析揭示黄瓜中的耐涝基因

Uncovering waterlogging-responsive genes in cucumber through machine learning and differential gene correlation analysis.

作者信息

Zinati Zahra, Nazari Leyla, Niazi Ali

机构信息

Department of Agroecology, College of Agriculture and Natural Resources of Darab, Shiraz University, Shiraz, Iran.

Crop and Horticultural Science Research Department, Fars Agricultural and Natural Resources Research and Education Center, Agricultural Research, Education and Extension Organization (AREEO), Shiraz, Iran.

出版信息

Bot Stud. 2024 Aug 14;65(1):25. doi: 10.1186/s40529-024-00433-z.

DOI:10.1186/s40529-024-00433-z
PMID:39141059
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11324642/
Abstract

As climate change intensifies, the frequency and severity of waterlogging are expected to increase, necessitating a deeper understanding of the cucumber response to this stress. In this study, three public RNA-seq datasets (PRJNA799460, PRJNA844418, and PRJNA678740) comprising 36 samples were analyzed. Various feature selection algorithms including Uncertainty, Relief, SVM (Support Vector Machine), Correlation, and logistic least absolute shrinkage, and selection operator (LASSO) were performed to identify the most significant genes related to the waterlogging stress response. These feature selection techniques, which have different characteristics, were used to reduce the complexity of the data and thereby identify the most significant genes related to the waterlogging stress response. Uncertainty, Relief, SVM, Correlation, and LASSO identified 4, 4, 10, 21, and 13 genes, respectively. Differential gene correlation analysis (DGCA) focusing on the 36 selected genes identified changes in correlation patterns between the selected genes under waterlogged versus control conditions, providing deeper insights into the regulatory networks and interactions among the selected genes. DGCA revealed significant changes in the correlation of 13 genes between control and waterlogging conditions. Finally, we validated 13 genes using the Random Forest (RF) classifier, which achieved 100% accuracy and a 1.0 Area Under the Curve (AUC) score. The SHapley Additive exPlanations (SHAP) values clearly showed the significant impact of LOC101209599, LOC101217277, and LOC101216320 on the model's predictive power. In addition, we employed the Boruta as a wrapper feature selection method to further validate our gene selection strategy. Eight of the 13 genes were common across the four feature weighting algorithms, LASSO, DGCA, and Boruta, underscoring the robustness and reliability of our gene selection strategy. Notably, the genes LOC101209599, LOC101217277, and LOC101216320 were among genes identified by multiple feature selection methods from different categories (filtering, wrapper, and embedded). Pathways associated with these specific genes play a pivotal role in regulating stress tolerance, root development, nutrient absorption, sugar metabolism, gene expression, protein degradation, and calcium signaling. These intricate regulatory mechanisms are crucial for cucumbers to adapt effectively to waterlogging conditions. These findings provide valuable insights for uncovering targets in breeding new cucumber varieties with enhanced stress tolerance.

摘要

随着气候变化加剧,预计涝渍的频率和严重程度将会增加,因此有必要更深入地了解黄瓜对这种胁迫的反应。在本研究中,对包含36个样本的三个公共RNA测序数据集(PRJNA799460、PRJNA844418和PRJNA678740)进行了分析。采用了包括不确定性、Relief、支持向量机(SVM)、相关性以及逻辑最小绝对收缩和选择算子(LASSO)在内的各种特征选择算法,以识别与涝渍胁迫反应相关的最重要基因。这些具有不同特征的特征选择技术被用于降低数据的复杂性,从而识别与涝渍胁迫反应相关的最重要基因。不确定性、Relief、SVM、相关性和LASSO分别识别出4个、4个、10个、21个和13个基因。针对这36个选定基因的差异基因相关性分析(DGCA)确定了涝渍条件与对照条件下选定基因之间相关性模式的变化,从而更深入地了解选定基因之间的调控网络和相互作用。DGCA揭示了对照和涝渍条件下13个基因的相关性有显著变化。最后,我们使用随机森林(RF)分类器对13个基因进行了验证,其准确率达到100%,曲线下面积(AUC)得分为1.0。SHapley加法解释(SHAP)值清楚地显示了LOC101209599、LOC101217277和LOC101216320对模型预测能力的显著影响。此外,我们采用Boruta作为一种包装特征选择方法来进一步验证我们的基因选择策略。13个基因中的8个在LASSO、DGCA和Boruta这四种特征加权算法中是共有的,这突出了我们基因选择策略的稳健性和可靠性。值得注意的是,LOC101209599、LOC101217277和LOC101216320这几个基因是通过来自不同类别(过滤、包装和嵌入)的多种特征选择方法识别出来的。与这些特定基因相关的通路在调节胁迫耐受性、根系发育、养分吸收、糖代谢、基因表达、蛋白质降解和钙信号传导中起着关键作用。这些复杂的调控机制对于黄瓜有效适应涝渍条件至关重要。这些发现为揭示培育具有增强胁迫耐受性的新黄瓜品种的目标提供了有价值的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/c2d7594cf10b/40529_2024_433_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/7c7ff8e3363e/40529_2024_433_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/bf3b62a36532/40529_2024_433_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/e0feff55d6bb/40529_2024_433_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/9fc093499c55/40529_2024_433_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/c2d7594cf10b/40529_2024_433_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/7c7ff8e3363e/40529_2024_433_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/bf3b62a36532/40529_2024_433_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/e0feff55d6bb/40529_2024_433_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/9fc093499c55/40529_2024_433_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/47a6/11324642/c2d7594cf10b/40529_2024_433_Fig5_HTML.jpg

相似文献

1
Uncovering waterlogging-responsive genes in cucumber through machine learning and differential gene correlation analysis.通过机器学习和差异基因相关性分析揭示黄瓜中的耐涝基因
Bot Stud. 2024 Aug 14;65(1):25. doi: 10.1186/s40529-024-00433-z.
2
Novel candidate genes for environmental stresses response in Synechocystis sp. PCC 6803 revealed by machine learning algorithms.利用机器学习算法揭示集胞藻 PCC 6803 中环境胁迫反应的新候选基因。
Braz J Microbiol. 2024 Jun;55(2):1219-1229. doi: 10.1007/s42770-024-01338-6. Epub 2024 May 6.
3
Comparative performance analysis of Boruta, SHAP, and Borutashap for disease diagnosis: A study with multiple machine learning algorithms.用于疾病诊断的Boruta、SHAP和Borutashap的比较性能分析:一项使用多种机器学习算法的研究。
Network. 2024 Mar 21:1-38. doi: 10.1080/0954898X.2024.2331506.
4
Comparative RNA-seq based transcriptome profiling of waterlogging response in cucumber hypocotyls reveals novel insights into the de novo adventitious root primordia initiation.基于比较RNA测序的黄瓜下胚轴涝害响应转录组分析揭示了从头不定根原基起始的新见解。
BMC Plant Biol. 2017 Jul 26;17(1):129. doi: 10.1186/s12870-017-1081-8.
5
A comparative study on urban waterlogging susceptibility assessment based on multiple data-driven models.基于多种数据驱动模型的城市内涝易发性评估的对比研究。
J Environ Manage. 2024 Jun;360:121166. doi: 10.1016/j.jenvman.2024.121166. Epub 2024 May 22.
6
RNA-seq-based comparative transcriptome analysis reveals the role of in waterlogging-triggered adventitious root formation in cucumber.基于RNA测序的比较转录组分析揭示了[具体内容缺失]在黄瓜淹水诱导不定根形成中的作用。
Hortic Res. 2024 Feb 28;11(4):uhae062. doi: 10.1093/hr/uhae062. eCollection 2024 Apr.
7
Optimizing prognostic factors of five-year survival in gastric cancer patients using feature selection techniques with machine learning algorithms: a comparative study.使用机器学习算法进行特征选择技术优化胃癌患者五年生存率的预后因素:一项比较研究。
BMC Med Inform Decis Mak. 2023 Apr 6;23(1):54. doi: 10.1186/s12911-023-02154-y.
8
Comparative Proteomic Analysis Provides Insight into the Key Proteins Involved in Cucumber ( L.) Adventitious Root Emergence under Waterlogging Stress.比较蛋白质组学分析揭示了黄瓜(L.)在渍水胁迫下不定根形成过程中的关键蛋白。
Front Plant Sci. 2016 Oct 13;7:1515. doi: 10.3389/fpls.2016.01515. eCollection 2016.
9
Gene targeting in amyotrophic lateral sclerosis using causality-based feature selection and machine learning.使用基于因果关系的特征选择和机器学习进行肌萎缩侧索硬化症的基因靶向治疗。
Mol Med. 2023 Jan 24;29(1):12. doi: 10.1186/s10020-023-00603-y.
10
Transcriptional survey of abiotic stress response in maize () in the level of gene co-expression network and differential gene correlation analysis.基于基因共表达网络水平和差异基因相关性分析的玉米非生物胁迫响应转录组学研究
AoB Plants. 2023 Dec 22;16(1):plad087. doi: 10.1093/aobpla/plad087. eCollection 2024 Jan.

本文引用的文献

1
Genome-wide analysis of R2R3-MYB genes in cultivated peanut ( L.): Gene duplications, functional conservation, and diversification.栽培花生(Arachis hypogaea L.)中R2R3-MYB基因的全基因组分析:基因复制、功能保守性和多样性
Front Plant Sci. 2023 Feb 14;14:1102174. doi: 10.3389/fpls.2023.1102174. eCollection 2023.
2
Heat and drought induced transcriptomic changes in barley varieties with contrasting stress response phenotypes.高温和干旱诱导具有不同胁迫反应表型的大麦品种的转录组变化。
Front Plant Sci. 2022 Dec 8;13:1066421. doi: 10.3389/fpls.2022.1066421. eCollection 2022.
3
Plant sugar metabolism, transport and signalling in challenging environments.
植物在具有挑战性的环境中的糖代谢、运输与信号传导
Physiol Plant. 2022 Sep;174(5):e13768. doi: 10.1111/ppl.13768.
4
Mutation-based Binary Aquila optimizer for gene selection in cancer classification.基于突变的二进制 Aqlua 优化器在癌症分类中的基因选择。
Comput Biol Chem. 2022 Dec;101:107767. doi: 10.1016/j.compbiolchem.2022.107767. Epub 2022 Sep 5.
5
ATP-Binding Cassette G Transporters and Their Multiple Roles Especially for Male Fertility in , Rice and Maize.三磷酸腺苷结合盒 G 型转运蛋白及其在水稻和玉米中的多种功能,尤其是对雄性育性的影响。
Int J Mol Sci. 2022 Aug 18;23(16):9304. doi: 10.3390/ijms23169304.
6
Short waterlogging events differently affect morphology and photosynthesis of two cucumber ( L.) cultivars.短期渍水事件对两个黄瓜(L.)品种的形态和光合作用有不同影响。
Front Plant Sci. 2022 Jul 22;13:896244. doi: 10.3389/fpls.2022.896244. eCollection 2022.
7
Crosstalk between Ca and Other Regulators Assists Plants in Responding to Abiotic Stress.钙与其他调节因子之间的相互作用有助于植物应对非生物胁迫。
Plants (Basel). 2022 May 19;11(10):1351. doi: 10.3390/plants11101351.
8
Genome-wide identification, evolutionary and functional analyses of KFB family members in potato.马铃薯 KFB 家族成员的全基因组鉴定、进化和功能分析。
BMC Plant Biol. 2022 May 2;22(1):226. doi: 10.1186/s12870-022-03611-y.
9
The Importance of Networking: Plant Polycomb Repressive Complex 2 and Its Interactors.网络的重要性:植物多梳抑制复合物2及其相互作用分子
Epigenomes. 2022 Mar 3;6(1):8. doi: 10.3390/epigenomes6010008.
10
2021 update on ATP-binding cassette (ABC) transporters: how they meet the needs of plants.2021 年关于三磷酸腺苷结合盒(ABC)转运蛋白的更新:它们如何满足植物的需求。
Plant Physiol. 2021 Dec 4;187(4):1876-1892. doi: 10.1093/plphys/kiab193.