• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用组合自动编码器解析基因型和环境特异性潜在特征以改进性状预测。

Disentangling genotype and environment specific latent features for improved trait prediction using a compositional autoencoder.

作者信息

Powadi Anirudha, Jubery Talukder Zaki, Tross Michael C, Schnable James C, Ganapathysubramanian Baskar

机构信息

Department of Electrical and Computer Engineering, Iowa State University, Ames, IA, United States.

Translational AI Research and Education Center, Iowa State University, Ames, IA, United States.

出版信息

Front Plant Sci. 2024 Dec 16;15:1476070. doi: 10.3389/fpls.2024.1476070. eCollection 2024.

DOI:10.3389/fpls.2024.1476070
PMID:39741666
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11686434/
Abstract

In plant breeding and genetics, predictive models traditionally rely on compact representations of high-dimensional data, often using methods like Principal Component Analysis (PCA) and, more recently, Autoencoders (AE). However, these methods do not separate genotype-specific and environment-specific features, limiting their ability to accurately predict traits influenced by both genetic and environmental factors. We hypothesize that disentangling these representations into genotype-specific and environment-specific components can enhance predictive models. To test this, we developed a compositional autoencoder (CAE) that decomposes high-dimensional data into distinct genotype-specific and environment-specific latent features. Our CAE framework employed a hierarchical architecture within an autoencoder to effectively separate these entangled latent features. Applied to a maize diversity panel dataset, the CAE demonstrated superior modeling of environmental influences and out-performs PCA (principal component analysis), PLSR (Partial Least square regression) and vanilla autoencoders by 7 times for 'Days to Pollen' trait and 10 times improved predictive performance for 'Yield'. By disentangling latent features, the CAE provided a powerful tool for precision breeding and genetic research. This work has significantly enhanced trait prediction models, advancing agricultural and biological sciences.

摘要

在植物育种和遗传学中,传统的预测模型依赖于高维数据的紧凑表示,通常使用主成分分析(PCA)等方法,以及最近的自动编码器(AE)。然而,这些方法没有分离出基因型特异性和环境特异性特征,限制了它们准确预测受遗传和环境因素影响的性状的能力。我们假设,将这些表示分解为基因型特异性和环境特异性成分可以增强预测模型。为了验证这一点,我们开发了一种组合自动编码器(CAE),它将高维数据分解为不同的基因型特异性和环境特异性潜在特征。我们的CAE框架在自动编码器中采用了分层架构,以有效地分离这些纠缠的潜在特征。应用于玉米多样性面板数据集时,CAE在环境影响建模方面表现出色,在“花粉形成天数”性状上比主成分分析(PCA)、偏最小二乘回归(PLSR)和普通自动编码器的性能高出7倍,在“产量”性状上预测性能提高了10倍。通过解开潜在特征,CAE为精准育种和遗传研究提供了一个强大的工具。这项工作显著增强了性状预测模型,推动了农业和生物科学的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/ed4cccca4403/fpls-15-1476070-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/0a2c4b8422bd/fpls-15-1476070-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/42865671209a/fpls-15-1476070-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/2f6c7eaef3ed/fpls-15-1476070-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/7a1d3215b1f8/fpls-15-1476070-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/a9aaa1988137/fpls-15-1476070-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/353311280d16/fpls-15-1476070-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/cb80d0a608eb/fpls-15-1476070-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/1d922d51aee5/fpls-15-1476070-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/ed4cccca4403/fpls-15-1476070-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/0a2c4b8422bd/fpls-15-1476070-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/42865671209a/fpls-15-1476070-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/2f6c7eaef3ed/fpls-15-1476070-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/7a1d3215b1f8/fpls-15-1476070-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/a9aaa1988137/fpls-15-1476070-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/353311280d16/fpls-15-1476070-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/cb80d0a608eb/fpls-15-1476070-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/1d922d51aee5/fpls-15-1476070-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/712a/11686434/ed4cccca4403/fpls-15-1476070-g009.jpg

相似文献

1
Disentangling genotype and environment specific latent features for improved trait prediction using a compositional autoencoder.使用组合自动编码器解析基因型和环境特异性潜在特征以改进性状预测。
Front Plant Sci. 2024 Dec 16;15:1476070. doi: 10.3389/fpls.2024.1476070. eCollection 2024.
2
Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement.序贯变分自动编码器与对抗分类器在视频解缠中的应用
Sensors (Basel). 2023 Feb 24;23(5):2515. doi: 10.3390/s23052515.
3
Genomic Prediction Accounting for Genotype by Environment Interaction Offers an Effective Framework for Breeding Simultaneously for Adaptation to an Abiotic Stress and Performance Under Normal Cropping Conditions in Rice.考虑基因型与环境互作的基因组预测为水稻同时选育适应非生物胁迫的品种和正常种植条件下的表现提供了一个有效框架。
G3 (Bethesda). 2018 Jul 2;8(7):2319-2332. doi: 10.1534/g3.118.200098.
4
Combining Crop Growth Modeling and Statistical Genetic Modeling to Evaluate Phenotyping Strategies.结合作物生长模型与统计遗传模型评估表型分析策略。
Front Plant Sci. 2019 Nov 27;10:1491. doi: 10.3389/fpls.2019.01491. eCollection 2019.
5
Genomic prediction applied to multiple traits and environments in second season maize hybrids.基因组预测在第二季玉米杂交种的多个性状和环境中的应用。
Heredity (Edinb). 2020 Aug;125(1-2):60-72. doi: 10.1038/s41437-020-0321-0. Epub 2020 May 29.
6
TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.TrG2P:一种基于迁移学习的工具,集成多性状数据,用于准确预测作物产量。
Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.
7
High-throughput field phenotyping using hyperspectral reflectance and partial least squares regression (PLSR) reveals genetic modifications to photosynthetic capacity.利用高光谱反射率和偏最小二乘回归(PLSR)进行高通量田间表型分析揭示了光合能力的遗传修饰。
Remote Sens Environ. 2019 Sep 15;231:111176. doi: 10.1016/j.rse.2019.04.029.
8
Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective Study.无监督特征选择以识别冠心病患者队列机器学习中的重要国际疾病分类第十版(ICD - 10)和解剖治疗化学分类系统(ATC)编码:回顾性研究
JMIR Med Inform. 2024 Jul 26;12:e52896. doi: 10.2196/52896.
9
From QTLs to Adaptation Landscapes: Using Genotype-To-Phenotype Models to Characterize G×E Over Time.从数量性状基因座到适应景观:利用基因型到表型模型来刻画随时间变化的基因与环境互作。
Front Plant Sci. 2019 Dec 4;10:1540. doi: 10.3389/fpls.2019.01540. eCollection 2019.
10
A study on hybrid-architecture deep learning model for predicting pressure distribution in 2D airfoils.一种用于预测二维翼型压力分布的混合架构深度学习模型的研究。
Sci Rep. 2025 Jan 16;15(1):2155. doi: 10.1038/s41598-024-84940-w.

引用本文的文献

1
Gaining insights into epigenetic memories through artificial intelligence and omics science in plants.通过人工智能和植物组学科学深入了解表观遗传记忆。
J Integr Plant Biol. 2025 Sep;67(9):2320-2349. doi: 10.1111/jipb.13953. Epub 2025 Jun 24.

本文引用的文献

1
Disentangling the latent space of GANs for semantic face editing.解纠缠 GAN 的潜在空间进行语义人脸编辑。
PLoS One. 2023 Oct 26;18(10):e0293496. doi: 10.1371/journal.pone.0293496. eCollection 2023.
2
Hyperspectral imaging combined with CNN for maize variety identification.高光谱成像结合卷积神经网络用于玉米品种识别。
Front Plant Sci. 2023 Sep 8;14:1254548. doi: 10.3389/fpls.2023.1254548. eCollection 2023.
3
Temporal phenomic predictions from unoccupied aerial systems can outperform genomic predictions.无人航空系统的时间表型预测可以优于基因组预测。
G3 (Bethesda). 2023 Jan 12;13(1). doi: 10.1093/g3journal/jkac294.
4
Association mapping across a multitude of traits collected in diverse environments in maize.玉米在多种环境下收集的众多性状的关联作图。
Gigascience. 2022 Aug 23;11. doi: 10.1093/gigascience/giac080.
5
Variational autoencoders learn transferrable representations of metabolomics data.变分自编码器可学习代谢组学数据的可迁移表示。
Commun Biol. 2022 Jun 30;5(1):645. doi: 10.1038/s42003-022-03579-3.
6
Towards Disentangling Latent Space for Unsupervised Semantic Face Editing.面向无监督语义人脸编辑的潜在空间解缠。
IEEE Trans Image Process. 2022;31:1475-1489. doi: 10.1109/TIP.2022.3142527. Epub 2022 Jan 27.
7
Capturing discrete latent structures: choose LDs over PCs.捕捉离散潜在结构:选择潜在因子而非主成分。
Biostatistics. 2022 Dec 12;24(1):1-16. doi: 10.1093/biostatistics/kxab030.
8
Latent representation learning in biology and translational medicine.生物学与转化医学中的潜在表征学习
Patterns (N Y). 2021 Mar 12;2(3):100198. doi: 10.1016/j.patter.2021.100198.
9
Latent Space Phenotyping: Automatic Image-Based Phenotyping for Treatment Studies.潜在空间表型分析:用于治疗研究的基于图像的自动表型分析
Plant Phenomics. 2020 Jan 20;2020:5801869. doi: 10.34133/2020/5801869. eCollection 2020.
10
A Style-Based Generator Architecture for Generative Adversarial Networks.基于风格的生成对抗网络生成器架构。
IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4217-4228. doi: 10.1109/TPAMI.2020.2970919. Epub 2021 Nov 3.