• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测植物组织身份的基于表达的机器学习模型。

Expression-based machine learning models for predicting plant tissue identity.

作者信息

Palande Sourabh, Arsenault Jeremy, Basurto-Lozada Patricia, Bleich Andrew, Brown Brianna N I, Buysse Sophia F, Connors Noelle A, Das Adhikari Sikta, Dobson Kara C, Guerra-Castillo Francisco Xavier, Guerrero-Carrillo Maria F, Harlow Sophia, Herrera-Orozco Héctor, Hightower Asia T, Izquierdo Paulo, Jacobs MacKenzie, Johnson Nicholas A, Leuenberger Wendy, Lopez-Hernandez Alessandro, Luckie-Duque Alicia, Martínez-Avila Camila, Mendoza-Galindo Eddy J, Plancarte David Cruz, Schuster Jenny M, Shomer Harry, Sitar Sidney C, Steensma Anne K, Thomson Joanne Elise, Villaseñor-Amador Damián, Waterman Robin, Webster Brandon M, Whyte Madison, Zorilla-Azcué Sofía, Montgomery Beronda L, Husbands Aman Y, Krishnan Arjun, Percival Sarah, Munch Elizabeth, VanBuren Robert, Chitwood Daniel H, Rougon-Cardoso Alejandra

机构信息

Department of Computational Mathematics, Science and Engineering Michigan State University East Lansing Michigan USA.

Department of Computer Science and Engineering Michigan State University East Lansing Michigan USA.

出版信息

Appl Plant Sci. 2024 Oct 19;13(1):e11621. doi: 10.1002/aps3.11621. eCollection 2025 Jan-Feb.

DOI:10.1002/aps3.11621
PMID:39906497
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11788907/
Abstract

PREMISE

The selection of as a model organism played a pivotal role in advancing genomic science. The competing frameworks to select an agricultural- or ecological-based model species were rejected, in favor of building knowledge in a species that would facilitate genome-enabled research.

METHODS

Here, we examine the ability of models based on gene expression data to predict tissue identity in other flowering plants. Comparing different machine learning algorithms, models trained and tested on data achieved near perfect precision and recall values, whereas when tissue identity is predicted across the flowering plants using models trained on data, precision values range from 0.69 to 0.74 and recall from 0.54 to 0.64.

RESULTS

The identity of belowground tissue can be predicted more accurately than other tissue types, and the ability to predict tissue identity is not correlated with phylogenetic distance from . -nearest neighbors is the most successful algorithm, suggesting that gene expression signatures, rather than marker genes, are more valuable to create models for tissue and cell type prediction in plants.

DISCUSSION

Our data-driven results highlight that the assertion that knowledge from is translatable to other plants is not always true. Considering the current landscape of abundant sequencing data, we should reevaluate the scientific emphasis on and prioritize plant diversity.

摘要

前提

选择[具体物种]作为模式生物在推进基因组科学方面发挥了关键作用。选择基于农业或生态的模式物种的竞争框架被否决,转而支持在一个有助于开展基因组研究的物种中积累知识。

方法

在此,我们研究基于[具体物种]基因表达数据的模型预测其他开花植物组织身份的能力。比较不同的机器学习算法,在[具体物种]数据上训练和测试的模型实现了近乎完美的精确率和召回率值,而当使用在[具体物种]数据上训练的模型预测整个开花植物的组织身份时,精确率值范围为0.69至0.74,召回率为0.54至0.64。

结果

地下组织的身份比其他组织类型能更准确地被预测,并且预测组织身份的能力与与[具体物种]的系统发育距离无关。k近邻算法是最成功的算法,这表明基因表达特征而非标记基因对于创建植物组织和细胞类型预测模型更有价值。

讨论

我们的数据驱动结果凸显了认为来自[具体物种]的知识可转化到其他植物的观点并非总是正确的。考虑到当前丰富测序数据的情况,我们应该重新评估对[具体物种]的科学重视程度,并优先考虑植物多样性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/6646783bd7be/APS3-13-e11621-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/6521567c321b/APS3-13-e11621-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/e1af622b05e8/APS3-13-e11621-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/0dc44de6133f/APS3-13-e11621-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/6646783bd7be/APS3-13-e11621-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/6521567c321b/APS3-13-e11621-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/e1af622b05e8/APS3-13-e11621-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/0dc44de6133f/APS3-13-e11621-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5499/11788907/6646783bd7be/APS3-13-e11621-g002.jpg

相似文献

1
Expression-based machine learning models for predicting plant tissue identity.用于预测植物组织身份的基于表达的机器学习模型。
Appl Plant Sci. 2024 Oct 19;13(1):e11621. doi: 10.1002/aps3.11621. eCollection 2025 Jan-Feb.
2
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.
3
AtML: An Arabidopsis thaliana root cell identity recognition tool for medicinal ingredient accumulation.AtML:一种用于药用成分积累的拟南芥根细胞身份识别工具。
Methods. 2024 Nov;231:61-69. doi: 10.1016/j.ymeth.2024.09.010. Epub 2024 Sep 16.
4
A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction.植物miRNA靶标预测工具的性能比较及全基因组靶标预测特征的表征
BMC Genomics. 2014 May 8;15(1):348. doi: 10.1186/1471-2164-15-348.
5
QTG-Finder2: A Generalized Machine-Learning Algorithm for Prioritizing QTL Causal Genes in Plants.QTG-Finder2:一种用于对植物数量性状基因座(QTL)因果基因进行优先级排序的广义机器学习算法。
G3 (Bethesda). 2020 Jul 7;10(7):2411-2421. doi: 10.1534/g3.120.401122.
6
QTG-Finder: A Machine-Learning Based Algorithm To Prioritize Causal Genes of Quantitative Trait Loci in Arabidopsis and Rice.QTG-Finder:一种基于机器学习的算法,用于优先考虑拟南芥和水稻中数量性状位点的因果基因。
G3 (Bethesda). 2019 Oct 7;9(10):3129-3138. doi: 10.1534/g3.119.400319.
7
The major clades of MADS-box genes and their role in the development and evolution of flowering plants.MADS盒基因的主要进化枝及其在开花植物发育和进化中的作用。
Mol Phylogenet Evol. 2003 Dec;29(3):464-89. doi: 10.1016/s1055-7903(03)00207-0.
8
RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors.RLPredictiOme,一种基于机器学习的高通量植物类受体蛋白预测方法,揭示了新型跨膜受体类别。
Int J Mol Sci. 2022 Oct 12;23(20):12176. doi: 10.3390/ijms232012176.
9
MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads.MAC-ErrorReads:一种基于机器学习的分类器,用于过滤错误的 NGS 读取。
BMC Bioinformatics. 2024 Feb 7;25(1):61. doi: 10.1186/s12859-024-05681-1.
10
StressGenePred: a twin prediction model architecture for classifying the stress types of samples and discovering stress-related genes in arabidopsis.StressGenePred:一种用于对样本的应激类型进行分类和发现拟南芥中与应激相关基因的双胞胎预测模型架构。
BMC Genomics. 2019 Dec 20;20(Suppl 11):949. doi: 10.1186/s12864-019-6283-z.

本文引用的文献

1
The Arabidopsis Information Resource in 2024.2024 年的拟南芥信息资源。
Genetics. 2024 May 7;227(1). doi: 10.1093/genetics/iyae027.
2
Topological data analysis reveals a core gene expression backbone that defines form and function across flowering plants.拓扑数据分析揭示了一个核心基因表达主干,它定义了开花植物的形态和功能。
PLoS Biol. 2023 Dec 5;21(12):e3002397. doi: 10.1371/journal.pbio.3002397. eCollection 2023 Dec.
3
A critical analysis of plant science literature reveals ongoing inequities.对植物科学文献的批判性分析揭示了持续存在的不平等现象。
Proc Natl Acad Sci U S A. 2023 Mar 7;120(10):e2217564120. doi: 10.1073/pnas.2217564120. Epub 2023 Feb 28.
4
Renaming Indigenous crops and addressing colonial bias in scientific language.重新命名本土作物并解决科学语言中的殖民偏见。
Trends Plant Sci. 2022 Dec;27(12):1189-1192. doi: 10.1016/j.tplants.2022.08.022. Epub 2022 Sep 23.
5
Exploiting plant transcriptomic databases: Resources, tools, and approaches.利用植物转录组数据库:资源、工具和方法。
Plant Commun. 2022 Jul 11;3(4):100323. doi: 10.1016/j.xplc.2022.100323. Epub 2022 Apr 9.
6
Plant Public RNA-seq Database: a comprehensive online database for expression analysis of ~45 000 plant public RNA-Seq libraries.植物公共RNA测序数据库:一个用于约45000个植物公共RNA测序文库表达分析的综合在线数据库。
Plant Biotechnol J. 2022 May;20(5):806-808. doi: 10.1111/pbi.13798. Epub 2022 Mar 6.
7
Representation and participation across 20 years of plant genome sequencing.二十年来植物基因组测序的表现与参与。
Nat Plants. 2021 Dec;7(12):1571-1578. doi: 10.1038/s41477-021-01031-8. Epub 2021 Nov 29.
8
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
9
Comparative transcriptomic analysis reveals conserved programmes underpinning organogenesis and reproduction in land plants.比较转录组学分析揭示了陆地植物器官发生和生殖的保守程序。
Nat Plants. 2021 Aug;7(8):1143-1159. doi: 10.1038/s41477-021-00958-2. Epub 2021 Jul 12.
10
Unlocking the potential of plant phenotyping data through integration and data-driven approaches.通过整合和数据驱动方法释放植物表型数据的潜力。
Curr Opin Syst Biol. 2017 Aug;4:58-63. doi: 10.1016/j.coisb.2017.07.002.