• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用皮尔逊相关系数作为衡量数量性状预测准确性的唯一指标:这足够吗?

Using the Pearson's correlation coefficient as the sole metric to measure the accuracy of quantitative trait prediction: is it sufficient?

作者信息

Pan Shouhui, Liu Zhongqiang, Han Yanyun, Zhang Dongfeng, Zhao Xiangyu, Li Jinlong, Wang Kaiyi

机构信息

Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China.

National Engineering Research Center for Information Technology in Agriculture, Beijing, China.

出版信息

Front Plant Sci. 2024 Dec 10;15:1480463. doi: 10.3389/fpls.2024.1480463. eCollection 2024.

DOI:10.3389/fpls.2024.1480463
PMID:39719937
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11667204/
Abstract

How to evaluate the accuracy of quantitative trait prediction is crucial to choose the best model among several possible choices in plant breeding. Pearson's correlation coefficient (PCC), serving as a metric for quantifying the strength of the linear association between two variables, is widely used to evaluate the accuracy of the quantitative trait prediction models, and generally performs well in most circumstances. However, PCC may not always offer a comprehensive view of predictive accuracy, especially in cases involving nonlinear relationships or complex dependencies in machine learning-based methods. It has been found that many papers on quantitative trait prediction solely use PCC as a single metric to evaluate the accuracy of their models, which is insufficient and limited from a formal perspective. This study addresses this crucial issue by presenting a typical example and conducting a comparative analysis of PCC and nine other evaluation metrics using four traditional methods and four machine learning-based methods, thereby contributing to the improvement of practical applicability and reliability of plant quantitative trait prediction models. It is recommended to employ PCC in conjunction with other evaluation metrics in a targeted manner based on specific application scenarios to reduce the likelihood of drawing misleading conclusions.

摘要

如何评估数量性状预测的准确性对于在植物育种的几种可能选择中选择最佳模型至关重要。皮尔逊相关系数(PCC)作为量化两个变量之间线性关联强度的指标,被广泛用于评估数量性状预测模型的准确性,并且在大多数情况下通常表现良好。然而,PCC可能并不总是能全面反映预测准确性,特别是在基于机器学习的方法中涉及非线性关系或复杂依赖关系的情况下。已经发现,许多关于数量性状预测的论文仅使用PCC作为单一指标来评估其模型的准确性,从形式角度来看,这是不够的且有局限性。本研究通过给出一个典型例子,并使用四种传统方法和四种基于机器学习的方法对PCC和其他九个评估指标进行比较分析,解决了这个关键问题,从而有助于提高植物数量性状预测模型的实际适用性和可靠性。建议根据具体应用场景有针对性地将PCC与其他评估指标结合使用,以减少得出误导性结论的可能性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26fa/11667204/1667ad5beff9/fpls-15-1480463-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26fa/11667204/2ebb8aded74e/fpls-15-1480463-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26fa/11667204/1667ad5beff9/fpls-15-1480463-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26fa/11667204/2ebb8aded74e/fpls-15-1480463-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26fa/11667204/1667ad5beff9/fpls-15-1480463-g002.jpg

相似文献

1
Using the Pearson's correlation coefficient as the sole metric to measure the accuracy of quantitative trait prediction: is it sufficient?使用皮尔逊相关系数作为衡量数量性状预测准确性的唯一指标:这足够吗?
Front Plant Sci. 2024 Dec 10;15:1480463. doi: 10.3389/fpls.2024.1480463. eCollection 2024.
2
Residual network improves the prediction accuracy of genomic selection.残差网络提高基因组选择的预测准确性。
Anim Genet. 2024 Aug;55(4):599-611. doi: 10.1111/age.13445. Epub 2024 May 15.
3
Multi-trait genomic-enabled prediction enhances accuracy in multi-year wheat breeding trials.多性状基因组赋能预测提高了多年小麦育种试验的准确性。
G3 (Bethesda). 2021 Sep 27;11(10). doi: 10.1093/g3journal/jkab270.
4
New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes.基于深度学习的基因组的新型预测模型,用于具有二元、有序和连续表型的多个特征。
G3 (Bethesda). 2019 May 7;9(5):1545-1556. doi: 10.1534/g3.119.300585.
5
Multi-trait, Multi-environment Deep Learning Modeling for Genomic-Enabled Prediction of Plant Traits.用于基于基因组的植物性状预测的多性状、多环境深度学习建模
G3 (Bethesda). 2018 Dec 10;8(12):3829-3840. doi: 10.1534/g3.118.200728.
6
Genomic-enabled prediction with classification algorithms.使用分类算法的基因组预测
Heredity (Edinb). 2014 Jun;112(6):616-26. doi: 10.1038/hdy.2013.144. Epub 2014 Jan 15.
7
A comparative study of 11 non-linear regression models highlighting autoencoder, DBN, and SVR, enhanced by SHAP importance analysis in soybean branching prediction.一项对11种非线性回归模型的比较研究,重点关注自动编码器、深度信念网络和支持向量回归,并通过SHAP重要性分析在大豆分枝预测中得到增强。
Sci Rep. 2024 Mar 11;14(1):5905. doi: 10.1038/s41598-024-55243-x.
8
A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding.深度学习、支持向量机和贝叶斯阈值最佳线性无偏预测在植物育种中预测有序性状的基准比较
G3 (Bethesda). 2019 Feb 7;9(2):601-618. doi: 10.1534/g3.118.200998.
9
Genomic Prediction Accuracy of Seven Breeding Selection Traits Improved by QTL Identification in Flax.通过 QTL 鉴定提高亚麻七种育种选择性状的基因组预测准确性。
Int J Mol Sci. 2020 Feb 25;21(5):1577. doi: 10.3390/ijms21051577.
10
Genomic selection in plant breeding: Key factors shaping two decades of progress.植物育种中的基因组选择:塑造二十年进展的关键因素。
Mol Plant. 2024 Apr 1;17(4):552-578. doi: 10.1016/j.molp.2024.03.007. Epub 2024 Mar 12.

引用本文的文献

1
Psychopathological Influences on Surgical and Clinical Outcomes in Lumbar Disk Herniation: Prediction Models and Literature Analysis.腰椎间盘突出症手术及临床结局的精神病理学影响:预测模型与文献分析
J Pers Med. 2025 Jan 26;15(2):48. doi: 10.3390/jpm15020048.
2
Clinical validation of an abridged AIMS: Streamlining motor screening in the first-year infant.简化版婴幼儿运动评估量表(AIMS)的临床验证:优化对一岁婴儿的运动筛查
Early Hum Dev. 2025 Mar;202:106207. doi: 10.1016/j.earlhumdev.2025.106207. Epub 2025 Feb 3.

本文引用的文献

1
TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.TrG2P:一种基于迁移学习的工具,集成多性状数据,用于准确预测作物产量。
Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.
2
To have value, comparisons of high-throughput phenotyping methods need statistical tests of bias and variance.为了具有价值,高通量表型分析方法的比较需要进行偏差和方差的统计检验。
Front Plant Sci. 2024 Jan 19;14:1325221. doi: 10.3389/fpls.2023.1325221. eCollection 2023.
3
DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants.
DNNGP,一种基于深度神经网络的方法,用于利用植物中的多组学数据进行基因组预测。
Mol Plant. 2023 Jan 2;16(1):279-293. doi: 10.1016/j.molp.2022.11.004. Epub 2022 Nov 10.
4
GWAS, MWAS and mGWAS provide insights into precision agriculture based on genotype-dependent microbial effects in foxtail millet.GWAS、MWAS 和 mGWAS 基于谷子中基因型依赖性微生物效应,为精准农业提供了新的见解。
Nat Commun. 2022 Oct 7;13(1):5913. doi: 10.1038/s41467-022-33238-4.
5
Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction.由大数据、人工智能和综合基因组-环境预测驱动的智能育种。
Mol Plant. 2022 Nov 7;15(11):1664-1695. doi: 10.1016/j.molp.2022.09.001. Epub 2022 Sep 7.
6
Deep neural networks for genomic prediction do not estimate marker effects.深度神经网络在基因组预测中并不估计标记效应。
Plant Genome. 2021 Nov;14(3):e20147. doi: 10.1002/tpg2.20147. Epub 2021 Oct 1.
7
LightGBM: accelerated genomically designed crop breeding through ensemble learning.LightGBM:通过集成学习加速基因组设计的作物育种。
Genome Biol. 2021 Sep 20;22(1):271. doi: 10.1186/s13059-021-02492-y.
8
GMStool: GWAS-based marker selection tool for genomic prediction from genomic data.GMStool:基于 GWAS 的基因组预测标记选择工具,用于从基因组数据中进行基因组预测。
Sci Rep. 2020 Nov 12;10(1):19653. doi: 10.1038/s41598-020-76759-y.
9
Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.深度学习与参数化和集成方法在复杂表型基因组预测中的比较。
Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.
10
Maize genomes to fields (G2F): 2014-2017 field seasons: genotype, phenotype, climatic, soil, and inbred ear image datasets.玉米基因组到田间(G2F):2014 - 2017年田间季:基因型、表型、气候、土壤和自交系果穗图像数据集。
BMC Res Notes. 2020 Feb 12;13(1):71. doi: 10.1186/s13104-020-4922-8.