• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过有效诊断影响观测的方法提高基因组预测准确性的统计方法。

Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation.

机构信息

Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, 110012, New Delhi, India.

出版信息

Sci Rep. 2020 May 21;10(1):8408. doi: 10.1038/s41598-020-65323-3.

DOI:10.1038/s41598-020-65323-3
PMID:32439883
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7242349/
Abstract

It is expected the predictive performance of genomic prediction methods may be adversely affected in the presence of outliers. In agriculture science an outlier may arise due to wrong data imputation, outlying response, and in a series of trials over the time or location. Although several statistical procedures are already there in literature for identification of outlier but identification of true outlier is still a challenge especially in case of high dimensional genomic data. Here we have proposed an efficient approach for detecting outlier in high dimensional genomic data, our approach is p-value based combination methods to produce single p-value for detecting the outliers. Robustness of our approach has been tested using simulated data through the evaluation measures like precision, recall etc. It has been observed that significant improvement in the performance of genomic prediction has been obtained by detecting the outliers and handling them accordingly through our proposed approach using real data.

摘要

预计在存在异常值的情况下,基因组预测方法的预测性能可能会受到不利影响。在农业科学中,异常值可能由于错误的数据插补、异常的响应以及随着时间或地点的一系列试验而产生。尽管文献中已经存在几种用于识别异常值的统计程序,但识别真正的异常值仍然是一个挑战,特别是在高维基因组数据的情况下。在这里,我们提出了一种用于检测高维基因组数据中异常值的有效方法,我们的方法是基于 p 值的组合方法,为检测异常值生成单个 p 值。通过使用模拟数据评估措施(如精度、召回率等)来测试我们方法的稳健性。通过使用真实数据,通过检测异常值并通过我们提出的方法进行相应处理,观察到基因组预测性能得到了显著提高。

相似文献

1
Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation.通过有效诊断影响观测的方法提高基因组预测准确性的统计方法。
Sci Rep. 2020 May 21;10(1):8408. doi: 10.1038/s41598-020-65323-3.
2
Bayesian Genomic Prediction with Genotype × Environment Interaction Kernel Models.使用基因型×环境互作核模型的贝叶斯基因组预测
G3 (Bethesda). 2017 Jan 5;7(1):41-53. doi: 10.1534/g3.116.035584.
3
Prediction of Multiple-Trait and Multiple-Environment Genomic Data Using Recommender Systems.使用推荐系统预测多性状和多环境基因组数据
G3 (Bethesda). 2018 Jan 4;8(1):131-147. doi: 10.1534/g3.117.300309.
4
Deep Kernel for Genomic and Near Infrared Predictions in Multi-environment Breeding Trials.深度核在多环境育种试验中基因组和近红外预测的应用。
G3 (Bethesda). 2019 Sep 4;9(9):2913-2924. doi: 10.1534/g3.119.400493.
5
Genomic Selection in Plant Breeding: Methods, Models, and Perspectives.基因组选择在植物育种中的应用:方法、模型与展望。
Trends Plant Sci. 2017 Nov;22(11):961-975. doi: 10.1016/j.tplants.2017.08.011. Epub 2017 Sep 28.
6
Use of multiple traits genomic prediction, genotype by environment interactions and spatial effect to improve prediction accuracy in yield data.利用多个性状基因组预测、基因型与环境互作和空间效应来提高产量数据的预测准确性。
PLoS One. 2020 May 13;15(5):e0232665. doi: 10.1371/journal.pone.0232665. eCollection 2020.
7
Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle.利用真实或推算的全基因组标记预测牛模拟多基因表型及其潜在数量性状位点基因型的准确性。
Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4.
8
Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers.利用系谱和分子标记预测植物育种中数量性状的遗传值。
Genetics. 2010 Oct;186(2):713-24. doi: 10.1534/genetics.110.118521. Epub 2010 Sep 2.
9
MegaLMM: Mega-scale linear mixed models for genomic predictions with thousands of traits.MegaLMM:用于具有数千个性状的基因组预测的大规模线性混合模型。
Genome Biol. 2021 Jul 23;22(1):213. doi: 10.1186/s13059-021-02416-w.
10
Training set optimization of genomic prediction by means of EthAcc.通过 EthAcc 对基因组预测进行训练集优化。
PLoS One. 2019 Feb 19;14(2):e0205629. doi: 10.1371/journal.pone.0205629. eCollection 2019.

引用本文的文献

1
GBLUP Outperforms Quantile Mapping and Outlier Detection for Enhanced Genomic Prediction.GBLUP在增强基因组预测方面优于分位数映射和异常值检测。
Int J Mol Sci. 2025 Apr 11;26(8):3620. doi: 10.3390/ijms26083620.
2
Hybrid Rice Production: A Worldwide Review of Floral Traits and Breeding Technology, with Special Emphasis on China.杂交水稻生产:全球花部性状与育种技术综述,重点关注中国
Plants (Basel). 2024 Feb 21;13(5):578. doi: 10.3390/plants13050578.
3
Identification of novel putative alleles related to important agronomic traits of wheat using robust strategies in GWAS.

本文引用的文献

1
Prediction of total genetic value using genome-wide dense marker maps.利用全基因组密集标记图谱预测总遗传值。
Genetics. 2001 Apr;157(4):1819-29. doi: 10.1093/genetics/157.4.1819.
利用 GWAS 中的稳健策略鉴定与小麦重要农艺性状相关的新型潜在等位基因。
Sci Rep. 2023 Jun 19;13(1):9927. doi: 10.1038/s41598-023-36134-z.
4
Digitalizing breeding in plants: A new trend of next-generation breeding based on genomic prediction.植物育种数字化:基于基因组预测的新一代育种新趋势。
Front Plant Sci. 2023 Jan 19;14:1092584. doi: 10.3389/fpls.2023.1092584. eCollection 2023.
5
Integrated model for genomic prediction under additive and non-additive genetic architecture.加性和非加性遗传结构下基因组预测的整合模型。
Front Plant Sci. 2022 Nov 30;13:1027558. doi: 10.3389/fpls.2022.1027558. eCollection 2022.
6
Genomic Selection: A Tool for Accelerating the Efficiency of Molecular Breeding for Development of Climate-Resilient Crops.基因组选择:加速培育抗逆作物分子育种效率的工具。
Front Genet. 2022 Feb 9;13:832153. doi: 10.3389/fgene.2022.832153. eCollection 2022.
7
Sparse Regression in Cancer Genomics: Comparing Variable Selection and Predictions in Real World Data.癌症基因组学中的稀疏回归:在真实世界数据中比较变量选择和预测
Cancer Inform. 2021 Nov 27;20:11769351211056298. doi: 10.1177/11769351211056298. eCollection 2021.