• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TrainSel:一个用于选择训练群体的R软件包。

TrainSel: An R Package for Selection of Training Populations.

作者信息

Akdemir Deniz, Rio Simon, Isidro Y Sánchez Julio

机构信息

Agriculture & Food Science Centre, Animal and Crop Science Division, University College Dublin, Dublin, Ireland.

Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Universidad Politécnica de Madrid (UPM), Madrid, Spain.

出版信息

Front Genet. 2021 May 7;12:655287. doi: 10.3389/fgene.2021.655287. eCollection 2021.

DOI:10.3389/fgene.2021.655287
PMID:34025720
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8138169/
Abstract

A major barrier to the wider use of supervised learning in emerging applications, such as genomic selection, is the lack of sufficient and representative labeled data to train prediction models. The amount and quality of labeled training data in many applications is usually limited and therefore careful selection of the training examples to be labeled can be useful for improving the accuracies in predictive learning tasks. In this paper, we present an R package, TrainSel, which provides flexible, efficient, and easy-to-use tools that can be used for the selection of training populations (STP). We illustrate its use, performance, and potentials in four different supervised learning applications within and outside of the plant breeding area.

摘要

在诸如基因组选择等新兴应用中,监督学习更广泛应用的一个主要障碍是缺乏足够且具有代表性的标记数据来训练预测模型。在许多应用中,标记训练数据的数量和质量通常是有限的,因此仔细选择要标记的训练示例对于提高预测学习任务的准确性可能是有用的。在本文中,我们展示了一个R包TrainSel,它提供了灵活、高效且易于使用的工具,可用于训练群体选择(STP)。我们在植物育种领域内外的四种不同监督学习应用中说明了它的用途、性能和潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/ad97056fa87e/fgene-12-655287-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/2f2514970e12/fgene-12-655287-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/d8ea0e0e5bc1/fgene-12-655287-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/98e40600a4be/fgene-12-655287-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/9f117c33da55/fgene-12-655287-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/ad97056fa87e/fgene-12-655287-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/2f2514970e12/fgene-12-655287-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/d8ea0e0e5bc1/fgene-12-655287-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/98e40600a4be/fgene-12-655287-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/9f117c33da55/fgene-12-655287-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee1a/8138169/ad97056fa87e/fgene-12-655287-g0005.jpg

相似文献

1
TrainSel: An R Package for Selection of Training Populations.TrainSel:一个用于选择训练群体的R软件包。
Front Genet. 2021 May 7;12:655287. doi: 10.3389/fgene.2021.655287. eCollection 2021.
2
Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data.基于机器学习的基因组预测:在合成数据和实际数据上,正则化回归、集成、基于实例和深度学习方法的性能比较。
BMC Genomics. 2024 Feb 7;25(1):152. doi: 10.1186/s12864-023-09933-x.
3
Deep virtual adversarial self-training with consistency regularization for semi-supervised medical image classification.深度对偶对抗自训练与一致性正则化在半监督医学图像分类中的应用。
Med Image Anal. 2021 May;70:102010. doi: 10.1016/j.media.2021.102010. Epub 2021 Feb 22.
4
Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry.基于伪标签优化的过程工业集成半监督软测量
Sensors (Basel). 2021 Dec 19;21(24):8471. doi: 10.3390/s21248471.
5
Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。
Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.
6
BWGS: A R package for genomic selection and its application to a wheat breeding programme.BWGS:一个基因组选择的 R 包及其在小麦育种计划中的应用。
PLoS One. 2020 Apr 2;15(4):e0222733. doi: 10.1371/journal.pone.0222733. eCollection 2020.
7
Transfer learning improves supervised image segmentation across imaging protocols.迁移学习可提高不同成像协议下的监督式图像分割效果。
IEEE Trans Med Imaging. 2015 May;34(5):1018-30. doi: 10.1109/TMI.2014.2366792. Epub 2014 Nov 4.
8
Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification.基于伪标签的半监督深度学习在高光谱图像分类中的应用。
IEEE Trans Image Process. 2018 Mar;27(3):1259-1270. doi: 10.1109/TIP.2017.2772836. Epub 2017 Nov 13.
9
Semi-supervised training of deep convolutional neural networks with heterogeneous data and few local annotations: An experiment on prostate histopathology image classification.基于异构数据和少量局部标注的深度卷积神经网络的半监督学习:前列腺组织病理学图像分类实验。
Med Image Anal. 2021 Oct;73:102165. doi: 10.1016/j.media.2021.102165. Epub 2021 Jul 14.
10
A self-supervised deep learning method for data-efficient training in genomics.一种用于基因组学中数据高效训练的自监督深度学习方法。
Commun Biol. 2023 Sep 11;6(1):928. doi: 10.1038/s42003-023-05310-2.

引用本文的文献

1
A comparison of design algorithms for choosing the training population in genomic models.基因组模型中用于选择训练群体的设计算法比较。
Front Genet. 2025 Feb 13;15:1462855. doi: 10.3389/fgene.2024.1462855. eCollection 2024.
2
Sparse testing designs for optimizing resource allocation in multi-environment cassava breeding trials.用于优化多环境木薯育种试验资源分配的稀疏测试设计
Plant Genome. 2025 Mar;18(1):e20558. doi: 10.1002/tpg2.20558.
3
Stochastic simulation to optimize rice breeding at IRRI.国际水稻研究所用于优化水稻育种的随机模拟

本文引用的文献

1
Optimizing Sequencing Resources in Genotyped Livestock Populations Using Linear Programming.利用线性规划优化基因分型家畜群体中的测序资源
Front Genet. 2021 Oct 22;12:740340. doi: 10.3389/fgene.2021.740340. eCollection 2021.
2
Optimization of training sets for genomic prediction of early-stage single crosses in maize.优化训练集以进行玉米早期单交种的基因组预测。
Theor Appl Genet. 2021 Feb;134(2):687-699. doi: 10.1007/s00122-020-03722-w. Epub 2021 Jan 4.
3
Optimizing whole-genomic prediction for autotetraploid blueberry breeding.
Front Plant Sci. 2024 Nov 1;15:1488814. doi: 10.3389/fpls.2024.1488814. eCollection 2024.
4
Training set optimization is a feasible alternative for perennial orphan crop domestication and germplasm management: an example.训练集优化是多年生孤儿作物驯化和种质管理的一种可行选择:一个实例。
Front Plant Sci. 2024 Sep 10;15:1441683. doi: 10.3389/fpls.2024.1441683. eCollection 2024.
5
Maximizing efficiency in sunflower breeding through historical data optimization.通过历史数据优化实现向日葵育种效率最大化。
Plant Methods. 2024 Mar 16;20(1):42. doi: 10.1186/s13007-024-01151-0.
6
Optimization of Multi-Generation Multi-location Genomic Prediction Models for Recurrent Genomic Selection in an Upland Rice Population.陆稻群体轮回基因组选择中多代多地点基因组预测模型的优化
Rice (N Y). 2023 Sep 27;16(1):43. doi: 10.1186/s12284-023-00661-0.
7
Oracle selection provides insight into how far off practice is from Utopia in plant breeding.选择合适的品种有助于洞察植物育种实践与理想状态之间的差距。
Front Plant Sci. 2023 Jul 21;14:1218665. doi: 10.3389/fpls.2023.1218665. eCollection 2023.
8
Genetic gain and inbreeding from simulation of different genomic mating schemes for pig improvement.通过模拟不同基因组交配方案对猪进行改良的遗传进展和近亲繁殖情况。
J Anim Sci Biotechnol. 2023 Jun 13;14(1):87. doi: 10.1186/s40104-023-00872-x.
9
Genetic Basis of Resistance to Yellow Rust Race at the Seedling Stage in Current Central and Northern European Winter Wheat Germplasm.当前中欧和北欧冬小麦种质资源苗期对条锈菌生理小种抗性的遗传基础
Plants (Basel). 2023 Jan 17;12(3):420. doi: 10.3390/plants12030420.
10
Building a Calibration Set for Genomic Prediction, Characteristics to Be Considered, and Optimization Approaches.构建用于基因组预测的校准集、需考虑的特征及优化方法。
Methods Mol Biol. 2022;2467:77-112. doi: 10.1007/978-1-0716-2205-6_3.
优化同源四倍体蓝莓育种的全基因组预测。
Heredity (Edinb). 2020 Dec;125(6):437-448. doi: 10.1038/s41437-020-00357-x. Epub 2020 Oct 19.
4
Training Population Optimization for Genomic Selection in .优化基因组选择中的训练群体 。
G3 (Bethesda). 2020 Jul 7;10(7):2465-2476. doi: 10.1534/g3.120.401402.
5
Genomic prediction of maize microphenotypes provides insights for optimizing selection and mining diversity.基因组预测玉米微观表型为优化选择和挖掘多样性提供了见解。
Plant Biotechnol J. 2020 Dec;18(12):2456-2465. doi: 10.1111/pbi.13420. Epub 2020 Jun 8.
6
Training set determination for genomic selection.基因组选择的训练集确定。
Theor Appl Genet. 2019 Oct;132(10):2781-2792. doi: 10.1007/s00122-019-03387-0. Epub 2019 Jul 2.
7
Training set optimization of genomic prediction by means of EthAcc.通过 EthAcc 对基因组预测进行训练集优化。
PLoS One. 2019 Feb 19;14(2):e0205629. doi: 10.1371/journal.pone.0205629. eCollection 2019.
8
Design of training populations for selective phenotyping in genomic prediction.用于基因组预测中选择性表型分析的训练群体设计。
Sci Rep. 2019 Feb 5;9(1):1446. doi: 10.1038/s41598-018-38081-6.
9
Optimal Designs for Genomic Selection in Hybrid Crops.杂种作物基因组选择的最优设计。
Mol Plant. 2019 Mar 4;12(3):390-401. doi: 10.1016/j.molp.2018.12.022. Epub 2019 Jan 6.
10
Multi-objective optimized genomic breeding strategies for sustainable food improvement.多目标优化的基因组育种策略,以实现可持续的粮食改良。
Heredity (Edinb). 2019 May;122(5):672-683. doi: 10.1038/s41437-018-0147-1. Epub 2018 Sep 27.