• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

种质库中遗传资源的入藏稀有性和等位基因特异性的信息视图,用于管理和保护。

An informational view of accession rarity and allele specificity in germplasm banks for management and conservation.

机构信息

Department of Plant Breeding/Universidad Autónoma Agraria Antonio Narro, Saltillo, Coahuila, Mexico.

International Maize and Wheat Improvement Center (CIMMYT), Mexico City, Mexico.

出版信息

PLoS One. 2018 Feb 28;13(2):e0193346. doi: 10.1371/journal.pone.0193346. eCollection 2018.

DOI:10.1371/journal.pone.0193346
PMID:29489873
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5831390/
Abstract

Germplasm banks are growing in their importance, number of accessions and amount of characterization data, with a large emphasis on molecular genetic markers. In this work, we offer an integrated view of accessions and marker data in an information theory framework. The basis of this development is the mutual information between accessions and allele frequencies for molecular marker loci, which can be decomposed in allele specificities, as well as in rarity and divergence of accessions. In this way, formulas are provided to calculate the specificity of the different marker alleles with reference to their distribution across accessions, accession rarity, defined as the weighted average of the specificity of its alleles, and divergence, defined by the Kullback-Leibler formula. Albeit being different measures, it is demonstrated that average rarity and divergence are equal for any collection. These parameters can contribute to the knowledge of the structure of a germplasm collection and to make decisions about the preservation of rare variants. The concepts herein developed served as the basis for a strategy for core subset selection called HCore, implemented in a publicly available R script. As a proof of concept, the mathematical view and tools developed in this research were applied to a large collection of Mexican wheat accessions, widely characterized by SNP markers. The most specific alleles were found to be private of a single accession, and the distribution of this parameter had its highest frequencies at low levels of specificity. Accession rarity and divergence had largely symmetrical distributions, and had a positive, albeit non-strictly linear relationship. Comparison of the HCore approach for core subset selection, with three state-of-the-art methods, showed it to be superior for average divergence and rarity, mean genetic distance and diversity. The proposed approach can be used for knowledge extraction and decision making in germplasm collections of diploid, inbred or outbred species.

摘要

种质库在重要性、访问量和特征描述数据量方面都在不断增加,其中很大一部分强调了分子遗传标记。在这项工作中,我们在信息论框架内提供了访问量和标记数据的综合视图。这一发展的基础是访问量和分子标记基因座等位基因频率之间的互信息,可以分解为等位基因特异性以及访问量的稀有性和多样性。通过这种方式,提供了公式来计算不同标记等位基因的特异性,参考其在访问量中的分布、定义为其等位基因特异性加权平均值的访问量稀有性以及通过 Kullback-Leibler 公式定义的多样性。尽管是不同的措施,但证明了任何集合的平均稀有性和多样性都是相等的。这些参数可以有助于了解种质资源收集的结构,并就保存稀有变体做出决策。本文所开发的概念为核心子集选择策略 HCore 提供了基础,该策略在一个可公开获取的 R 脚本中实现。作为概念验证,本文研究中开发的数学观点和工具被应用于广泛用 SNP 标记进行特征描述的大量墨西哥小麦访问量。发现最具特异性的等位基因是单个访问量所特有的,并且该参数的分布在特异性水平较低时具有最高频率。访问量稀有性和多样性的分布大致对称,并且存在正相关关系,尽管不是严格的线性关系。核心子集选择的 HCore 方法与三种最先进的方法的比较表明,它在平均多样性和稀有性、平均遗传距离和多样性方面具有优势。该方法可用于二倍体、自交或杂交物种的种质资源收集的知识提取和决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd6/5831390/05ae782d6ecb/pone.0193346.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd6/5831390/0537c0d937a7/pone.0193346.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd6/5831390/bd2e402a3468/pone.0193346.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd6/5831390/05ae782d6ecb/pone.0193346.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd6/5831390/0537c0d937a7/pone.0193346.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd6/5831390/bd2e402a3468/pone.0193346.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fd6/5831390/05ae782d6ecb/pone.0193346.g003.jpg

相似文献

1
An informational view of accession rarity and allele specificity in germplasm banks for management and conservation.种质库中遗传资源的入藏稀有性和等位基因特异性的信息视图,用于管理和保护。
PLoS One. 2018 Feb 28;13(2):e0193346. doi: 10.1371/journal.pone.0193346. eCollection 2018.
2
Genomic characterization of a core set of the USDA-NPGS Ethiopian sorghum germplasm collection: implications for germplasm conservation, evaluation, and utilization in crop improvement.美国农业部国家植物种质系统埃塞俄比亚高粱种质收集核心集的基因组特征:对种质保存、评估及作物改良利用的意义
BMC Genomics. 2017 Jan 26;18(1):108. doi: 10.1186/s12864-016-3475-7.
3
Genetic structure and molecular diversity of Brazilian grapevine germplasm: Management and use in breeding programs.巴西葡萄种质资源的遗传结构和分子多样性:在育种计划中的管理和利用。
PLoS One. 2020 Oct 15;15(10):e0240665. doi: 10.1371/journal.pone.0240665. eCollection 2020.
4
Genetic diversity and linkage disequilibrium using SNP (KASP) and AFLP markers in a worldwide durum wheat (Triticum turgidum L. var durum) collection.利用 SNP(KASP)和 AFLP 标记对世界范围内硬粒小麦(Triticum turgidum L. var durum)收集品系进行遗传多样性和连锁不平衡分析。
PLoS One. 2019 Jun 28;14(6):e0218562. doi: 10.1371/journal.pone.0218562. eCollection 2019.
5
Wheat gene bank accessions as a source of new alleles of the powdery mildew resistance gene Pm3: a large scale allele mining project.小麦基因库资源作为抗白粉病基因 Pm3 新等位基因的来源:一项大规模的等位基因挖掘项目。
BMC Plant Biol. 2010 May 17;10:88. doi: 10.1186/1471-2229-10-88.
6
Genetic diversity and accession structure in European Cynara cardunculus collections.欧洲刺菜蓟种质资源库中的遗传多样性与种质结构
PLoS One. 2017 Jun 1;12(6):e0178770. doi: 10.1371/journal.pone.0178770. eCollection 2017.
7
A worldwide bread wheat core collection arrayed in a 384-well plate.一个排列在384孔板中的全球面包小麦核心种质库。
Theor Appl Genet. 2007 May;114(7):1265-75. doi: 10.1007/s00122-007-0517-1. Epub 2007 Feb 21.
8
Extracting samples of high diversity from thematic collections of large gene banks using a genetic-distance based approach.基于遗传距离的方法从大型基因库的主题收藏中提取多样性高的样本。
BMC Plant Biol. 2010 Jun 24;10:127. doi: 10.1186/1471-2229-10-127.
9
Genetic diversity and population structure analysis to construct a core collection from a large Capsicum germplasm.基于遗传多样性和群体结构分析从大量辣椒种质中构建核心种质库。
BMC Genet. 2016 Nov 14;17(1):142. doi: 10.1186/s12863-016-0452-8.
10
Tagging large CNV blocks in wheat boosts digitalization of germplasm resources by ultra-low-coverage sequencing.在小麦中标记大片 CNV 块可通过超低覆盖测序提高种质资源数字化水平。
Genome Biol. 2024 Jul 1;25(1):171. doi: 10.1186/s13059-024-03315-6.

引用本文的文献

1
Optimizing the accession-level quantity of seeds to put into storage to minimize seed (gene)bank regeneration or re-collection.优化入库种子的数量,以尽量减少种子(基因)库的再生或重新采集。
Conserv Physiol. 2025 Feb 23;13(1):coaf011. doi: 10.1093/conphys/coaf011. eCollection 2025.
2
SeqSNP-Based Targeted GBS Provides Insight into the Genetic Relationships among Global Collections of ssp. (Turnip Rape).基于序列单核苷酸多态性的靶向 GBS 揭示了全球 ssp. (芜菁型油菜)群体间的遗传关系。
Genes (Basel). 2024 Sep 10;15(9):1187. doi: 10.3390/genes15091187.
3
Genetic Structure and Geographical Differentiation of Traditional Rice ( L.) from Northern Vietnam.

本文引用的文献

1
An SSR-based approach incorporating a novel algorithm for identification of rare maize genotypes facilitates criteria for landrace conservation in Mexico.一种基于简单序列重复(SSR)的方法,结合了一种用于鉴定稀有玉米基因型的新算法,为墨西哥地方品种的保护提供了便利标准。
Ecol Evol. 2017 Feb 10;7(6):1680-1690. doi: 10.1002/ece3.2754. eCollection 2017 Mar.
2
Unlocking the genetic diversity of Creole wheats.揭开克里奥尔小麦的遗传多样性。
Sci Rep. 2016 Mar 15;6:23092. doi: 10.1038/srep23092.
3
Ecological specialization and rarity indices estimated for a large number of plant species in France.
越南北部传统水稻(Oryza L.)的遗传结构与地理分化
Plants (Basel). 2021 Oct 3;10(10):2094. doi: 10.3390/plants10102094.
4
Assessment of genetic diversity and population structure of oil palm (Elaeis guineensis Jacq.) field genebank: A step towards molecular-assisted germplasm conservation.油棕(Elaeis guineensis Jacq.)田间基因库遗传多样性和群体结构评估:迈向分子辅助种质保存的一步。
PLoS One. 2021 Jul 29;16(7):e0255418. doi: 10.1371/journal.pone.0255418. eCollection 2021.
5
Harnessing Crop Wild Diversity for Climate Change Adaptation.利用作物野生多样性适应气候变化。
Genes (Basel). 2021 May 20;12(5):783. doi: 10.3390/genes12050783.
6
Comparison of Morphological and Genetic Characteristics of Avocados Grown in Tanzania.坦桑尼亚种植的鳄梨的形态和遗传特征比较。
Genes (Basel). 2021 Jan 4;12(1):63. doi: 10.3390/genes12010063.
7
Genetic diversity and population structure of black cottonwood (Populus deltoides) revealed using simple sequence repeat markers.利用简单重复序列标记揭示黑杨(Populus deltoides)的遗传多样性和种群结构。
BMC Genet. 2020 Jan 6;21(1):2. doi: 10.1186/s12863-019-0805-1.
8
The impact of sample selection strategies on genetic diversity and representativeness in germplasm bank collections.样本选择策略对种质库收集遗传多样性和代表性的影响。
BMC Plant Biol. 2019 Nov 27;19(1):520. doi: 10.1186/s12870-019-2142-y.
针对法国大量植物物种估算的生态特化和稀有度指数。
Data Brief. 2015 Mar 4;3:165-8. doi: 10.1016/j.dib.2015.02.015. eCollection 2015 Jun.
4
Analysis and optimization of bulk DNA sampling with binary scoring for germplasm characterization.利用二分值对种质特征进行批量 DNA 采样的分析与优化。
PLoS One. 2013 Nov 19;8(11):e79936. doi: 10.1371/journal.pone.0079936. eCollection 2013.
5
Informativeness of microsatellite markers.微卫星标记的信息性
Methods Mol Biol. 2013;1006:259-70. doi: 10.1007/978-1-62703-389-3_18.
6
Core Hunter II: fast core subset selection based on multiple genetic diversity measures using Mixed Replica search.Core Hunter II:基于混合副本搜索的多种遗传多样性度量的快速核心子集选择。
BMC Bioinformatics. 2012 Nov 23;13:312. doi: 10.1186/1471-2105-13-312.
7
Genomics of gene banks: A case study in rice.基因库的基因组学:以水稻为例。
Am J Bot. 2012 Feb;99(2):407-23. doi: 10.3732/ajb.1100385. Epub 2012 Feb 6.
8
Cancer reduces transcriptome specialization.癌症降低转录组的专业化。
PLoS One. 2010 May 3;5(5):e10398. doi: 10.1371/journal.pone.0010398.
9
Of bits and wows: A Bayesian theory of surprise with applications to attention.比特与惊叹:应用于注意力的贝叶斯惊奇理论。
Neural Netw. 2010 Jun;23(5):649-66. doi: 10.1016/j.neunet.2009.12.007. Epub 2009 Dec 28.
10
Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures.核心猎手:一种基于多种遗传指标对遗传资源进行采样的算法。
BMC Bioinformatics. 2009 Aug 6;10:243. doi: 10.1186/1471-2105-10-243.