• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过层次聚类实现自动四倍体基因型分型

Automated tetraploid genotype calling by hierarchical clustering.

作者信息

Schmitz Carley Cari A, Coombs Joseph J, Douches David S, Bethke Paul C, Palta Jiwan P, Novy Richard G, Endelman Jeffrey B

机构信息

Department of Horticulture, University of Wisconsin, Madison, WI, 53706, USA.

Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, 48824, USA.

出版信息

Theor Appl Genet. 2017 Apr;130(4):717-726. doi: 10.1007/s00122-016-2845-5. Epub 2017 Jan 9.

DOI:10.1007/s00122-016-2845-5
PMID:28070610
Abstract

New software to make tetraploid genotype calls from SNP array data was developed, which uses hierarchical clustering and multiple F1 populations to calibrate the relationship between signal intensity and allele dosage. SNP arrays are transforming breeding and genetics research for autotetraploids. To fully utilize these arrays, the relationship between signal intensity and allele dosage must be calibrated for each marker. We developed an improved computational method to automate this process, which is provided as the R package ClusterCall. In the training phase of the algorithm, hierarchical clustering within an F1 population is used to group samples with similar intensity values, and allele dosages are assigned to clusters based on expected segregation ratios. In the prediction phase, multiple F1 populations and the prediction set are clustered together, and the genotype for each cluster is the mode of the training set samples. A concordance metric, defined as the proportion of training set samples equal to the mode, can be used to eliminate unreliable markers and compare different algorithms. Across three potato families genotyped with an 8K SNP array, ClusterCall scored 5729 markers with at least 0.95 concordance (94.6% of its total), compared to 5325 with the software fitTetra (82.5% of its total). The three families were used to predict genotypes for 5218 SNPs in the SolCAP diversity panel, compared with 3521 SNPs in a previous study in which genotypes were called manually. One of the additional markers produced a significant association for vine maturity near a well-known causal locus on chromosome 5. In conclusion, when multiple F1 populations are available, ClusterCall is an efficient method for accurate, autotetraploid genotype calling that enables the use of SNP data for research and plant breeding.

摘要

开发了一种从SNP阵列数据中进行四倍体基因型分型的新软件,该软件使用层次聚类和多个F1群体来校准信号强度与等位基因剂量之间的关系。SNP阵列正在改变同源四倍体的育种和遗传学研究。为了充分利用这些阵列,必须针对每个标记校准信号强度与等位基因剂量之间的关系。我们开发了一种改进的计算方法来自动化这一过程,该方法以R包ClusterCall的形式提供。在算法的训练阶段,F1群体内的层次聚类用于对具有相似强度值的样本进行分组,并根据预期的分离比例为聚类分配等位基因剂量。在预测阶段,将多个F1群体和预测集聚类在一起,每个聚类的基因型是训练集样本的众数。一个一致性指标,定义为等于众数的训练集样本的比例,可用于消除不可靠的标记并比较不同的算法。在使用8K SNP阵列进行基因分型的三个马铃薯家族中,ClusterCall对5729个标记的评分一致性至少为0.95(占其总数的94.6%),相比之下,软件fitTetra对5325个标记的评分一致性为0.95(占其总数的82.5%)。这三个家族用于预测SolCAP多样性面板中5218个SNP的基因型,相比之下,在之前一项手动进行基因型分型的研究中有3521个SNP。其中一个额外的标记在5号染色体上一个著名的因果位点附近与葡萄成熟度产生了显著关联。总之,当有多个F1群体可用时,ClusterCall是一种准确进行同源四倍体基因型分型的有效方法,能够将SNP数据用于研究和植物育种。

相似文献

1
Automated tetraploid genotype calling by hierarchical clustering.通过层次聚类实现自动四倍体基因型分型
Theor Appl Genet. 2017 Apr;130(4):717-726. doi: 10.1007/s00122-016-2845-5. Epub 2017 Jan 9.
2
FitTetra 2.0 - improved genotype calling for tetraploids with multiple population and parental data support.FitTetra 2.0 - 提高了四倍体的基因型调用能力,支持多群体和双亲数据。
BMC Bioinformatics. 2019 Mar 20;20(1):148. doi: 10.1186/s12859-019-2703-y.
3
Genotype calling in tetraploid species from bi-allelic marker data using mixture models.利用混合模型对来自双等位基因标记数据的四倍体物种进行基因型调用。
BMC Bioinformatics. 2011 May 19;12:172. doi: 10.1186/1471-2105-12-172.
4
Linkage analysis and QTL mapping using SNP dosage data in a tetraploid potato mapping population.利用四倍体马铃薯作图群体中的 SNP 剂量数据进行连锁分析和 QTL 作图。
PLoS One. 2013 May 21;8(5):e63939. doi: 10.1371/journal.pone.0063939. Print 2013.
5
Development and analysis of a 20K SNP array for potato (Solanum tuberosum): an insight into the breeding history.用于马铃薯(Solanum tuberosum)的20K单核苷酸多态性阵列的开发与分析:对育种历史的洞察
Theor Appl Genet. 2015 Dec;128(12):2387-401. doi: 10.1007/s00122-015-2593-y. Epub 2015 Aug 12.
6
Integrating haplotype-specific linkage maps in tetraploid species using SNP markers.利用单核苷酸多态性(SNP)标记整合四倍体物种中特定单倍型的连锁图谱。
Theor Appl Genet. 2016 Nov;129(11):2211-2226. doi: 10.1007/s00122-016-2768-1. Epub 2016 Aug 25.
7
Physical mapping of QTL for tuber yield, starch content and starch yield in tetraploid potato (Solanum tuberosum L.) by means of genome wide genotyping by sequencing and the 8.3 K SolCAP SNP array.利用全基因组测序基因分型和8.3K SolCAP SNP芯片对四倍体马铃薯(Solanum tuberosum L.)块茎产量、淀粉含量和淀粉产量的QTL进行物理定位。
BMC Genomics. 2017 Aug 22;18(1):642. doi: 10.1186/s12864-017-3979-9.
8
QTL mapping in autotetraploids using SNP dosage information.利用 SNP 剂量信息进行同源四倍体的 QTL 作图。
Theor Appl Genet. 2014 Sep;127(9):1885-904. doi: 10.1007/s00122-014-2347-2. Epub 2014 Jul 1.
9
Improved genetic resolution for linkage mapping of resistance to potato wart in monoparental dihaploids with potential diagnostic value in tetraploid potato varieties.提高马铃薯疣抗性连锁作图的遗传分辨率,在四倍体马铃薯品种中具有潜在的诊断价值的单亲二倍体。
Theor Appl Genet. 2018 Dec;131(12):2555-2566. doi: 10.1007/s00122-018-3172-9. Epub 2018 Aug 29.
10
Genetic Linkage Mapping of Economically Important Traits in Cultivated Tetraploid Potato (Solanum tuberosum L.).栽培四倍体马铃薯(Solanum tuberosum L.)经济重要性状的遗传连锁图谱构建
G3 (Bethesda). 2015 Sep 14;5(11):2357-64. doi: 10.1534/g3.115.019646.

引用本文的文献

1
Genomic prediction for potato (Solanum tuberosum) quality traits improved through image analysis.通过图像分析改进马铃薯(Solanum tuberosum)品质性状的基因组预测。
Plant Genome. 2024 Dec;17(4):e20507. doi: 10.1002/tpg2.20507. Epub 2024 Sep 10.
2
Genotyping-by-sequencing targets genic regions and improves resolution of genome-wide association studies in autotetraploid potato.重测序基因分型技术靶向基因区域,提高了自交四倍体马铃薯全基因组关联研究的分辨率。
Theor Appl Genet. 2024 Jul 9;137(8):180. doi: 10.1007/s00122-024-04651-8.
3
A genomic dataset integrating genotyping-by-sequencing, SolCAP array and PCR marker data on tetraploid potato advanced breeding lines.

本文引用的文献

1
Software for Genome-Wide Association Studies in Autopolyploids and Its Application to Potato.软件用于自交多倍体的全基因组关联研究及其在马铃薯中的应用。
Plant Genome. 2016 Jul;9(2). doi: 10.3835/plantgenome2015.08.0073.
2
Probabilistic Multilocus Haplotype Reconstruction in Outcrossing Tetraploids.异交四倍体中概率多位点单倍型重建
Genetics. 2016 May;203(1):119-31. doi: 10.1534/genetics.115.185579. Epub 2016 Feb 26.
3
Genetic mapping with an inbred line-derived F2 population in potato.利用自交系衍生的 F2 群体进行马铃薯的遗传作图。
一个整合了四倍体马铃薯高级育种系的测序基因分型、SolCAP阵列和PCR标记数据的基因组数据集。
Front Plant Sci. 2024 May 17;15:1384401. doi: 10.3389/fpls.2024.1384401. eCollection 2024.
4
QTL identified that influence tuber length-width ratio, degree of flatness, tuber size, and specific gravity in a russet-skinned, tetraploid mapping population.在一个赤褐色表皮的四倍体作图群体中,鉴定出了影响块茎长宽比、扁平程度、块茎大小和比重的数量性状基因座。
Front Plant Sci. 2024 Mar 22;15:1343632. doi: 10.3389/fpls.2024.1343632. eCollection 2024.
5
Identification of QTL associated with plant vine characteristics and infection response to late blight, early blight, and Verticillium wilt in a tetraploid potato population derived from late blight-resistant Palisade Russet.在源自晚疫病抗性品种Palisade Russet的四倍体马铃薯群体中,鉴定与植株藤蔓特征以及对晚疫病、早疫病和黄萎病的感染反应相关的数量性状位点。
Front Plant Sci. 2023 Oct 11;14:1222596. doi: 10.3389/fpls.2023.1222596. eCollection 2023.
6
Implementation of different relationship estimate methodologies in breeding value prediction in kiwiberry ().不同关系估计方法在猕猴桃育种值预测中的应用()。 (原文括号部分内容缺失,无法准确完整翻译)
Mol Breed. 2023 Oct 18;43(10):75. doi: 10.1007/s11032-023-01419-8. eCollection 2023 Oct.
7
Defining a diverse core collection of the Colombian Central Collection of potatoes: a tool to advance research and breeding.界定哥伦比亚马铃薯中央种质库的多样化核心种质库:推动研究与育种的工具。
Front Plant Sci. 2023 Apr 26;14:1046400. doi: 10.3389/fpls.2023.1046400. eCollection 2023.
8
Genetic Analysis of Potato Breeding Collection Using Single-Nucleotide Polymorphism (SNP) Markers.利用单核苷酸多态性(SNP)标记对马铃薯育种群体进行遗传分析
Plants (Basel). 2023 May 6;12(9):1895. doi: 10.3390/plants12091895.
9
Analysis of genetic diversity and population structure among cultivated potato clones from Korea and global breeding programs.分析韩国和全球育种计划中栽培马铃薯无性系的遗传多样性和群体结构。
Sci Rep. 2022 Jun 21;12(1):10462. doi: 10.1038/s41598-022-12874-2.
10
Linkage and QTL mapping for tuber shape and specific gravity in a tetraploid mapping population of potato representing the russet market class.四倍体马铃薯作图群体中块茎形状和比重的连锁和 QTL 作图,该群体代表了红皮市场类别。
BMC Plant Biol. 2021 Nov 3;21(1):507. doi: 10.1186/s12870-021-03265-2.
Theor Appl Genet. 2016 May;129(5):935-43. doi: 10.1007/s00122-016-2673-7. Epub 2016 Feb 5.
4
The Double-Reduction Landscape in Tetraploid Potato as Revealed by a High-Density Linkage Map.高密度连锁图谱揭示的四倍体马铃薯中的双减景观
Genetics. 2015 Nov;201(3):853-63. doi: 10.1534/genetics.115.181008. Epub 2015 Sep 15.
5
Development and analysis of a 20K SNP array for potato (Solanum tuberosum): an insight into the breeding history.用于马铃薯(Solanum tuberosum)的20K单核苷酸多态性阵列的开发与分析:对育种历史的洞察
Theor Appl Genet. 2015 Dec;128(12):2387-401. doi: 10.1007/s00122-015-2593-y. Epub 2015 Aug 12.
6
Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).利用RNA测序技术组装一个拥有超过13000个全长表达基因的玫瑰转录组,并开发用于玫瑰(蔷薇属)的WagRhSNP 68k公理单核苷酸多态性阵列。
Front Plant Sci. 2015 Apr 21;6:249. doi: 10.3389/fpls.2015.00249. eCollection 2015.
7
Tetrasomic recombination is surprisingly frequent in allotetraploid Arachis.在异源四倍体花生中,四体重组出奇地频繁。
Genetics. 2015 Apr;199(4):1093-105. doi: 10.1534/genetics.115.174607. Epub 2015 Feb 20.
8
A saturated genetic linkage map of autotetraploid alfalfa (Medicago sativa L.) developed using genotyping-by-sequencing is highly syntenous with the Medicago truncatula genome.利用简化基因组测序技术构建的同源四倍体紫花苜蓿(Medicago sativa L.)饱和遗传连锁图谱与蒺藜苜蓿基因组高度同源。
G3 (Bethesda). 2014 Aug 21;4(10):1971-9. doi: 10.1534/g3.114.012245.
9
Doubling down on genomes: polyploidy and crop plants.聚焦基因组加倍:多倍体与农作物
Am J Bot. 2014 Oct;101(10):1711-25. doi: 10.3732/ajb.1400119. Epub 2014 Aug 3.
10
Development of an alfalfa SNP array and its use to evaluate patterns of population structure and linkage disequilibrium.苜蓿单核苷酸多态性(SNP)芯片的开发及其在评估群体结构和连锁不平衡模式中的应用。
PLoS One. 2014 Jan 9;9(1):e84329. doi: 10.1371/journal.pone.0084329. eCollection 2014.