State Key Laboratory of Crop Genetics and Germplasm Enhancement, Cotton Germplasm Enhancement and Application Engineering Research Center (Ministry of Education), Nanjing Agricultural University, Nanjing, 210095, China.
Institute of Agricultural Sciences in Coastal Area of Jiangsu Province, Yancheng, 224002, China.
BMC Plant Biol. 2021 May 31;21(1):250. doi: 10.1186/s12870-021-03009-2.
Numerous quantitative trait loci (QTLs) and candidate genes associated with yield-related traits have been identified in cotton by genome-wide association study (GWAS) analysis. However, most of the phenotypic data were from a single or few environments, and the stable loci remained to be validated under multiple field environments.
Here, 242 upland cotton accessions collected from different origins were continuously investigated for phenotypic data of four main yield components, including boll weight (BW) and lint percentage (LP) under 13 field environments, and boll number per plant (BN) and seed index (SI) under 11 environments. Correlation analysis revealed a positive correlation between BN and LP, BW and SI, while SI had a negative correlation with LP and BN. Genetic analysis indicated that LP had the highest heritability estimates of 94.97%, followed by 92.08% for SI, 86.09% for BW, and 72.92% for BN, indicating LP and SI were more suitable traits for genetic improvement. Based on 56,010 high-quality single nucleotide polymorphisms (SNPs) and GWAS analysis, a total of 95 non-redundant QTLs were identified, including 12 of BN, 23 of BW, 45 of LP, and 33 of SI, respectively. Of them, 10 pairs of homologous QTLs were detected between A and D sub-genomes. We also found that 15 co-located QTLs with more than two traits and 12 high-confidence QTLs were detected under more than six environments, respectively. Further, two NET genes (GH_A08G0716 and GH_A08G0783), located in a novel QTL hotspot (qtl24, qtl25 and qlt26) were predominately expressed in early fiber development stages, exhibited significant correlation with LP and SI. The GH_A07G1389 in the stable qtl19 region encoded a tetratricopeptide repeat (TPR)-like superfamily protein and was a homologous gene involved in short fiber mutant ligon lintless-y (Li), implying important roles in cotton yield.
The present study provides a foundation for understanding the regulatory mechanisms of yield components and may enhance yield improvement through molecular breeding in cotton.
通过全基因组关联研究(GWAS)分析,已经在棉花中鉴定出许多与产量相关性状相关的数量性状位点(QTL)和候选基因。然而,大多数表型数据来自于单个或少数环境,并且稳定的基因座仍需要在多个田间环境中进行验证。
在这里,从不同来源收集了 242 个陆地棉品系,连续在 13 个田间环境中对四个主要产量组成部分的表型数据进行了调查,包括铃重(BW)和皮棉率(LP),以及在 11 个环境下的铃数/株(BN)和种子指数(SI)。相关性分析表明,BN 与 LP、BW 与 SI 之间呈正相关,而 SI 与 LP 和 BN 之间呈负相关。遗传分析表明,LP 的遗传力估计值最高,为 94.97%,其次是 SI(92.08%)、BW(86.09%)和 BN(72.92%),表明 LP 和 SI 更适合遗传改良。基于 56010 个高质量的单核苷酸多态性(SNP)和 GWAS 分析,共鉴定到 95 个非冗余 QTL,分别为 BN 的 12 个、BW 的 23 个、LP 的 45 个和 SI 的 33 个,其中 A 和 D 亚基因组之间检测到 10 对同源 QTL。我们还发现,在两个以上环境下检测到 15 个共定位 QTL 和 12 个高置信度 QTL。此外,两个 NET 基因(GH_A08G0716 和 GH_A08G0783)位于一个新的 QTL 热点(qtl24、qtl25 和 qlt26)中,在早期纤维发育阶段主要表达,与 LP 和 SI 呈显著相关性。位于稳定 qtl19 区域的 GH_A07G1389 编码一个四肽重复(TPR)样超家族蛋白,是一个涉及短纤维突变体 ligon lintless-y(Li)的同源基因,表明其在棉花产量中具有重要作用。
本研究为了解产量组成部分的调控机制提供了基础,并可能通过棉花的分子育种提高产量。