Univaso Luciano, Peña Francisca, Román-Figueroa Celián, Paneque Manuel
Bionostra Chile Research Foundation, Almirante Lynch, 1179, San Miguel, Santiago, 8920033, Chile.
Department of Environmental Sciences and Natural Resources, Faculty of Agricultural Sciences, Universidad de Chile, Santa Rosa 11315, La Pintana, Santiago, 8820808, Chile.
BMC Genomics. 2025 Jul 17;26(1):672. doi: 10.1186/s12864-025-11831-3.
Maize (Zea mays L.) has a deep cultural significance in Latin America, with traditional and native varieties cultivated for millennia. Approximately 220 maize races have been identified in the region. These races have adapted to the diverse environmental conditions, resulting in a considerable diversity of varieties. Although DNA barcoding is widely used for species identification, distinguishing between varieties within a species remains challenging in plants, owing to the conserved nature of standard barcoding loci. Intragenic marker combinations are often used to address this limitation. However, they remain insufficient for variety-level resolution. Therefore, we developed a pipeline to identify robust markers that can distinguish maize varieties based on their phylogeographic origins.
In this study, the small single-copy (SSC) region of the chloroplast genome exhibited the highest mutation rate per nucleotide. Furthermore, the intergenic regions rps15-rpl32 and ndhD-ndhF exhibited the highest mutation rates in the SSC. Comparatively, the coding genes in this region were more conserved. The rps15-rpl32 locus demonstrated improved resolution for phylogeographic analysis when concatenated with a short genetic anchor sequence. This marker accurately identified the geographic origin of maize samples. Overall, it was the most informative marker despite its relatively low SNP and InDel frequency and moderate divergence levels. Contrastingly, the ndhF-ndhD locus exhibited higher mutation rates but failed to effectively resolve phylogenetic relationships.
Our findings demonstrate that concatenated loci can accurately identify the geographic origin of Zea mays varieties and subspecies and elucidate their relationships. Moreover, the superior performance of rps15-rpl32 in the delineation of phylogenetic relationships among regional genomes shows its potential application as a marker for distinguishing closely related varieties and subspecies. This locus can facilitate the streamlined validation of Zea mays varieties for regional authenticity.
玉米(Zea mays L.)在拉丁美洲具有深厚的文化意义,数千年来一直种植传统和本地品种。该地区已鉴定出约220个玉米种族。这些种族适应了多样的环境条件,导致品种具有相当大的多样性。尽管DNA条形码广泛用于物种鉴定,但由于标准条形码位点的保守性,区分植物物种内的品种仍然具有挑战性。基因内标记组合通常用于解决这一局限性。然而,它们在品种水平分辨率方面仍然不足。因此,我们开发了一种流程来识别能够根据玉米品种的系统地理起源区分它们的稳健标记。
在本研究中,叶绿体基因组的小单拷贝(SSC)区域显示出每核苷酸最高的突变率。此外,基因间区域rps15 - rpl32和ndhD - ndhF在SSC中表现出最高的突变率。相比之下,该区域的编码基因更保守。当与短的遗传锚定序列连接时,rps15 - rpl32位点在系统地理分析中显示出更高的分辨率。该标记准确地鉴定了玉米样本的地理起源。总体而言,尽管其单核苷酸多态性(SNP)和插入缺失(InDel)频率相对较低且分歧水平中等,但它是信息最丰富的标记。相比之下,ndhF - ndhD位点表现出更高的突变率,但未能有效解析系统发育关系。
我们的研究结果表明,串联位点能够准确识别玉米品种和亚种的地理起源并阐明它们之间的关系。此外,rps15 - rpl32在区域基因组系统发育关系划分中的卓越性能表明其作为区分密切相关品种和亚种的标记具有潜在应用价值。该位点有助于简化玉米品种区域真实性的验证。