Getahun Addisu, Nida Habte, Woldesemayat Adugna Abdi
College of Natural and Applied Sciences (CNAS), Department of Biotechnology, Addis Ababa Science and Technology University (AASTU), 16417, Addis Ababa, Ethiopia.
Biotechnology and Bioprocess Center of Excellence, AASTU, 16417, Addis Ababa, Ethiopia.
BMC Genom Data. 2025 Sep 26;26(1):67. doi: 10.1186/s12863-025-01350-1.
Sorghum, a diploid C4 cereal (2n = 2x = 20) with a 750 Mbp genome, is widely adaptable to tropical and temperate climates. As its center of origin and diversity, Ethiopia holds valuable genetic variation for improving yield and nutritional traits. This study aimed to identify and functionally characterize quantitative trait nucleotides (QTNs) linked to key agronomic and yield-related traits and their associated candidate genes.
Two hundred sixteen sorghum genotypes were evaluated over two seasons in northwestern Ethiopia using an alpha lattice design. Agronomic traits assessed included days to flowering, days to maturity, plant height, seed number per plant, seed yield, and thousand-seed weight. Genotyping-by-sequencing (GBS) generated 351,692 SNPs, with 50,165 high-quality markers retained. Candidate gene identification and functional characterization were carried out using a combination of bioinformatics tools and publicly available databases. Data normalization and analysis were conducted using META-R and SAS JMP. Linkage disequilibrium was assessed via TASSEL 5.0, and multi-locus genome-wide association study (ML-GWAS) identified significant QTNs (LOD ≥ 4.0) associated with phenotypic traits.
This study investigates the genetic basis of key agronomic and yield related traits in sorghum by identifying QTNs associated with phenotypic variation. Descriptive statistics revealed notable variability in traits such as days to flowering (101 days), days to maturity (145.77 days), plant height (357.47 cm), seed number per plant (1808.92 count), seed yield (45.07 g), and thousand-seed weight (23.44 g). Correlation analysis showed strong relationships, particularly between days to flowering and maturity (r = 0.7058). ML-GWAS detected 176 QTNs across all 10 chromosomes, with 34 considered reliable Due to their consistent identification across multiple models. 117 candidate genes were mapped to these QTNs, associated with six major traits: 20 for flowering time, 16 for maturity, 16 for plant height, 17 for seed number per plant, 38 for seed yield, and 10 for seed weight. Key genes included Sobic.001G196700 (flowering time) and Sobic.005G176100 (stress responses). Two important regulatory genes, SbMADS1 and SbFT, were highlighted for their roles in flowering regulation. SbMADS1 influences days to flowering, while SbFT acts as a mobile signal integrating photoperiod cues. These genes are involved in starch and sucrose metabolism pathways, essential for energy storage and mobilization, thereby supporting improved growth and yield in sorghum.
This study highlights the complexity of trait inheritance shaped by diverse genetic factors and underscores the significance of major, stable, and unique QTNs for marker-assisted selection. Functional genome annotation revealed that candidate genes are involved in key biological processes and metabolic pathways, including starch and sucrose metabolism, secondary metabolism, and hormonal signaling.
高粱是一种二倍体C4谷物(2n = 2x = 20),基因组大小为750 Mbp,广泛适应热带和温带气候。作为高粱的起源和多样性中心,埃塞俄比亚拥有改良产量和营养性状的宝贵遗传变异。本研究旨在鉴定与关键农艺和产量相关性状连锁的数量性状核苷酸(QTNs)及其相关候选基因,并对其进行功能表征。
在埃塞俄比亚西北部,采用α格子设计,在两个季节对216份高粱基因型进行了评估。评估的农艺性状包括开花天数、成熟天数、株高、单株种子数、种子产量和千粒重。通过简化基因组测序(GBS)产生了351,692个单核苷酸多态性(SNPs),保留了50,165个高质量标记。使用生物信息学工具和公开可用数据库相结合的方法进行候选基因鉴定和功能表征。使用META-R和SAS JMP进行数据标准化和分析。通过TASSEL 5.0评估连锁不平衡,并通过多位点全基因组关联研究(ML-GWAS)鉴定与表型性状相关的显著QTNs(LOD≥4.0)。
本研究通过鉴定与表型变异相关的QTNs,研究了高粱关键农艺和产量相关性状的遗传基础。描述性统计显示,开花天数(101天)、成熟天数(145.77天)、株高(357.47厘米)、单株种子数(1808.92粒)、种子产量(45.07克)和千粒重(23.44克)等性状存在显著变异性。相关性分析表明,各性状之间存在很强的相关性,尤其是开花天数和成熟天数之间(r = 0.7058)。ML-GWAS在所有10条染色体上检测到176个QTNs,其中34个由于在多个模型中被一致鉴定而被认为是可靠的。117个候选基因被定位到这些QTNs上,与六个主要性状相关:20个与开花时间相关,16个与成熟相关,16个与株高相关,17个与单株种子数相关,38个与种子产量相关,10个与种子重量相关。关键基因包括Sobic.001G196700(开花时间)和Sobic.005G176100(应激反应)。两个重要的调控基因SbMADS1和SbFT因其在开花调控中的作用而受到关注。SbMADS1影响开花天数,而SbFT作为整合光周期信号的移动信号。这些基因参与淀粉和蔗糖代谢途径,这对能量储存和调动至关重要,从而有助于提高高粱的生长和产量。
本研究突出了由多种遗传因素塑造的性状遗传的复杂性,并强调了主要、稳定和独特的QTNs对标记辅助选择的重要性。功能基因组注释表明,候选基因参与关键的生物学过程和代谢途径,包括淀粉和蔗糖代谢、次生代谢和激素信号传导。