Savadi Siddanna, Muralidhara B M, Venkataravanappa V, Adiga J D
ICAR-Directorate of Cashew Research (DCR), Puttur, Karnataka, India.
ICAR-Indian Institute of Horticultural Research (IIHR), CHES, Madikeri, Karnataka, India.
Front Plant Sci. 2023 Aug 21;14:1242025. doi: 10.3389/fpls.2023.1242025. eCollection 2023.
Cashew is an edible tree nut crop with a wide range of food and industrial applications. Despite its great economic importance, a genome-wide characterization of microsatellites [simple sequence repeats (SSRs)] in cashew has been lacking. In this study, we carried out the first comprehensive genome-wide characterization of microsatellites/SSRs in cashew and developed polymorphic markers and a web-based microsatellite database. A total of 54,526 SSRs were discovered in the cashew genome, with a mean frequency of 153 SSRs/Mb. Among the mined genome-wide SSRs (motifs of 2-6 bp), dinucleotide repeat motifs were dominant (68.98%), followed by trinucleotides (24.56%). Class I SSRs (≥20 bp) accounted for 45.10% and Class II SSRs (≥12 to <20 bp) for 54.89% of the total genomic SSRs discovered here. Further, AT-rich SSRs occurred more frequently in the cashew genome (84%) than GC-rich SSRs. Validation of the mined genome-wide SSRs by PCR screening in cashew genotypes resulted in the development of 59 polymorphic SSR markers, whose polymorphism information content (PIC) ranged from 0.19 to 0.84. Further, a web-based database, the "Cashew Microsatellite Database (CMDB)," was constructed to give the research community access, through a user-friendly searchable interface, to the genome-wide SSRs mined in this study as well as the transcriptome-based SSRs from our previous study. CMDB also provides information on experimentally validated SSRs and permits the retrieval of SSR marker information with customized search options. Altogether, the genome-wide SSR characterization, the polymorphic markers, and the CMDB database developed in this study should serve as valuable marker resources for DNA fingerprinting, germplasm characterization, genetic studies, and molecular breeding in cashew and related species.
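The length classes and the PIC statistic reported above can be illustrated with a minimal Python sketch. It is not the pipeline used in the study: the abstract does not name the mining tool or its repeat-count thresholds, so the sketch simply assumes that any perfect repeat of a 2-6 bp motif spanning at least 12 bp counts as an SSR, and it assumes the widely used Botstein et al. (1980) formula for PIC. The function names (`find_ssrs`, `ssr_class`, `pic`) are hypothetical.

```python
import re
from math import ceil

MIN_LEN = 12     # assumption: lower bound of Class II (>=12 bp) is the mining cutoff
CLASS_I_MIN = 20 # from the abstract: Class I SSRs are >=20 bp


def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit (e.g. 'ATAT', 'AA')."""
    return motif not in (motif + motif)[1:-1]


def ssr_class(length):
    """Length class as defined in the abstract: I (>=20 bp) or II (>=12 to <20 bp)."""
    return "I" if length >= CLASS_I_MIN else "II"


def find_ssrs(seq, min_len=MIN_LEN):
    """Scan a DNA string for perfect SSRs with 2-6 bp motifs.

    Simplification: compound, imperfect, and overlapping repeats are not handled.
    """
    seq = seq.upper()
    hits = []
    for k in range(2, 7):
        min_repeats = ceil(min_len / k)
        # One motif copy followed by at least (min_repeats - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if not is_primitive(motif):
                continue  # already reported (or excluded) under a shorter motif
            length = m.end() - m.start()
            hits.append({
                "motif": motif,
                "start": m.start(),  # 0-based position
                "repeats": length // k,
                "length": length,
                "ssr_class": ssr_class(length),
            })
    return sorted(hits, key=lambda h: h["start"])


def pic(allele_freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2 (Botstein et al. formula, assumed)."""
    sq = [p * p for p in allele_freqs]
    cross = sum(2 * sq[i] * sq[j]
                for i in range(len(sq)) for j in range(i + 1, len(sq)))
    return 1 - sum(sq) - cross


if __name__ == "__main__":
    demo = "GG" + "AT" * 6 + "CCGT" + "AAG" * 7 + "TT"
    for hit in find_ssrs(demo):
        print(hit)
    print(pic([0.5, 0.5]))
```

In the toy sequence, the `(AT)6` tract is 12 bp long and falls in Class II, while the `(AAG)7` tract is 21 bp long and falls in Class I. A marker with two equally frequent alleles has a PIC of 0.375 under this formula, which gives a sense of scale for the reported range of 0.19 to 0.84.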