Department of Molecular Biology and Genetics, Necmettin Erbakan University, Meram, Konya, 42090, Turkey.
Department of Biotechnology, Necmettin Erbakan University, Meram, Konya, 42090, Turkey.
Sci Rep. 2021 Feb 15;11(1):3818. doi: 10.1038/s41598-021-83113-3.
Quince (Cydonia oblonga Mill.) is the sole member of the genus Cydonia in the Rosacea family and closely related to the major pome fruits, apple (Malus domestica Borkh.) and pear (Pyrus communis L.). In the present work, whole genome shotgun paired-end sequencing was employed in order to assemble the first draft genome of quince. A genome assembly that spans 488.4 Mb of sequence corresponding to 71.2% of the estimated genome size (686 Mb) was produced in the study. Gene predictions via ab initio and homology-based sequence annotation strategies resulted in the identification of 25,428 and 30,684 unique putative protein coding genes, respectively. 97.4 and 95.6% of putative homologs of Arabidopsis and rice transcription factors were identified in the ab initio predicted genic sequences. Different machine learning algorithms were tested for classifying pre-miRNA (precursor microRNA) coding sequences, identifying Support Vector Machine (SVM) as the best performing classifier. SVM classification predicted 600 putative pre-miRNA coding loci. Repetitive DNA content of the assembly was also characterized. The first draft assembly of the quince genome produced in this work would constitute a foundation for functional genomic research in quince toward dissecting the genetic basis of important traits and performing genomics-assisted breeding.
榅桲(Cydonia oblonga Mill.)是蔷薇科榅桲属的唯一成员,与主要的仁果类水果苹果(Malus domestica Borkh.)和梨(Pyrus communis L.)密切相关。本研究采用全基因组鸟枪法测序,以组装榅桲的首个基因组草图。该研究产生的基因组组装序列跨越了 488.4 Mb,相当于估计基因组大小(686 Mb)的 71.2%。通过从头预测和基于同源序列的注释策略,分别鉴定出了 25428 个和 30684 个独特的假定蛋白编码基因。在从头预测的基因序列中,鉴定出了拟南芥和水稻转录因子的同源物分别为 97.4%和 95.6%。针对分类前体 microRNA(precursor microRNA,pre-miRNA)编码序列,测试了不同的机器学习算法,发现支持向量机(Support Vector Machine,SVM)是表现最好的分类器。SVM 分类预测了 600 个假定的 pre-miRNA 编码基因座。组装重复 DNA 含量也得到了表征。本研究中生成的榅桲基因组首个草图将为榅桲功能基因组研究奠定基础,有助于解析重要性状的遗传基础,并进行基于基因组的育种。