Herraiz Francisco J, Blanca José, Ziarsolo Pello, Gramazio Pietro, Plazas Mariola, Anderson Gregory J, Prohens Jaime, Vilanova Santiago
Instituto de Conservación y Mejora de la Agrodiversidad Valenciana, Universitat Politècnica de València, Camino de Vera 14, 46022, Valencia, Spain.
Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06268-3043, USA.
BMC Genomics. 2016 May 4;17:321. doi: 10.1186/s12864-016-2656-8.
Solanum sect. Basarthrum is phylogenetically very close to potatoes (Solanum sect. Petota) and tomatoes (Solanum sect. Lycopersicon), two groups with great economic importance, and for which Solanum sect. Basarthrum represents a tertiary gene pool for breeding. This section includes the important regional cultigen, the pepino (Solanum muricatum), and several wild species. Among the wild species, S. caripense is prominent due to its major involvement in the origin of pepino and its wide geographical distribution. Despite the value of the pepino as an emerging crop, and the potential for gene transfer from both the pepino and S. caripense to potatoes and tomatoes, there has been virtually no genomic study of these species.
Using Illumina HiSeq 2000, RNA-Seq was performed with a pool of three tissues (young leaf, flowers in pre-anthesis and mature fruits) from S. muricatum and S. caripense, generating almost 111,000,000 reads across the two species. A high-quality de novo transcriptome was assembled from the S. muricatum clean reads, resulting in 75,832 unigenes with an average length of 704 bp. These unigenes were functionally annotated based on similarity to sequences in public databases. We used Blast2GO to conduct an exhaustive study of the gene ontology, including GO terms, EC numbers and KEGG pathways. Pepino unigenes were compared to both the potato and tomato genomes in order to determine their estimated relative positions and to infer gene prediction models. Candidate genes related to traits of interest in other Solanaceae were evaluated for presence or absence and compared with S. caripense transcripts. In addition, the phylogeny of pepino and five other members of the family Solanaceae was studied using five genes. The comparison of S. caripense reads against the assembled S. muricatum transcripts revealed thousands of intra- and interspecific nucleotide-level variants. In addition, more than 1000 SSRs were identified in the pepino transcriptome.
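As an illustration of how SSRs can be located in assembled unigenes, the following is a minimal Python sketch of a perfect-repeat search using regular expressions. It is not the tool or the parameter set used in this study: the function name `find_ssrs` and the minimum repeat thresholds in `MIN_REPEATS` are assumptions chosen only for the example.

```python
import re

# Minimum number of repeats required per motif length.
# These thresholds are assumed for illustration; they are not taken from the study.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of motif_len bases followed by at least (min_rep - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AAA", "ATAT"),
            # so that each SSR is reported only under its shortest motif.
            is_compound = any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            )
            if is_compound:
                continue
            repeat_count = len(match.group(0)) // motif_len
            hits.append((match.start(), motif, repeat_count))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical toy sequence, not a real pepino unigene.
    example = "GGC" + "AT" * 7 + "CCGT" + "CAG" * 5 + "TT"
    for start, motif, count in find_ssrs(example):
        print(start, motif, count)
```

Applied to every unigene in a FASTA file, a search of this kind yields candidate marker loci; imperfect and compound repeats, which dedicated SSR tools also report, are outside the scope of this sketch.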
This study represents the first genomic resource for the pepino. We suggest that the data will be useful not only for improvement of the pepino, but also for potato and tomato breeding and gene transfer. The high quality of the transcriptome presented here also facilitates comparative studies in the genus Solanum. The accurate transcript annotation will help to determine the function of genes underlying particular traits of interest. The high number of markers (SSRs and nucleotide-level variants) obtained will be useful for breeding programs, as well as for studies of synteny, diversity evolution, and phylogeny.