Department of Food Science and Engineering, Moutai Institute, Luban Street, Renhuai, 564502, Guizhou, People's Republic of China.
BMC Plant Biol. 2022 Nov 3;22(1):513. doi: 10.1186/s12870-022-03901-5.
Genome variation not only plays an important role in plant phenotypic modeling and adaptive evolution, but also enhances population genetic diversity and regulates gene expression. The tea tree (Camellia sinensis) has a large genome (~ 3.0 Gb), making the identification of genome-wide variants time-consuming and expensive. With the continuous publication of a large number of different types of population sequencing data, there is a lack of an open platform to integrate these data and identify variants in the tea plant genome.To integrate the genetic variation confidence in the tea plant population genome, 238 whole-genome resequencing, 213 transcriptome sequencing, and 96 hybrid F1 individuals with a total of more than 20 Tb were collected for mutation site identification. Based on these variations information, we constructed the first tea tree variation web service database TeaPVs ( http://47.106.184.91:8025/ and http://liushang.top:8025/ ). It supports users to search all SNP, Indel, SV mutations and SSR/Polymorphic SSR sequences by location or gene ID. Furthermore, the website also provides the functions of gene expression search of different transcriptome, sequence blast, sequence extraction of CDS and mutation loci, etc.The features of the TeaPVs database make it a comprehensive tea plant genetic variation bioinformatics platform for researchers, and will also be helpful for revealing new functional mutations in the tea plant genome and molecular marker-assisted breeding.
基因组变异不仅在植物表型建模和适应性进化中起着重要作用,而且还增强了群体遗传多样性并调节了基因表达。茶树(Camellia sinensis)具有较大的基因组(约 3.0 Gb),使得全基因组变异的鉴定既耗时又昂贵。随着大量不同类型的群体测序数据的不断发表,缺乏一个开放的平台来整合这些数据并鉴定茶树基因组中的变异。为了整合茶树群体基因组中的遗传变异置信度,我们收集了 238 个全基因组重测序、213 个转录组测序和 96 个杂交 F1 个体,总数据量超过 20 Tb,用于突变位点鉴定。基于这些变异信息,我们构建了第一个茶树变异网络服务数据库 TeaPVs(http://47.106.184.91:8025/ 和 http://liushang.top:8025/)。它支持用户按位置或基因 ID 搜索所有 SNP、Indel、SV 突变和 SSR/多态性 SSR 序列。此外,该网站还提供了不同转录组的基因表达搜索、序列比对、CDS 序列提取和突变位点等功能。TeaPVs 数据库的特点使其成为一个综合性的茶树遗传变异生物信息学平台,可供研究人员使用,也有助于揭示茶树基因组中的新功能突变和分子标记辅助育种。