Zhejiang Provincial Key Laboratory of Crop Genetic Resources, Institute of Crop Science, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, Zhejiang 310058, China.
Tea Research Institute, Chinese Academy of Agricultural Science, Hangzhou 310008, China.
Database (Oxford). 2022 Sep 12;2022. doi: 10.1093/database/baac080.
Rapid advances in sequencing technology, including next-generation sequencing (NGS), have greatly improved sequencing efficiency and reduced cost. Consequently, huge amounts of genomic, transcriptomic and epigenetic data on cotton species have been generated and released. These large-scale data provide immense opportunities for studying cotton genome structure and evolution, population genetic diversity and genome-wide mining of elite genes for important traits. However, the complexity of NGS data also poses a challenge, because the data cannot be used easily. Here, we present the cotton omics data platform COTTONOMICS (http://cotton.zju.edu.cn/), an easily accessible web database that integrates 32.5 TB of omics data, including seven assembled genomes, resequencing data from 1180 allotetraploid cotton accessions, and data from RNA sequencing (RNA-seq), small RNA sequencing (smRNA-seq), chromatin immunoprecipitation sequencing (ChIP-seq), DNase hypersensitive site sequencing (DNase-seq) and bisulfite sequencing (BS-seq). COTTONOMICS supports a range of search scenarios for retrieving information on the cotton genomes, genomic variation (single-nucleotide polymorphisms (SNPs) and insertions and deletions (InDels)), gene expression, smRNA expression, epigenetic regulation and quantitative trait loci (QTLs). The user-friendly web interface offers modules for storing, retrieving, analyzing and visualizing cotton multi-omics data, enabling users to investigate cotton population genetics and identify potential novel genes that influence agronomically beneficial traits. Database URL: http://cotton.zju.edu.cn.