International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines.
Institut de recherche pour le développement (IRD), University of Montpellier, DIADE, IPME, Montpellier, France.
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz028.
Rice molecular genetics, breeding, genetic diversity, and allied research (such as rice-pathogen interaction) have adopted sequencing technologies and high-density genotyping platforms for genome variation analysis and gene discovery. Germplasm collections representing rice diversity, improved varieties, and elite breeding materials are accessible through rice gene banks for use in research and breeding, with many having genome sequences and high-density genotype data available. Combining phenotypic and genotypic information on these accessions enables genome-wide association analysis, which is driving quantitative trait loci discovery and molecular marker development. Comparative sequence analyses across quantitative trait loci regions facilitate the discovery of novel alleles. Analyses involving DNA sequences and large genotyping matrices for thousands of samples, however, pose a challenge to non-computer savvy rice researchers.
The Rice Galaxy resource has shared datasets that include high-density genotypes from the 3,000 Rice Genomes project and sequences with corresponding annotations from 9 published rice genomes. The Rice Galaxy web server and deployment installer includes tools for designing single-nucleotide polymorphism assays, analyzing genome-wide association studies, population diversity, rice-bacterial pathogen diagnostics, and a suite of published genomic prediction methods. A prototype Rice Galaxy compliant to Open Access, Open Data, and Findable, Accessible, Interoperable, and Reproducible principles is also presented.
Rice Galaxy is a freely available resource that empowers the plant research community to perform state-of-the-art analyses and utilize publicly available big datasets for both fundamental and applied science.
水稻分子遗传学、育种、遗传多样性和相关研究(如水稻-病原体相互作用)已经采用测序技术和高密度基因分型平台进行基因组变异分析和基因发现。通过水稻基因库,可以获得代表水稻多样性、改良品种和优秀育种材料的种质资源,用于研究和育种,其中许多都有基因组序列和高密度基因型数据。结合这些资源的表型和基因型信息,可以进行全基因组关联分析,从而推动数量性状位点的发现和分子标记的开发。在数量性状位点区域进行比较序列分析有助于发现新的等位基因。然而,涉及数千个样本的 DNA 序列和大型基因分型矩阵的分析对不熟悉计算机的水稻研究人员来说是一个挑战。
Rice Galaxy 资源共享了数据集,包括来自 3000 个水稻基因组项目的高密度基因型和来自 9 个已发表的水稻基因组的序列及其相应注释。Rice Galaxy 网络服务器和部署安装程序包括用于设计单核苷酸多态性检测、全基因组关联研究分析、群体多样性、水稻-细菌病原体诊断以及一系列已发表的基因组预测方法的工具。还提出了一个符合开放获取、开放数据、可查找、可访问、互操作和可重复原则的 Rice Galaxy 原型。
Rice Galaxy 是一个免费的资源,使植物研究社区能够进行最先进的分析,并利用公共可用的大型数据集进行基础和应用科学研究。