Close Timothy J, Wanamaker Steve I, Caldo Rico A, Turner Stacy M, Ashlock Daniel A, Dickerson Julie A, Wing Rod A, Muehlbauer Gary J, Kleinhofs Andris, Wise Roger P
Department of Botany and Plant Sciences, University of California, Riverside, California 92521, USA.
Plant Physiol. 2004 Mar;134(3):960-8. doi: 10.1104/pp.103.034462.
In recent years, access to complete genomic sequences, coupled with rapidly accumulating data related to RNA and protein expression patterns, has made it possible to determine comprehensively how genes contribute to complex phenotypes. However, for major crop plants, publicly available, standard platforms for parallel expression analysis have been limited. We report the conception and design of the new publicly available, 22K Barley1 GeneChip probe array, a model for plants without a fully sequenced genome. Array content was derived from worldwide contribution of 350,000 high-quality ESTs from 84 cDNA libraries, in addition to 1,145 barley (Hordeum vulgare) gene sequences from the National Center for Biotechnology Information nonredundant database. Conserved sequences expressed in seedlings of wheat (Triticum aestivum), oat (Avena strigosa), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays) were identified that will be valuable in the design of arrays across grasses. To enhance the usability of the data, BarleyBase, a MIAME-compliant, MySQL relational database, serves as a public repository for raw and normalized expression data from the Barley1 GeneChip probe array. Interconnecting links with PlantGDB and Gramene allow BarleyBase users to perform gene predictions using the 21,439 non-redundant Barley1 exemplar sequences or cross-species comparison at the genome level, respectively. We expect that this first generation array will accelerate hypothesis generation and gene discovery in disease defense pathways, responses to abiotic stresses, development, and evolutionary diversity in monocot plants.
近年来,完整基因组序列的获取,再加上与RNA和蛋白质表达模式相关的数据迅速积累,使得全面确定基因如何影响复杂表型成为可能。然而,对于主要农作物而言,公开可用的用于平行表达分析的标准平台一直很有限。我们报告了新型公开可用的22K大麦1基因芯片探针阵列的概念和设计,这是一个针对没有完全测序基因组的植物的模型。阵列内容除了来自美国国家生物技术信息中心非冗余数据库的1145个大麦(Hordeum vulgare)基因序列外,还源自84个cDNA文库中350,000个高质量EST的全球贡献。在小麦(Triticum aestivum)、燕麦(Avena strigosa)、水稻(Oryza sativa)、高粱(Sorghum bicolor)和玉米(Zea mays)幼苗中表达的保守序列被鉴定出来,这些序列在禾本科植物阵列设计中将具有重要价值。为了提高数据的可用性,BarleyBase,一个符合MIAME标准的MySQL关系数据库,作为大麦1基因芯片探针阵列原始和标准化表达数据的公共存储库。与PlantGDB和Gramene的互连链接分别允许BarleyBase用户使用21,439个非冗余大麦1示例序列进行基因预测或在基因组水平进行跨物种比较。我们预计,这第一代阵列将加速单子叶植物疾病防御途径、对非生物胁迫的反应、发育和进化多样性方面的假设生成和基因发现。