Pan Jian-Bo, Hu Shi-Chang, Shi Dan, Cai Mei-Chun, Li Yin-Bo, Zou Quan, Ji Zhi-Liang
Department of Chemical Biology, College of Chemistry and Chemical Engineering, The Key Laboratory for Chemical Biology of Fujian Province, Xiamen University, Xiamen, Fujian, PR China.
PLoS One. 2013 Dec 2;8(12):e80747. doi: 10.1371/journal.pone.0080747. eCollection 2013.
Pattern genes are a group of genes that have a modularized expression behavior under serial physiological conditions. The identification of pattern genes will provide a path toward a global and dynamic understanding of gene functions and their roles in particular biological processes or events, such as development and pathogenesis. In this study, we present PaGenBase, a novel repository for the collection of tissue- and time-specific pattern genes, including specific genes, selective genes, housekeeping genes and repressed genes. The PaGenBase database is now freely accessible at http://bioinf.xmu.edu.cn/PaGenBase/. In the current version (PaGenBase 1.0), the database contains 906,599 pattern genes derived from the literature or from data mining of more than 1,145,277 gene expression profiles in 1,062 distinct samples collected from 11 model organisms. Four statistical parameters were used to quantitatively evaluate the pattern genes. Moreover, three methods (quick search, advanced search and browse) were designed for rapid and customized data retrieval. The potential applications of PaGenBase are also briefly described. In summary, PaGenBase will serve as a resource for the global and dynamic understanding of gene function and will facilitate high-level investigations in a variety of fields, including the study of development, pathogenesis and novel drug discovery.
模式基因是一组在一系列生理条件下具有模块化表达行为的基因。模式基因的鉴定将为全面、动态地理解基因功能及其在特定生物学过程或事件(如发育和发病机制)中的作用提供一条途径。在本研究中,我们展示了PaGenBase,这是一个用于收集组织和时间特异性模式基因的新型数据库,包括特异性基因、选择性基因、管家基因和抑制基因。PaGenBase数据库现在可通过http://bioinf.xmu.edu.cn/PaGenBase/免费访问。在当前版本(PaGenBase 1.0)中,该数据库包含从文献或从11种模式生物收集的1062个不同样本中的1145277多个基因表达谱的数据挖掘中获得的906599个模式基因。使用四个统计参数对模式基因进行定量评估。此外,还设计了三种方法(快速搜索、高级搜索和浏览)用于快速和定制的数据检索。还简要描述了PaGenBase的潜在应用。总之,PaGenBase将作为一个资源,用于全面、动态地理解基因功能,并将促进包括发育、发病机制研究和新药发现在内的各种领域的高级研究。