Lawrence Berkeley National Laboratory, Genomics Division, 1 Cyclotron Road MS64-121, Berkeley, CA 94720, USA.
Database (Oxford). 2011 Aug 19;2011:bar023. doi: 10.1093/database/bar023. Print 2011.
The model organism Encyclopedia of DNA Elements (modENCODE) project is a National Human Genome Research Institute (NHGRI) initiative designed to characterize the genomes of Drosophila melanogaster and Caenorhabditis elegans. A Data Coordination Center (DCC) was created to collect, store and catalog modENCODE data. An effective DCC must gather, organize and provide all primary, interpreted and analyzed data, and ensure the community is supplied with the knowledge of the experimental conditions, protocols and verification checks used to generate each primary data set. We present here the design principles of the modENCODE DCC, and describe the ramifications of collecting thorough and deep metadata for describing experiments, including the use of a wiki for capturing protocol and reagent information, and the BIR-TAB specification for linking biological samples to experimental results. modENCODE data can be found at http://www.modencode.org.
模式生物百科全书 DNA 元件计划(modENCODE)是一个由国立人类基因组研究所(NHGRI)发起的项目,旨在对黑腹果蝇和秀丽隐杆线虫的基因组进行特征描述。创建了一个数据协调中心(DCC)来收集、存储和编目 modENCODE 数据。一个有效的 DCC 必须收集、组织和提供所有的原始、解释和分析数据,并确保为社区提供用于生成每个原始数据集的实验条件、协议和验证检查的知识。我们在这里介绍了 modENCODE DCC 的设计原则,并描述了为描述实验收集全面而深入的元数据的影响,包括使用维基来捕获协议和试剂信息,以及使用 BIR-TAB 规范将生物样本链接到实验结果。modENCODE 数据可以在 http://www.modencode.org 找到。