*-DCC:一个用于收集、注释和探索多种测序实验的平台。
*-DCC: A platform to collect, annotate, and explore a large variety of sequencing experiments.
机构信息
Department of Biosciences and Nutrition, Karolinska Institutet, NEO, Huddinge SE-141 83, Sweden.
Department of Statistics, University of California Berkeley, 367 Evans Hall,, Berkeley, CA 94720, USA.
出版信息
Gigascience. 2020 Mar 1;9(3). doi: 10.1093/gigascience/giaa024.
BACKGROUND
Over the past few years the variety of experimental designs and protocols for sequencing experiments increased greatly. To ensure the wide usability of the produced data beyond an individual project, rich and systematic annotation of the underlying experiments is crucial.
FINDINGS
We first developed an annotation structure that captures the overall experimental design as well as the relevant details of the steps from the biological sample to the library preparation, the sequencing procedure, and the sequencing and processed files. Through various design features, such as controlled vocabularies and different field requirements, we ensured a high annotation quality, comparability, and ease of annotation. The structure can be easily adapted to a large variety of species. We then implemented the annotation strategy in a user-hosted web platform with data import, query, and export functionality.
CONCLUSIONS
We present here an annotation structure and user-hosted platform for sequencing experiment data, suitable for lab-internal documentation, collaborations, and large-scale annotation efforts.
背景
在过去的几年中,测序实验的实验设计和方案种类大大增加。为了确保生成的数据在单个项目之外具有广泛的可用性,对基础实验进行丰富而系统的注释至关重要。
发现
我们首先开发了一种注释结构,该结构可以捕获整体实验设计以及从生物样本到文库制备、测序过程以及测序和处理文件的各个步骤的相关详细信息。通过各种设计功能,如受控词汇表和不同字段要求,我们确保了高质量、可比性和易于注释的注释。该结构可以轻松适应各种物种。然后,我们在一个具有数据导入、查询和导出功能的用户托管的 Web 平台中实现了注释策略。
结论
我们在这里介绍了一种适合实验室内部文档、协作和大规模注释工作的测序实验数据注释结构和用户托管平台。