Department of Human Genetics, Leiden University Medical Center, Leiden, Netherlands.
Department of Clinical Genetics, Leiden University Medical Center, Leiden, Netherlands.
Eur J Hum Genet. 2021 Dec;29(12):1796-1803. doi: 10.1038/s41431-021-00959-x. Epub 2021 Sep 15.
Gene variant databases are the backbone of DNA-based diagnostics. These databases, also called Locus-Specific DataBases (LSDBs), store information on variants in the human genome and the observed phenotypic consequences. The largest collection of public databases uses the free, open-source LOVD software platform. To cope with the current demand for online databases, we have entirely redesigned the LOVD software. LOVD3 is genome-centered and can be used to store summary variant data, as well as full case-level data with information on individuals, phenotypes, screenings, and variants. While built on a standard core, the software is highly flexible and allows personalization to cope with the largely different demands of gene/disease database curators. LOVD3 follows current standards and includes tools to check variant descriptions, generate HTML files of reference sequences, predict the consequences of exon deletions/duplications on the reading frame, and link to genomic views in the different genomes browsers. It includes APIs to collect and submit data. The software is used by about 100 databases, of which 56 public LOVD instances are registered on our website and together contain 1,000,000,000 variant observations in 1,500,000 individuals. 42 LOVD instances share data with the federated LOVD data network containing 3,000,000 unique variants in 23,000 genes. This network can be queried directly, quickly identifying LOVD instances containing relevant information on a searched variant.
基因变异数据库是基于 DNA 的诊断学的基础。这些数据库,也称为特定基因座数据库 (LSDBs),存储人类基因组中变异的信息和观察到的表型后果。最大的公共数据库集合使用免费的开源 LOVD 软件平台。为了满足当前对在线数据库的需求,我们完全重新设计了 LOVD 软件。LOVD3 以基因组为中心,可以用于存储摘要变异数据,以及包含个体、表型、筛查和变异信息的完整病例级数据。虽然基于标准核心构建,但该软件具有高度的灵活性,并允许个性化以应对基因/疾病数据库管理员的需求差异。LOVD3 遵循当前的标准,并包括检查变异描述、生成参考序列 HTML 文件、预测外显子缺失/重复对阅读框的影响以及链接到不同基因组浏览器中的基因组视图的工具。它包括用于收集和提交数据的 API。该软件被大约 100 个数据库使用,其中 56 个公共 LOVD 实例在我们的网站上注册,总共包含 150 万个体中的 10 亿个变异观察值。42 个 LOVD 实例与联邦 LOVD 数据网络共享数据,该网络包含 23000 个基因中的 300 万个独特变异。可以直接查询该网络,快速识别包含搜索变异相关信息的 LOVD 实例。