Center for Bioinformatics, State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing 100871, China.
Department of Bioinformatics, Chongqing Medical University, Chongqing 400016, China; College of Life Sciences, Beijing Normal University, Beijing 100875, China; National Institute of Biological Sciences, Beijing 102206, China.
Genomics Proteomics Bioinformatics. 2020 Apr;18(2):140-149. doi: 10.1016/j.gpb.2020.05.002. Epub 2020 Sep 8.
Mosaic variants resulting from postzygotic mutations are prevalent in the human genome and play important roles in human diseases. However, except for cancer-related variants, there is no collection of postzygotic mosaic variants in noncancer disease-related and healthy individuals. Here, we present MosaicBase, a comprehensive database that includes 6698 mosaic variants related to 266 noncancer diseases and 27,991 mosaic variants identified in 422 healthy individuals. Genomic and phenotypic information of each variant was manually extracted and curated from 383 publications. MosaicBase supports the query of variants with Online Mendelian Inheritance in Man (OMIM) entries, genomic coordinates, gene symbols, or Entrez IDs. We also provide an integrated genome browser for users to easily access mosaic variants and their related annotations for any genomic region. By analyzing the variants collected in MosaicBase, we find that mosaic variants that directly contribute to disease phenotype show features distinct from those of variants in individuals with mild or no phenotypes, in terms of their genomic distribution, mutation signatures, and fraction of mutant cells. MosaicBase will not only assist clinicians in genetic counseling and diagnosis but also provide a useful resource to understand the genomic baseline of postzygotic mutations in the general human population. MosaicBase is publicly available at http://mosaicbase.com/ or http://49.4.21.8:8000.
镶嵌变体是由合子后突变引起的,在人类基因组中很常见,在人类疾病中发挥着重要作用。然而,除了与癌症相关的变体外,在非癌症疾病相关和健康个体中没有收集合子后镶嵌变体。在这里,我们展示了 MosaicBase,这是一个全面的数据库,其中包括与 266 种非癌症疾病相关的 6698 种镶嵌变体,以及在 422 名健康个体中鉴定的 27991 种镶嵌变体。每个变体的基因组和表型信息都是从 383 篇出版物中手动提取和整理的。MosaicBase 支持通过在线孟德尔遗传数据库(OMIM)条目、基因组坐标、基因符号或 Entrez ID 查询变体。我们还为用户提供了一个集成的基因组浏览器,方便用户访问任何基因组区域的镶嵌变体及其相关注释。通过分析 MosaicBase 中收集的变体,我们发现直接导致疾病表型的镶嵌变体在基因组分布、突变特征和突变细胞比例方面与个体中具有轻度或无表型的变体有明显不同。MosaicBase 将不仅有助于临床医生进行遗传咨询和诊断,还为了解一般人群中合子后突变的基因组基线提供了有用的资源。MosaicBase 可在 http://mosaicbase.com/ 或 http://49.4.21.8:8000 上公开获取。