Departments of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030.
Texas Children's Cancer Center, Texas Children's Hospital, Houston, TX.
Hum Mutat. 2019 Jan;40(1):73-89. doi: 10.1002/humu.23668. Epub 2018 Nov 8.
MicroRNA (miRNA) expression is frequently deregulated in human disease, in contrast, disease-associated miRNA mutations are understudied. We developed Annotative Database of miRNA Elements, ADmiRE, which combines multiple existing and new biological annotations to aid prioritization of causal miRNA variation. We annotated 10,206 mature (3,257 within seed region) miRNA variants from multiple large sequencing datasets including gnomAD (15,496 genomes; 123,136 exomes). The pattern of miRNA variation closely resembles protein-coding exonic regions, with no difference between intragenic and intergenic miRNAs (P = 0.56), and high confidence miRNAs demonstrate higher sequence constraint (P < 0.001). Conservation analysis across 100 vertebrates identified 765 highly conserved miRNAs that also have limited genetic variation in gnomAD. We applied ADmiRE to the TCGA PanCancerAtlas WES dataset containing over 10,000 individuals across 33 adult cancers and annotated 1,267 germline (rare in gnomAD) and 1,492 somatic miRNA variants. Several miRNA families with deregulated gene expression in cancer have low levels of both somatic and germline variants, e.g., let-7 and miR-10. In addition to known somatic miR-142 mutations in hematologic cancers, we describe novel somatic miR-21 mutations in esophageal cancers impacting downstream miRNA targets. Through the development of ADmiRE, we present a framework for annotation and prioritization of miRNA variation in disease datasets.
微 RNA(miRNA)表达在人类疾病中经常失调,相比之下,与疾病相关的 miRNA 突变研究较少。我们开发了 miRNA 元素注释数据库(ADmiRE),它结合了多个现有的和新的生物学注释,以帮助优先考虑因果 miRNA 变异。我们对来自多个大型测序数据集(包括 gnomAD(15,496 个基因组;123,136 个外显子))的 10,206 个成熟 miRNA 变体(3,257 个位于种子区域内)进行了注释。miRNA 变异的模式与蛋白编码外显子区域非常相似,基因内和基因间 miRNA 之间没有差异(P = 0.56),并且高置信 miRNA 显示出更高的序列约束(P < 0.001)。在 100 种脊椎动物中的保守性分析确定了 765 个高度保守的 miRNA,这些 miRNA 在 gnomAD 中也具有有限的遗传变异。我们将 ADmiRE 应用于 TCGA PanCancerAtlas WES 数据集,该数据集包含超过 10000 个个体,涵盖 33 种成人癌症,并注释了 1,267 个种系(gnomAD 中罕见)和 1,492 个体细胞 miRNA 变体。在癌症中表达失调的几个 miRNA 家族,体细胞和种系变体水平都较低,例如 let-7 和 miR-10。除了血液癌症中已知的体细胞 miR-142 突变外,我们还描述了食管癌中影响下游 miRNA 靶标的新型体细胞 miR-21 突变。通过 ADmiRE 的开发,我们提出了一种在疾病数据集中注释和优先考虑 miRNA 变异的框架。