基于染色质可及性的疾病特异性非编码 GWAS 变体优先级排序。

Disease-specific prioritization of non-coding GWAS variants based on chromatin accessibility.

机构信息

Department of Computational & Systems Biology and Center for Evolutionary Biology and Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA; Department of Human Genetics, University of Pittsburgh School of Public Health, Pittsburgh, PA, USA.

Children's Hospital of Philadelphia, Philadelphia, PA, USA.

出版信息

HGG Adv. 2024 Jul 18;5(3):100310. doi: 10.1016/j.xhgg.2024.100310. Epub 2024 May 21.

DOI:10.1016/j.xhgg.2024.100310

PMID:38773771

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11259938/

Abstract

Non-protein-coding genetic variants are a major driver of the genetic risk for human disease; however, identifying which non-coding variants contribute to diseases and their mechanisms remains challenging. In silico variant prioritization methods quantify a variant's severity, but for most methods, the specific phenotype and disease context of the prediction remain poorly defined. For example, many commonly used methods provide a single, organism-wide score for each variant, while other methods summarize a variant's impact in certain tissues and/or cell types. Here, we propose a complementary disease-specific variant prioritization scheme, which is motivated by the observation that variants contributing to disease often operate through specific biological mechanisms. We combine tissue/cell-type-specific variant scores (e.g., GenoSkyline, FitCons2, DNA accessibility) into disease-specific scores with a logistic regression approach and apply it to ∼25,000 non-coding variants spanning 111 diseases. We show that this disease-specific aggregation significantly improves the association of common non-coding genetic variants with disease (average precision: 0.151, baseline = 0.09), compared with organism-wide scores (GenoCanyon, LINSIGHT, GWAVA, Eigen, CADD; average precision: 0.129, baseline = 0.09). Further on, disease similarities based on data-driven aggregation weights highlight meaningful disease groups, and it provides information about tissues and cell types that drive these similarities. We also show that so-learned similarities are complementary to genetic similarities as quantified by genetic correlation. Overall, our approach demonstrates the strengths of disease-specific variant prioritization, leads to improvement in non-coding variant prioritization, and enables interpretable models that link variants to disease via specific tissues and/or cell types.

摘要

非蛋白编码遗传变异是人类疾病遗传风险的主要驱动因素；然而，确定哪些非编码变异导致疾病及其机制仍然具有挑战性。基于计算机的变异优先级方法量化了变异的严重程度，但对于大多数方法而言，预测的具体表型和疾病背景仍然定义不明确。例如，许多常用的方法为每个变异提供一个单一的、全器官的评分，而其他方法则在某些组织和/或细胞类型中总结变异的影响。在这里，我们提出了一种互补的疾病特异性变异优先级方案，这是受到以下观察结果的启发：导致疾病的变异通常通过特定的生物学机制起作用。我们使用逻辑回归方法将组织/细胞类型特异性变异评分（例如，GenoSkyline、FitCons2、DNA 可及性）组合成疾病特异性评分，并将其应用于跨越 111 种疾病的约 25000 个非编码变体。我们表明，与全器官评分（GenoCanyon、LINSIGHT、GWAVA、Eigen、CADD；平均精度：0.129，基线= 0.09）相比，这种疾病特异性聚集显著提高了常见非编码遗传变异与疾病的关联（平均精度：0.151，基线= 0.09）。此外，基于数据驱动的聚集权重的疾病相似性突出了有意义的疾病组，并提供了有关驱动这些相似性的组织和细胞类型的信息。我们还表明，如此学习到的相似性与遗传相关性量化的遗传相似性是互补的。总体而言，我们的方法展示了疾病特异性变异优先级的优势，导致非编码变异优先级的改进，并提供了可解释的模型，通过特定的组织和/或细胞类型将变体与疾病联系起来。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61c8/11259938/b87f92bb9d7f/gr1.jpg

相似文献

Disease-specific prioritization of non-coding GWAS variants based on chromatin accessibility.基于染色质可及性的疾病特异性非编码 GWAS 变体优先级排序。

HGG Adv. 2024 Jul 18;5(3):100310. doi: 10.1016/j.xhgg.2024.100310. Epub 2024 May 21.

cepip: context-dependent epigenomic weighting for prioritization of regulatory variants and disease-associated genes.cepip：用于调控变异和疾病相关基因优先级排序的上下文依赖表观基因组加权法

Genome Biol. 2017 Mar 16;18(1):52. doi: 10.1186/s13059-017-1177-3.

Comprehensive functional annotation of susceptibility variants associated with asthma.全面注释与哮喘相关的易感性变异。

Hum Genet. 2020 Aug;139(8):1037-1053. doi: 10.1007/s00439-020-02151-5. Epub 2020 Apr 2.

Critical assessment of variant prioritization methods for rare disease diagnosis within the rare genomes project.对罕见基因组项目中罕见病诊断的变异优先级方法的批判性评估。

Hum Genomics. 2024 Apr 29;18(1):44. doi: 10.1186/s40246-024-00604-w.

DIVAN: accurate identification of non-coding disease-specific risk variants using multi-omics profiles.DIVAN：利用多组学图谱准确识别非编码疾病特异性风险变异体。

Genome Biol. 2016 Dec 6;17(1):252. doi: 10.1186/s13059-016-1112-z.

Incorporating Non-Coding Annotations into Rare Variant Analysis.将非编码注释纳入罕见变异分析。

PLoS One. 2016 Apr 29;11(4):e0154181. doi: 10.1371/journal.pone.0154181. eCollection 2016.

Regulatory Single-Nucleotide Variant Predictor Increases Predictive Performance of Functional Regulatory Variants.调控单核苷酸变异预测器提高了功能性调控变异的预测性能。

Hum Mutat. 2016 Nov;37(11):1137-1143. doi: 10.1002/humu.23049. Epub 2016 Aug 31.

In silico searching for disease-associated functional DNA variants.通过计算机模拟搜索与疾病相关的功能性DNA变异体。

Methods Mol Biol. 2011;760:239-50. doi: 10.1007/978-1-61779-176-5_15.

A semi-supervised approach for predicting cell-type specific functional consequences of non-coding variation using MPRAs.基于 MPRAs 的半监督方法，用于预测非编码变异的细胞类型特异性功能后果。

Nat Commun. 2018 Dec 5;9(1):5199. doi: 10.1038/s41467-018-07349-w.

Integrative Tissue-Specific Functional Annotations in the Human Genome Provide Novel Insights on Many Complex Traits and Improve Signal Prioritization in Genome Wide Association Studies.人类基因组中的综合组织特异性功能注释为许多复杂性状提供了新见解，并改善了全基因组关联研究中的信号优先级。

PLoS Genet. 2016 Apr 8;12(4):e1005947. doi: 10.1371/journal.pgen.1005947. eCollection 2016 Apr.

引用本文的文献

"Frustratingly easy" domain adaptation for cross-species transcription factor binding prediction.用于跨物种转录因子结合预测的“简单到令人沮丧”的域适应

bioRxiv. 2025 May 26:2025.05.21.655414. doi: 10.1101/2025.05.21.655414.

Non-coding variation in dementias: mechanisms, insights, and challenges.痴呆症中的非编码变异：机制、见解与挑战。

NPJ Dement. 2025;1(1):9. doi: 10.1038/s44400-025-00012-4. Epub 2025 Jun 3.

Update on the genetics of allergic diseases.过敏性疾病遗传学的最新进展。

J Allergy Clin Immunol. 2025 Jun;155(6):1738-1752. doi: 10.1016/j.jaci.2025.03.012. Epub 2025 Mar 24.

本文引用的文献

A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies.一种用于检测大规模全基因组测序研究中非编码稀有变异关联的框架。

Nat Methods. 2022 Dec;19(12):1599-1611. doi: 10.1038/s41592-022-01640-x. Epub 2022 Oct 27.

Role of B-Cell in the Pathogenesis of Systemic Sclerosis.B 细胞在系统性硬化症发病机制中的作用。

Front Immunol. 2022 Jul 12;13:933468. doi: 10.3389/fimmu.2022.933468. eCollection 2022.

Anorexia Nervosa and Autism Spectrum Disorder: A Systematic Review.神经性厌食症与自闭症谱系障碍：系统综述。

Psychiatry Res. 2021 Dec;306:114271. doi: 10.1016/j.psychres.2021.114271. Epub 2021 Nov 10.

Investigating the shared genetic architecture between multiple sclerosis and inflammatory bowel diseases.研究多发性硬化症和炎症性肠病之间的共享遗传结构。

Nat Commun. 2021 Sep 24;12(1):5641. doi: 10.1038/s41467-021-25768-0.

Detection of Genetic Overlap Between Rheumatoid Arthritis and Systemic Lupus Erythematosus Using GWAS Summary Statistics.利用全基因组关联研究汇总统计数据检测类风湿性关节炎和系统性红斑狼疮之间的遗传重叠

Front Genet. 2021 Mar 18;12:656545. doi: 10.3389/fgene.2021.656545. eCollection 2021.

Genome-wide genetic links between amyotrophic lateral sclerosis and autoimmune diseases.肌萎缩侧索硬化症与自身免疫性疾病之间的全基因组遗传联系。

BMC Med. 2021 Feb 5;19(1):27. doi: 10.1186/s12916-021-01903-y.

Regulatory genomic circuitry of human disease loci by integrative epigenomics.通过整合表观基因组学研究人类疾病相关位点的调控基因组回路。

Nature. 2021 Feb;590(7845):300-307. doi: 10.1038/s41586-020-03145-z. Epub 2021 Feb 3.

Innate Lymphoid Cells and Celiac Disease: Current Perspective.固有淋巴细胞与乳糜泻：最新研究进展

Cell Mol Gastroenterol Hepatol. 2021;11(3):803-814. doi: 10.1016/j.jcmgh.2020.12.002. Epub 2020 Dec 10.

Monocytes as Potential Mediators of Pathogen-Induced T-Helper 17 Differentiation in Patients With Primary Sclerosing Cholangitis (PSC).原发性硬化性胆管炎患者中单核细胞作为病原体诱导辅助性 T 细胞 17 分化的潜在介质。

Hepatology. 2020 Oct;72(4):1310-1326. doi: 10.1002/hep.31140. Epub 2020 Oct 8.

Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale.大规模全基因组测序研究中通过多种计算功能注释的动态整合增强罕见变异关联分析。

Nat Genet. 2020 Sep;52(9):969-983. doi: 10.1038/s41588-020-0676-4. Epub 2020 Aug 24.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于染色质可及性的疾病特异性非编码 GWAS 变体优先级排序。

Disease-specific prioritization of non-coding GWAS variants based on chromatin accessibility.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献