• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从头发现模式可实现对非编码变异功能后果的稳健评估。

De novo pattern discovery enables robust assessment of functional consequences of non-coding variants.

机构信息

Department of Molecular Physiology & Biophysics, Nashville, TN, USA.

Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA.

出版信息

Bioinformatics. 2019 May 1;35(9):1453-1460. doi: 10.1093/bioinformatics/bty826.

DOI:10.1093/bioinformatics/bty826
PMID:30256891
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6499232/
Abstract

MOTIVATION

Given the complexity of genome regions, prioritize the functional effects of non-coding variants remains a challenge. Although several frameworks have been proposed for the evaluation of the functionality of non-coding variants, most of them used 'black boxes' methods that simplify the task as the pathogenicity/benign classification problem, which ignores the distinct regulatory mechanisms of variants and leads to less desirable performance. In this study, we developed DVAR, an unsupervised framework that leverage various biochemical and evolutionary evidence to distinguish the gene regulatory categories of variants and assess their comprehensive functional impact simultaneously.

RESULTS

DVAR performed de novo pattern discovery in high-dimensional data and identified five regulatory clusters of non-coding variants. Leveraging the new insights into the multiple functional patterns, it measures both the between-class and the within-class functional implication of the variants to achieve accurate prioritization. Compared to other two-class learning methods, it showed improved performance in identification of clinically significant variants, fine-mapped GWAS variants, eQTLs and expression-modulating variants. Moreover, it has superior performance on disease causal variants verified by genome-editing (like CRISPR-Cas9), which could provide a pre-selection strategy for genome-editing technologies across the whole genome. Finally, evaluated in BioVU and UK Biobank, two large-scale DNA biobanks linked to complete electronic health records, DVAR demonstrated its effectiveness in prioritizing non-coding variants associated with medical phenotypes.

AVAILABILITY AND IMPLEMENTATION

The C++ and Python source codes, the pre-computed DVAR-cluster labels and DVAR-scores across the whole genome are available at https://www.vumc.org/cgg/dvar.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

由于基因组区域的复杂性,优先考虑非编码变异的功能影响仍然是一个挑战。尽管已经提出了几种评估非编码变异功能的框架,但它们大多数都使用了“黑盒”方法,将致病性/良性分类问题简化为任务,这忽略了变异的不同调节机制,并导致性能不理想。在这项研究中,我们开发了 DVAR,这是一种无监督的框架,利用各种生化和进化证据来区分变体的基因调控类别,并同时评估它们的综合功能影响。

结果

DVAR 在高维数据中进行了全新的模式发现,并确定了五个非编码变异的调控聚类。利用对多种功能模式的新见解,它测量了变体之间的类间和类内功能含义,以实现准确的优先级排序。与其他两种学习方法相比,它在识别临床上有意义的变体、精细映射的 GWAS 变体、eQTLs 和表达调节变体方面表现出了更好的性能。此外,它在经过基因组编辑(如 CRISPR-Cas9)验证的疾病因果变异方面表现出了卓越的性能,这为整个基因组的基因组编辑技术提供了一种预选策略。最后,在两个与完整电子健康记录相关的大型 DNA 生物库(BioVU 和 UK Biobank)中进行评估,DVAR 证明了其在优先考虑与医学表型相关的非编码变异方面的有效性。

可用性和实现

C++和 Python 源代码、预先计算的整个基因组的 DVAR 聚类标签和 DVAR 分数可在 https://www.vumc.org/cgg/dvar 上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
De novo pattern discovery enables robust assessment of functional consequences of non-coding variants.从头发现模式可实现对非编码变异功能后果的稳健评估。
Bioinformatics. 2019 May 1;35(9):1453-1460. doi: 10.1093/bioinformatics/bty826.
2
TVAR: assessing tissue-specific functional effects of non-coding variants with deep learning.TVAR:利用深度学习评估非编码变异的组织特异性功能效应。
Bioinformatics. 2022 Oct 14;38(20):4697-4704. doi: 10.1093/bioinformatics/btac608.
3
Quantifying functional impact of non-coding variants with multi-task Bayesian neural network.使用多任务贝叶斯神经网络量化非编码变异的功能影响。
Bioinformatics. 2020 Mar 1;36(5):1397-1404. doi: 10.1093/bioinformatics/btz767.
4
Predicting regulatory variants with composite statistic.使用复合统计量预测调控变异体。
Bioinformatics. 2016 Sep 15;32(18):2729-36. doi: 10.1093/bioinformatics/btw288. Epub 2016 Jun 6.
5
Openness weighted association studies: leveraging personal genome information to prioritize non-coding variants.基于开放权重的关联研究:利用个人基因组信息对非编码变异进行优先级排序。
Bioinformatics. 2021 Dec 11;37(24):4737-4743. doi: 10.1093/bioinformatics/btab514.
6
DeepPerVar: a multi-modal deep learning framework for functional interpretation of genetic variants in personal genome.DeepPerVar:一个多模态深度学习框架,用于个人基因组中遗传变异的功能解释。
Bioinformatics. 2022 Dec 13;38(24):5340-5351. doi: 10.1093/bioinformatics/btac696.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
CRISPRitz: rapid, high-throughput and variant-aware in silico off-target site identification for CRISPR genome editing.CRISPRitz:CRISPR 基因组编辑的快速、高通量和变体感知的计算机模拟脱靶位点识别。
Bioinformatics. 2020 Apr 1;36(7):2001-2008. doi: 10.1093/bioinformatics/btz867.
9
Exploiting deep transfer learning for the prediction of functional non-coding variants using genomic sequence.利用深度迁移学习,通过基因组序列预测功能非编码变异。
Bioinformatics. 2022 Jun 13;38(12):3164-3172. doi: 10.1093/bioinformatics/btac214.
10
DeepCRISTL: deep transfer learning to predict CRISPR/Cas9 functional and endogenous on-target editing efficiency.DeepCRISTL:深度迁移学习预测 CRISPR/Cas9 功能和内源性靶标编辑效率。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i161-i168. doi: 10.1093/bioinformatics/btac218.

引用本文的文献

1
MIRACN: a residual convolutional neural network for predicting cell line specific functional regulatory variants.MIRACN:一种用于预测细胞系特异性功能调控变异的残差卷积神经网络。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf196.
2
GoFCards: an integrated database and analytic platform for gain of function variants in humans.GoFCards:一个用于人类功能获得性变异的综合数据库和分析平台。
Nucleic Acids Res. 2025 Jan 6;53(D1):D976-D988. doi: 10.1093/nar/gkae1079.
3
In silico generation and augmentation of regulatory variants from massively parallel reporter assay using conditional variational autoencoder.利用条件变分自编码器通过大规模平行报告基因检测在计算机上生成和增强调控变异体。
bioRxiv. 2024 Jun 29:2024.06.25.600715. doi: 10.1101/2024.06.25.600715.
4
Topologically associating domains define the impact of de novo promoter variants on autism spectrum disorder risk.拓扑关联域定义了从头开始的启动子变体对自闭症谱系障碍风险的影响。
Cell Genom. 2024 Feb 14;4(2):100488. doi: 10.1016/j.xgen.2024.100488. Epub 2024 Jan 26.
5
VarCards2: an integrated genetic and clinical database for ACMG-AMP variant-interpretation guidelines in the human whole genome.VarCards2:一个整合的遗传和临床数据库,用于 ACMG-AMP 变异解读指南在人类全基因组中的应用。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1478-D1489. doi: 10.1093/nar/gkad1061.
6
Optimized high-throughput screening of non-coding variants identified from genome-wide association studies.优化高通量筛选全基因组关联研究中鉴定的非编码变异。
Nucleic Acids Res. 2023 Feb 22;51(3):e18. doi: 10.1093/nar/gkac1198.
7
GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies.幽灵复刻推断使全基因组关联研究中假定因果变异的识别成为可能。
Nat Commun. 2022 Nov 23;13(1):7209. doi: 10.1038/s41467-022-34932-z.
8
Exploration of Tools for the Interpretation of Human Non-Coding Variants.人类非编码变异解释工具的探索。
Int J Mol Sci. 2022 Oct 26;23(21):12977. doi: 10.3390/ijms232112977.
9
Multi-omics approach dissects cis-regulatory mechanisms underlying North Carolina macular dystrophy, a retinal enhanceropathy.多组学方法解析北卡罗来纳黄斑营养不良的顺式调控机制,一种视网膜增强病变。
Am J Hum Genet. 2022 Nov 3;109(11):2029-2048. doi: 10.1016/j.ajhg.2022.09.013. Epub 2022 Oct 14.
10
TVAR: assessing tissue-specific functional effects of non-coding variants with deep learning.TVAR:利用深度学习评估非编码变异的组织特异性功能效应。
Bioinformatics. 2022 Oct 14;38(20):4697-4704. doi: 10.1093/bioinformatics/btac608.

本文引用的文献

1
FUN-LDA: A Latent Dirichlet Allocation Model for Predicting Tissue-Specific Functional Effects of Noncoding Variation: Methods and Applications.FUN-LDA:一种用于预测非编码变异组织特异性功能效应的潜在狄利克雷分配模型:方法与应用。
Am J Hum Genet. 2018 May 3;102(5):920-942. doi: 10.1016/j.ajhg.2018.03.026.
2
A benchmark study of scoring methods for non-coding mutations.非编码突变评分方法的基准研究。
Bioinformatics. 2018 May 15;34(10):1635-1641. doi: 10.1093/bioinformatics/bty008.
3
The impact of rare variation on gene expression across tissues.罕见变异对跨组织基因表达的影响。
Nature. 2017 Oct 11;550(7675):239-243. doi: 10.1038/nature24267.
4
SNPDelScore: combining multiple methods to score deleterious effects of noncoding mutations in the human genome.SNPDelScore:结合多种方法对人类基因组中非编码突变的有害效应进行评分。
Bioinformatics. 2018 Jan 15;34(2):289-291. doi: 10.1093/bioinformatics/btx583.
5
Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data.基于功能和群体基因组数据对有害非编码变异进行快速、可扩展的预测。
Nat Genet. 2017 Apr;49(4):618-624. doi: 10.1038/ng.3810. Epub 2017 Mar 13.
6
The non-coding variant rs1800734 enhances DCLK3 expression through long-range interaction and promotes colorectal cancer progression.非编码变异 rs1800734 通过长距离相互作用增强 DCLK3 的表达,促进结直肠癌的进展。
Nat Commun. 2017 Feb 14;8:14418. doi: 10.1038/ncomms14418.
7
DIVAN: accurate identification of non-coding disease-specific risk variants using multi-omics profiles.DIVAN:利用多组学图谱准确识别非编码疾病特异性风险变异体。
Genome Biol. 2016 Dec 6;17(1):252. doi: 10.1186/s13059-016-1112-z.
8
A Functional Variant Associated with Atrial Fibrillation Regulates PITX2c Expression through TFAP2a.一种与心房颤动相关的功能性变体通过TFAP2a调节PITX2c表达。
Am J Hum Genet. 2016 Dec 1;99(6):1281-1291. doi: 10.1016/j.ajhg.2016.10.001. Epub 2016 Nov 17.
9
Cancer driver gene discovery through an integrative genomics approach in a non-parametric Bayesian framework.在非参数贝叶斯框架下通过综合基因组学方法发现癌症驱动基因。
Bioinformatics. 2017 Feb 15;33(4):483-490. doi: 10.1093/bioinformatics/btw662.
10
Functional mechanisms underlying pleiotropic risk alleles at the 19p13.1 breast-ovarian cancer susceptibility locus.位于 19p13.1 乳腺癌卵巢癌易感位点的多效风险等位基因的功能机制。
Nat Commun. 2016 Sep 7;7:12675. doi: 10.1038/ncomms12675.