• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TIVAN-indel:一种注释和预测非编码调控小插入和缺失的计算框架。

TIVAN-indel: a computational framework for annotating and predicting non-coding regulatory small insertions and deletions.

机构信息

Department of Computer Science, Indiana University, Bloomington, IN 47405, USA.

Department of Biostatistics, University of Florida, Gainesville, FL 32603, USA.

出版信息

Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad060.

DOI:10.1093/bioinformatics/btad060
PMID:36707993
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9900211/
Abstract

MOTIVATION

Small insertion and deletion (sindel) of human genome has an important implication for human disease. One important mechanism for non-coding sindel (nc-sindel) to have an impact on human diseases and phenotypes is through the regulation of gene expression. Nevertheless, current sequencing experiments may lack statistical power and resolution to pinpoint the functional sindel due to lower minor allele frequency or small effect size. As an alternative strategy, a supervised machine learning method can identify the otherwise masked functional sindels by predicting their regulatory potential directly. However, computational methods for annotating and predicting the regulatory sindels, especially in the non-coding regions, are underdeveloped.

RESULTS

By leveraging labeled nc-sindels identified by cis-expression quantitative trait loci analyses across 44 tissues in Genotype-Tissue Expression (GTEx), and a compilation of both generic functional annotations and large-scale epigenomic profiles, we develop TIssue-specific Variant Annotation for Non-coding indel (TIVAN-indel), which is a supervised computational framework for predicting non-coding regulatory sindels. As a result, we demonstrate that TIVAN-indel achieves the best prediction performance in both with-tissue prediction and cross-tissue prediction. As an independent evaluation, we train TIVAN-indel from the 'Whole Blood' tissue in GTEx and test the model using 15 immune cell types from an independent study named Database of Immune Cell Expression. Lastly, we perform an enrichment analysis for both true and predicted sindels in key regulatory regions such as chromatin interactions, open chromatin regions and histone modification sites, and find biologically meaningful enrichment patterns.

AVAILABILITY AND IMPLEMENTATION

https://github.com/lichen-lab/TIVAN-indel.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

人类基因组的小插入和缺失(sindel)对人类疾病有重要影响。非编码 sindel(nc-sindel)影响人类疾病和表型的一个重要机制是通过调节基因表达。然而,由于较小的次要等位基因频率或较小的效应大小,当前的测序实验可能缺乏识别功能 sindel 的统计能力和分辨率。作为一种替代策略,监督机器学习方法可以通过直接预测其调节潜力来识别否则被掩盖的功能 sindel。然而,注释和预测调节 sindel 的计算方法,特别是在非编码区域,还不够发达。

结果

通过利用 cis 表达数量性状基因座分析在 44 种组织中鉴定的标记 nc-sindel,以及通用功能注释和大规模表观基因组谱的综合,我们开发了 TIssue-specific Variant Annotation for Non-coding indel(TIVAN-indel),这是一种用于预测非编码调节 sindel 的监督计算框架。结果表明,TIVAN-indel 在组织内预测和跨组织预测中均具有最佳的预测性能。作为独立评估,我们从 GTEx 中的“全血”组织中训练 TIVAN-indel,并使用名为 Database of Immune Cell Expression 的独立研究中的 15 种免疫细胞类型测试模型。最后,我们对关键调节区域(如染色质相互作用、开放染色质区域和组蛋白修饰位点)中的真实和预测的 sindel 进行富集分析,并发现了有生物学意义的富集模式。

可用性和实现

https://github.com/lichen-lab/TIVAN-indel。

补充信息

补充数据可在 Bioinformatics 在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/192eef9b17f9/btad060f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/cb40d5341ee1/btad060f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/73545b09b6e4/btad060f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/58c08d32e79a/btad060f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/b3c135908c4b/btad060f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/67a6fe8c5970/btad060f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/32cb51142a6a/btad060f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/2800583ce979/btad060f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/192eef9b17f9/btad060f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/cb40d5341ee1/btad060f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/73545b09b6e4/btad060f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/58c08d32e79a/btad060f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/b3c135908c4b/btad060f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/67a6fe8c5970/btad060f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/32cb51142a6a/btad060f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/2800583ce979/btad060f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b93e/9900211/192eef9b17f9/btad060f8.jpg

相似文献

1
TIVAN-indel: a computational framework for annotating and predicting non-coding regulatory small insertions and deletions.TIVAN-indel:一种注释和预测非编码调控小插入和缺失的计算框架。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad060.
2
TIVAN: tissue-specific cis-eQTL single nucleotide variant annotation and prediction.TIVAN:组织特异性顺式-eQTL 单核苷酸变异体注释和预测。
Bioinformatics. 2019 May 1;35(9):1573-1575. doi: 10.1093/bioinformatics/bty872.
3
TVAR: assessing tissue-specific functional effects of non-coding variants with deep learning.TVAR:利用深度学习评估非编码变异的组织特异性功能效应。
Bioinformatics. 2022 Oct 14;38(20):4697-4704. doi: 10.1093/bioinformatics/btac608.
4
DeepPHiC: predicting promoter-centered chromatin interactions using a novel deep learning approach.DeepPHiC:使用新型深度学习方法预测以启动子为中心的染色质相互作用。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac801.
5
Predicting target genes of non-coding regulatory variants with IRT.利用 IRT 预测非编码调控变体的靶基因。
Bioinformatics. 2020 Aug 15;36(16):4440-4448. doi: 10.1093/bioinformatics/btaa254.
6
EpiSAFARI: sensitive detection of valleys in epigenetic signals for enhancing annotations of functional elements.EpiSAFARI:用于增强功能元件注释的表观遗传信号中谷值的灵敏检测。
Bioinformatics. 2020 Feb 15;36(4):1014-1021. doi: 10.1093/bioinformatics/btz702.
7
Regulatory annotation of genomic intervals based on tissue-specific expression QTLs.基于组织特异性表达 QTL 的基因组区间调控注释。
Bioinformatics. 2020 Feb 1;36(3):690-697. doi: 10.1093/bioinformatics/btz669.
8
Quantifying functional impact of non-coding variants with multi-task Bayesian neural network.使用多任务贝叶斯神经网络量化非编码变异的功能影响。
Bioinformatics. 2020 Mar 1;36(5):1397-1404. doi: 10.1093/bioinformatics/btz767.
9
GREGOR: evaluating global enrichment of trait-associated variants in epigenomic features using a systematic, data-driven approach.格雷戈尔:使用系统的数据驱动方法评估表观基因组特征中与性状相关变体的全局富集情况。
Bioinformatics. 2015 Aug 15;31(16):2601-6. doi: 10.1093/bioinformatics/btv201. Epub 2015 Apr 16.
10
IMPACT: Genomic Annotation of Cell-State-Specific Regulatory Elements Inferred from the Epigenome of Bound Transcription Factors.影响:从结合转录因子的表观基因组推断细胞状态特异性调控元件的基因组注释。
Am J Hum Genet. 2019 May 2;104(5):879-895. doi: 10.1016/j.ajhg.2019.03.012. Epub 2019 Apr 18.

引用本文的文献

1
Deep5hmC: predicting genome-wide 5-hydroxymethylcytosine landscape via a multimodal deep learning model.Deep5hmC:通过多模态深度学习模型预测全基因组 5-羟甲基胞嘧啶景观。
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae528.
2
In silico generation and augmentation of regulatory variants from massively parallel reporter assay using conditional variational autoencoder.利用条件变分自编码器通过大规模平行报告基因检测在计算机上生成和增强调控变异体。
bioRxiv. 2024 Jun 29:2024.06.25.600715. doi: 10.1101/2024.06.25.600715.
3
Deep5hmC: Predicting genome-wide 5-Hydroxymethylcytosine landscape via a multimodal deep learning model.

本文引用的文献

1
DeepPHiC: predicting promoter-centered chromatin interactions using a novel deep learning approach.DeepPHiC:使用新型深度学习方法预测以启动子为中心的染色质相互作用。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac801.
2
FAVOR: functional annotation of variants online resource and annotator for variation across the human genome.FAVOR:在线变体功能注释资源和人类基因组变异注释器。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1300-D1311. doi: 10.1093/nar/gkac966.
3
A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies.
深度5-羟甲基胞嘧啶:通过多模态深度学习模型预测全基因组5-羟甲基胞嘧啶图谱
bioRxiv. 2024 Mar 6:2024.03.04.583444. doi: 10.1101/2024.03.04.583444.
4
Design and deep learning of synthetic B-cell-specific promoters.合成 B 细胞特异性启动子的设计与深度学习。
Nucleic Acids Res. 2023 Nov 27;51(21):11967-11979. doi: 10.1093/nar/gkad930.
一种用于检测大规模全基因组测序研究中非编码稀有变异关联的框架。
Nat Methods. 2022 Dec;19(12):1599-1611. doi: 10.1038/s41592-022-01640-x. Epub 2022 Oct 27.
4
DeepPerVar: a multi-modal deep learning framework for functional interpretation of genetic variants in personal genome.DeepPerVar:一个多模态深度学习框架,用于个人基因组中遗传变异的功能解释。
Bioinformatics. 2022 Dec 13;38(24):5340-5351. doi: 10.1093/bioinformatics/btac696.
5
STAAR workflow: a cloud-based workflow for scalable and reproducible rare variant analysis.STAAR工作流程:一种用于可扩展和可重复的罕见变异分析的基于云的工作流程。
Bioinformatics. 2022 May 26;38(11):3116-3117. doi: 10.1093/bioinformatics/btac272.
6
Exploiting deep transfer learning for the prediction of functional non-coding variants using genomic sequence.利用深度迁移学习,通过基因组序列预测功能非编码变异。
Bioinformatics. 2022 Jun 13;38(12):3164-3172. doi: 10.1093/bioinformatics/btac214.
7
A multi-dimensional integrative scoring framework for predicting functional variants in the human genome.多维综合评分框架,用于预测人类基因组中的功能变体。
Am J Hum Genet. 2022 Mar 3;109(3):446-456. doi: 10.1016/j.ajhg.2022.01.017. Epub 2022 Feb 24.
8
WEVar: a novel statistical learning framework for predicting noncoding regulatory variants.WEVar:一种用于预测非编码调控变异的新型统计学习框架。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab189.
9
Prioritizing non-coding regions based on human genomic constraint and sequence context with deep learning.基于深度学习的人类基因组约束和序列上下文对非编码区域进行优先级排序。
Nat Commun. 2021 Mar 8;12(1):1504. doi: 10.1038/s41467-021-21790-4.
10
Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.美国国立卫生研究院生物医学高级研究与发展局(NHLBI)TOPMed 项目中对 53831 个不同基因组进行测序。
Nature. 2021 Feb;590(7845):290-299. doi: 10.1038/s41586-021-03205-y. Epub 2021 Feb 10.