• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Exploiting deep transfer learning for the prediction of functional non-coding variants using genomic sequence.利用深度迁移学习,通过基因组序列预测功能非编码变异。
Bioinformatics. 2022 Jun 13;38(12):3164-3172. doi: 10.1093/bioinformatics/btac214.
2
DeepPerVar: a multi-modal deep learning framework for functional interpretation of genetic variants in personal genome.DeepPerVar:一个多模态深度学习框架,用于个人基因组中遗传变异的功能解释。
Bioinformatics. 2022 Dec 13;38(24):5340-5351. doi: 10.1093/bioinformatics/btac696.
3
TSVM: Transfer Support Vector Machine for Predicting MPRA Validated Regulatory Variants.TSVM:用于预测 MPRA 验证调控变异的转移支持向量机。
IEEE/ACM Trans Comput Biol Bioinform. 2024 May-Jun;21(3):472-479. doi: 10.1109/TCBB.2024.3374413. Epub 2024 Jun 5.
4
TVAR: assessing tissue-specific functional effects of non-coding variants with deep learning.TVAR:利用深度学习评估非编码变异的组织特异性功能效应。
Bioinformatics. 2022 Oct 14;38(20):4697-4704. doi: 10.1093/bioinformatics/btac608.
5
transferGWAS: GWAS of images using deep transfer learning.基于深度迁移学习的影像 GWAS。
Bioinformatics. 2022 Jul 11;38(14):3621-3628. doi: 10.1093/bioinformatics/btac369.
6
Chromatin accessibility prediction via a hybrid deep convolutional neural network.基于混合深度卷积神经网络的染色质可及性预测。
Bioinformatics. 2018 Mar 1;34(5):732-738. doi: 10.1093/bioinformatics/btx679.
7
Quantifying functional impact of non-coding variants with multi-task Bayesian neural network.使用多任务贝叶斯神经网络量化非编码变异的功能影响。
Bioinformatics. 2020 Mar 1;36(5):1397-1404. doi: 10.1093/bioinformatics/btz767.
8
Combining artificial intelligence: deep learning with Hi-C data to predict the functional effects of non-coding variants.将人工智能:深度学习与 Hi-C 数据相结合,以预测非编码变异的功能效应。
Bioinformatics. 2021 Jun 16;37(10):1339-1344. doi: 10.1093/bioinformatics/btaa970.
9
Using the structure of genome data in the design of deep neural networks for predicting amyotrophic lateral sclerosis from genotype.利用基因组数据结构设计深度神经网络,从基因型预测肌萎缩侧索硬化症。
Bioinformatics. 2019 Jul 15;35(14):i538-i547. doi: 10.1093/bioinformatics/btz369.
10
parSMURF, a high-performance computing tool for the genome-wide detection of pathogenic variants.parSMURF,一种用于全基因组致病性变异检测的高性能计算工具。
Gigascience. 2020 May 1;9(5). doi: 10.1093/gigascience/giaa052.

引用本文的文献

1
RiceSNP-ABST: a deep learning approach to identify abiotic stress-associated single nucleotide polymorphisms in rice.水稻SNP-ABST:一种用于识别水稻中非生物胁迫相关单核苷酸多态性的深度学习方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae702.
2
Opportunities, challenges and future perspectives of using bioinformatics and artificial intelligence techniques on tropical disease identification using omics data.利用生物信息学和人工智能技术通过组学数据进行热带疾病识别的机遇、挑战及未来展望。
Front Digit Health. 2024 Nov 25;6:1471200. doi: 10.3389/fdgth.2024.1471200. eCollection 2024.
3
RiceSNP-BST: a deep learning framework for predicting biotic stress-associated SNPs in rice.RiceSNP-BST:一种用于预测水稻中与生物胁迫相关的 SNP 的深度学习框架。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae599.
4
In silico generation and augmentation of regulatory variants from massively parallel reporter assay using conditional variational autoencoder.利用条件变分自编码器通过大规模平行报告基因检测在计算机上生成和增强调控变异体。
bioRxiv. 2024 Jun 29:2024.06.25.600715. doi: 10.1101/2024.06.25.600715.
5
TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.TrG2P:一种基于迁移学习的工具,集成多性状数据,用于准确预测作物产量。
Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.
6
A statistical approach for identifying single nucleotide variants that affect transcription factor binding.一种用于识别影响转录因子结合的单核苷酸变异的统计方法。
iScience. 2024 Apr 18;27(5):109765. doi: 10.1016/j.isci.2024.109765. eCollection 2024 May 17.
7
CVD-associated SNPs with regulatory potential reveal novel non-coding disease genes.与心血管疾病相关的具有调节潜力的单核苷酸多态性揭示了新的非编码疾病基因。
Hum Genomics. 2023 Jul 25;17(1):69. doi: 10.1186/s40246-023-00513-4.
8
Harnessing the potential of machine learning and artificial intelligence for dementia research.利用机器学习和人工智能在痴呆症研究中的潜力。
Brain Inform. 2023 Feb 24;10(1):6. doi: 10.1186/s40708-022-00183-3.
9
Multi-task deep autoencoder to predict Alzheimer's disease progression using temporal DNA methylation data in peripheral blood.使用外周血中的时间性DNA甲基化数据的多任务深度自动编码器来预测阿尔茨海默病的进展。
Comput Struct Biotechnol J. 2022 Oct 23;20:5761-5774. doi: 10.1016/j.csbj.2022.10.016. eCollection 2022.
10
TIVAN-indel: a computational framework for annotating and predicting non-coding regulatory small insertions and deletions.TIVAN-indel:一种注释和预测非编码调控小插入和缺失的计算框架。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad060.

本文引用的文献

1
XGBoost based machine learning approach to predict the risk of fall in older adults using gait outcomes.基于 XGBoost 的机器学习方法,利用步态结果预测老年人跌倒风险。
Sci Rep. 2021 Jun 9;11(1):12183. doi: 10.1038/s41598-021-91797-w.
2
Transfer learning-trained convolutional neural networks identify novel MRI biomarkers of Alzheimer's disease progression.迁移学习训练的卷积神经网络识别出阿尔茨海默病进展的新型磁共振成像生物标志物。
Alzheimers Dement (Amst). 2021 May 14;13(1):e12140. doi: 10.1002/dad2.12140. eCollection 2021.
3
WEVar: a novel statistical learning framework for predicting noncoding regulatory variants.WEVar:一种用于预测非编码调控变异的新型统计学习框架。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab189.
4
Transfer learning enables prediction of CYP2D6 haplotype function.迁移学习可用于预测 CYP2D6 单倍型功能。
PLoS Comput Biol. 2020 Nov 2;16(11):e1008399. doi: 10.1371/journal.pcbi.1008399. eCollection 2020 Nov.
5
Imputing missing RNA-sequencing data from DNA methylation by using a transfer learning-based neural network.基于迁移学习的神经网络对 RNA 测序缺失数据进行推断。
Gigascience. 2020 Jul 1;9(7). doi: 10.1093/gigascience/giaa076.
6
The Human Gene Mutation Database (HGMD): optimizing its use in a clinical diagnostic or research setting.人类基因突变数据库(HGMD):优化其在临床诊断或研究环境中的使用。
Hum Genet. 2020 Oct;139(10):1197-1207. doi: 10.1007/s00439-020-02199-3. Epub 2020 Jun 28.
7
Exploring human genomic diversity with gnomAD.利用gnomAD探索人类基因组多样性。
Nat Rev Genet. 2020 Aug;21(8):448. doi: 10.1038/s41576-020-0255-7.
8
Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets.矩阵分解和迁移学习揭示了多个单细胞ATAC测序数据集之间的调控生物学。
Nucleic Acids Res. 2020 Jul 9;48(12):e68. doi: 10.1093/nar/gkaa349.
9
CRISPR/Cas9-Mediated TERT Disruption in Cancer Cells.CRISPR/Cas9 介导的癌细胞中端粒酶逆转录酶基因敲除
Int J Mol Sci. 2020 Jan 19;21(2):653. doi: 10.3390/ijms21020653.
10
Ranking of non-coding pathogenic variants and putative essential regions of the human genome.人类基因组中非编码致病性变异体和推定必需区域的排名。
Nat Commun. 2019 Nov 20;10(1):5241. doi: 10.1038/s41467-019-13212-3.

利用深度迁移学习,通过基因组序列预测功能非编码变异。

Exploiting deep transfer learning for the prediction of functional non-coding variants using genomic sequence.

机构信息

Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN 46202, USA.

Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN 46202, USA.

出版信息

Bioinformatics. 2022 Jun 13;38(12):3164-3172. doi: 10.1093/bioinformatics/btac214.

DOI:10.1093/bioinformatics/btac214
PMID:35389435
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9890318/
Abstract

MOTIVATION

Though genome-wide association studies have identified tens of thousands of variants associated with complex traits and most of them fall within the non-coding regions, they may not be the causal ones. The development of high-throughput functional assays leads to the discovery of experimental validated non-coding functional variants. However, these validated variants are rare due to technical difficulty and financial cost. The small sample size of validated variants makes it less reliable to develop a supervised machine learning model for achieving a whole genome-wide prediction of non-coding causal variants.

RESULTS

We will exploit a deep transfer learning model, which is based on convolutional neural network, to improve the prediction for functional non-coding variants (NCVs). To address the challenge of small sample size, the transfer learning model leverages both large-scale generic functional NCVs to improve the learning of low-level features and context-specific functional NCVs to learn high-level features toward the context-specific prediction task. By evaluating the deep transfer learning model on three MPRA datasets and 16 GWAS datasets, we demonstrate that the proposed model outperforms deep learning models without pretraining or retraining. In addition, the deep transfer learning model outperforms 18 existing computational methods in both MPRA and GWAS datasets.

AVAILABILITY AND IMPLEMENTATION

https://github.com/lichen-lab/TLVar.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

尽管全基因组关联研究已经确定了成千上万种与复杂性状相关的变体,其中大多数位于非编码区域,但它们可能不是因果变体。高通量功能检测方法的发展导致了实验验证的非编码功能变体的发现。然而,由于技术难度和资金成本,这些经过验证的变体很少。验证变体的样本量小,使得开发用于对非编码因果变体进行全基因组预测的监督机器学习模型变得不太可靠。

结果

我们将利用基于卷积神经网络的深度迁移学习模型来改进对功能非编码变体(NCV)的预测。为了解决样本量小的问题,迁移学习模型利用大规模通用功能 NCV 来改进低层次特征的学习,并利用特定于上下文的功能 NCV 来学习高层次特征,以实现特定于上下文的预测任务。通过在三个 MPRA 数据集和 16 个 GWAS 数据集上评估深度迁移学习模型,我们证明了所提出的模型优于没有预训练或再训练的深度学习模型。此外,深度迁移学习模型在 MPRA 和 GWAS 数据集上均优于 18 种现有的计算方法。

可用性和实现

https://github.com/lichen-lab/TLVar。

补充信息

补充数据可在生物信息学在线获得。