• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度神经网络区分近期平衡选择和不完全清除。

Distinguishing between recent balancing selection and incomplete sweep using deep neural networks.

机构信息

Department of Biological Sciences, Middle East Technical University, Ankara, Turkey.

Laboratory of Medical Genetics, Department of Biomedical Sciences and Human Oncology, Università degli Studi di Bari Aldo Moro, Bari, Italy.

出版信息

Mol Ecol Resour. 2021 Nov;21(8):2706-2718. doi: 10.1111/1755-0998.13379. Epub 2021 Apr 5.

DOI:10.1111/1755-0998.13379
PMID:33749134
Abstract

Balancing selection is an important adaptive mechanism underpinning a wide range of phenotypes. Despite its relevance, the detection of recent balancing selection from genomic data is challenging as its signatures are qualitatively similar to those left by ongoing positive selection. In this study, we developed and implemented two deep neural networks and tested their performance to predict loci under recent selection, either due to balancing selection or incomplete sweep, from population genomic data. Specifically, we generated forward-in-time simulations to train and test an artificial neural network (ANN) and a convolutional neural network (CNN). ANN received as input multiple summary statistics calculated on the locus of interest, while CNN was applied directly on the matrix of haplotypes. We found that both architectures have high accuracy to identify loci under recent selection. CNN generally outperformed ANN to distinguish between signals of balancing selection and incomplete sweep and was less affected by incorrect training data. We deployed both trained networks on neutral genomic regions in European populations and demonstrated a lower false-positive rate for CNN than ANN. We finally deployed CNN within the MEFV gene region and identified several common variants predicted to be under incomplete sweep in a European population. Notably, two of these variants are functional changes and could modulate susceptibility to familial Mediterranean fever, possibly as a consequence of past adaptation to pathogens. In conclusion, deep neural networks were able to characterize signals of selection on intermediate frequency variants, an analysis currently inaccessible by commonly used strategies.

摘要

平衡选择是支撑广泛表型的一个重要适应机制。尽管其相关性很重要,但从基因组数据中检测到最近的平衡选择是具有挑战性的,因为其特征与正在进行的正选择留下的特征本质上是相似的。在这项研究中,我们开发并实施了两个深度神经网络,并测试了它们从群体基因组数据中预测由于平衡选择或不完全清除而导致的近期选择下的基因座的性能。具体来说,我们进行了向前时间模拟,以训练和测试人工神经网络 (ANN) 和卷积神经网络 (CNN)。ANN 接收作为输入的是在感兴趣的基因座上计算的多个汇总统计数据,而 CNN 则直接应用于单倍型矩阵。我们发现,这两种架构都具有很高的准确性,可以识别最近选择下的基因座。CNN 通常比 ANN 更能区分平衡选择和不完全清除的信号,并且受错误训练数据的影响较小。我们将这两个经过训练的网络部署在欧洲人群的中性基因组区域,并证明 CNN 的假阳性率低于 ANN。我们最后在 MEFV 基因区域内部署了 CNN,并在欧洲人群中鉴定出了几个预测为不完全清除的常见变体。值得注意的是,其中两个变体是功能变化,可能是过去适应病原体的结果,从而可能调节家族性地中海热的易感性。总之,深度神经网络能够对中频变异的选择信号进行特征化,这是目前常用策略无法进行的分析。

相似文献

1
Distinguishing between recent balancing selection and incomplete sweep using deep neural networks.利用深度神经网络区分近期平衡选择和不完全清除。
Mol Ecol Resour. 2021 Nov;21(8):2706-2718. doi: 10.1111/1755-0998.13379. Epub 2021 Apr 5.
2
ImaGene: a convolutional neural network to quantify natural selection from genomic data.ImaGene:一种从基因组数据中定量自然选择的卷积神经网络。
BMC Bioinformatics. 2019 Nov 22;20(Suppl 9):337. doi: 10.1186/s12859-019-2927-x.
3
Uncovering Footprints of Natural Selection Through Spectral Analysis of Genomic Summary Statistics.通过基因组汇总统计的光谱分析揭示自然选择的足迹。
Mol Biol Evol. 2023 Jul 5;40(7). doi: 10.1093/molbev/msad157.
4
CNN-MGP: Convolutional Neural Networks for Metagenomics Gene Prediction.CNN-MGP:用于宏基因组基因预测的卷积神经网络。
Interdiscip Sci. 2019 Dec;11(4):628-635. doi: 10.1007/s12539-018-0313-4. Epub 2018 Dec 27.
5
White blood cells detection and classification based on regional convolutional neural networks.基于区域卷积神经网络的白细胞检测与分类。
Med Hypotheses. 2020 Feb;135:109472. doi: 10.1016/j.mehy.2019.109472. Epub 2019 Nov 4.
6
A Composite-Likelihood Method for Detecting Incomplete Selective Sweep from Population Genomic Data.一种用于从群体基因组数据中检测不完全选择性清除的复合似然方法。
Genetics. 2015 Jun;200(2):633-49. doi: 10.1534/genetics.115.175380. Epub 2015 Apr 24.
7
Detecting adaptive introgression in human evolution using convolutional neural networks.使用卷积神经网络检测人类进化中的适应性基因渗入。
Elife. 2021 May 25;10:e64669. doi: 10.7554/eLife.64669.
8
Genome-wide scans for selective sweeps using convolutional neural networks.基于卷积神经网络的全基因组选择清除扫描。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i194-i203. doi: 10.1093/bioinformatics/btad265.
9
A deep dive into understanding tumor foci classification using multiparametric MRI based on convolutional neural network.基于卷积神经网络,深入探究利用多参数磁共振成像进行肿瘤病灶分类。
Med Phys. 2020 Sep;47(9):4077-4086. doi: 10.1002/mp.14255. Epub 2020 Jun 12.
10
A population genetics study of the familial Mediterranean fever gene: evidence of balancing selection under an overdominance regime.家族性地中海热基因的群体遗传学研究:超显性遗传模式下平衡选择的证据。
Genes Immun. 2009 Dec;10(8):678-86. doi: 10.1038/gene.2009.59. Epub 2009 Aug 13.

引用本文的文献

1
Evolution of tandem repeats in putative circumsporozoite protein to enhance its function: A recent and exclusive event in in India.假定环子孢子蛋白中串联重复序列的进化以增强其功能:印度近期发生的独特事件。
Curr Res Microb Sci. 2025 Jun 27;9:100432. doi: 10.1016/j.crmicr.2025.100432. eCollection 2025.
2
Genomic Anomaly Detection with Functional Data Analysis.基于功能数据分析的基因组异常检测
Genes (Basel). 2025 Jun 15;16(6):710. doi: 10.3390/genes16060710.
3
Integrating Deep Learning Models with Genome-Wide Association Study-Based Identification Enhanced Phenotype Predictions in Group A .
将深度学习模型与基于全基因组关联研究的识别相结合可增强A组中的表型预测。
J Microbiol Biotechnol. 2025 Mar 26;35:e2411010. doi: 10.4014/jmb.2411.11010.
4
Digital Image Processing to Detect Adaptive Evolution.用于检测适应性进化的数字图像处理
Mol Biol Evol. 2024 Dec 6;41(12). doi: 10.1093/molbev/msae242.
5
Reliability of a generative artificial intelligence tool for pediatric familial Mediterranean fever: insights from a multicentre expert survey.生成式人工智能工具在儿科家族性地中海热中的可靠性:来自多中心专家调查的见解。
Pediatr Rheumatol Online J. 2024 Aug 23;22(1):78. doi: 10.1186/s12969-024-01011-0.
6
Polymorphism-Aware Models in RevBayes: Species Trees, Disentangling Balancing Selection, and GC-Biased Gene Conversion.RevBayes 中的多态性感知模型:种系树、平衡选择的分解以及 GC 偏向性基因转换。
Mol Biol Evol. 2024 Jul 3;41(7). doi: 10.1093/molbev/msae138.
7
Temporal challenges in detecting balancing selection from population genomic data.从群体基因组数据中检测平衡选择的时间挑战。
G3 (Bethesda). 2024 Jun 5;14(6). doi: 10.1093/g3journal/jkae069.
8
Interpreting generative adversarial networks to infer natural selection from genetic data.从遗传数据推断自然选择的生成对抗网络解释。
Genetics. 2024 Apr 3;226(4). doi: 10.1093/genetics/iyae024.
9
The Impact of Patterns in Linkage Disequilibrium and Sequencing Quality on the Imprint of Balancing Selection.连锁不平衡和测序质量模式对平衡选择印迹的影响。
Genome Biol Evol. 2024 Feb 1;16(2). doi: 10.1093/gbe/evae009.
10
Population genetic simulation: Benchmarking frameworks for non-standard models of natural selection.群体遗传模拟:非标准自然选择模型的基准框架。
Mol Ecol Resour. 2024 Apr;24(3):e13930. doi: 10.1111/1755-0998.13930. Epub 2024 Jan 21.