• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Tensor decomposition based feature extraction and classification to detect natural selection from genomic data.基于张量分解的特征提取与分类,以从基因组数据中检测自然选择。
bioRxiv. 2023 Mar 29:2023.03.27.527731. doi: 10.1101/2023.03.27.527731.
2
Tensor Decomposition-based Feature Extraction and Classification to Detect Natural Selection from Genomic Data.基于张量分解的特征提取与分类方法从基因组数据中检测自然选择。
Mol Biol Evol. 2023 Oct 4;40(10). doi: 10.1093/molbev/msad216.
3
Uncovering Footprints of Natural Selection Through Spectral Analysis of Genomic Summary Statistics.通过基因组汇总统计的光谱分析揭示自然选择的足迹。
Mol Biol Evol. 2023 Jul 5;40(7). doi: 10.1093/molbev/msad157.
4
Detecting Positive Selection in Populations Using Genetic Data.利用遗传数据检测群体中的正选择。
Methods Mol Biol. 2020;2090:87-123. doi: 10.1007/978-1-0716-0199-0_5.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Identification of natural selection in genomic data with deep convolutional neural network.利用深度卷积神经网络识别基因组数据中的自然选择
BioData Min. 2021 Dec 4;14(1):51. doi: 10.1186/s13040-021-00280-9.
7
A Likelihood Approach for Uncovering Selective Sweep Signatures from Haplotype Data.一种从单倍型数据中发现选择清除信号的似然方法。
Mol Biol Evol. 2020 Oct 1;37(10):3023-3046. doi: 10.1093/molbev/msaa115.
8
Soft shoulders ahead: spurious signatures of soft and partial selective sweeps result from linked hard sweeps.前方的软肩:软选择清除和部分选择清除的虚假信号源于连锁的硬选择清除。
Genetics. 2015 May;200(1):267-84. doi: 10.1534/genetics.115.174912. Epub 2015 Feb 25.
9
Detecting adaptive introgression in human evolution using convolutional neural networks.使用卷积神经网络检测人类进化中的适应性基因渗入。
Elife. 2021 May 25;10:e64669. doi: 10.7554/eLife.64669.
10
Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data.使用微阵列基因表达数据的用于疾病分类的核嵌入高斯过程。
BMC Bioinformatics. 2007 Feb 28;8:67. doi: 10.1186/1471-2105-8-67.

基于张量分解的特征提取与分类,以从基因组数据中检测自然选择。

Tensor decomposition based feature extraction and classification to detect natural selection from genomic data.

作者信息

Amin Md Ruhul, Hasan Mahmudul, Arnab Sandipan Paul, DeGiorgio Michael

出版信息

bioRxiv. 2023 Mar 29:2023.03.27.527731. doi: 10.1101/2023.03.27.527731.

DOI:10.1101/2023.03.27.527731
PMID:37034767
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10081272/
Abstract

Inferences of adaptive events are important for learning about traits, such as human digestion of lactose after infancy and the rapid spread of viral variants. Early efforts toward identifying footprints of natural selection from genomic data involved development of summary statistic and likelihood methods. However, such techniques are grounded in simple patterns or theoretical models that limit the complexity of settings they can explore. Due to the renaissance in artificial intelligence, machine learning methods have taken center stage in recent efforts to detect natural selection, with strategies such as convolutional neural networks applied to images of haplotypes. Yet, limitations of such techniques include estimation of large numbers of model parameters under non-convex settings and feature identification without regard to location within an image. An alternative approach is to use tensor decomposition to extract features from multidimensional data while preserving the latent structure of the data, and to feed these features to machine learning models. Here, we adopt this framework and present a novel approach termed , which extracts features from images of haplotypes across sampled individuals using tensor decomposition, and then makes predictions from these features using classical machine learning methods. As a proof of concept, we explore the performance of on simulated neutral and selective sweep scenarios and find that it has high power and accuracy to discriminate sweeps from neutrality, robustness to common technical hurdles, and easy visualization of feature importance. Therefore, is a powerful addition to the toolkit for detecting adaptive processes from genomic data.

摘要

对适应性事件的推断对于了解各种性状非常重要,比如婴儿期后人类对乳糖的消化以及病毒变体的快速传播。早期从基因组数据中识别自然选择印记的努力涉及总结统计方法和似然方法的开发。然而,这些技术基于简单模式或理论模型,限制了它们所能探索的设置的复杂性。由于人工智能的复兴,机器学习方法在最近检测自然选择的努力中占据了核心地位,诸如卷积神经网络等策略被应用于单倍型图像。然而,此类技术的局限性包括在非凸设置下估计大量模型参数以及在不考虑图像内位置的情况下进行特征识别。一种替代方法是使用张量分解从多维数据中提取特征,同时保留数据的潜在结构,并将这些特征输入机器学习模型。在此,我们采用这一框架并提出一种名为 的新方法,该方法使用张量分解从跨样本个体的单倍型图像中提取特征,然后使用经典机器学习方法根据这些特征进行预测。作为概念验证,我们在模拟的中性和选择扫荡场景中探索了 的性能,发现它在区分扫荡与中性方面具有高功效和准确性,对常见技术障碍具有鲁棒性,并且特征重要性易于可视化。因此, 是从基因组数据中检测适应性过程的工具包中的一项强大补充。