• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对软性选择清除的无根据热情 III:并非监督机器学习算法。

On the Unfounded Enthusiasm for Soft Selective Sweeps III: The Supervised Machine Learning Algorithm That Isn't.

机构信息

Department of Biology, Lund University, Sölvegatan 35, 22362 Lund, Sweden.

Department of Biology & Biochemistry, University of Houston, Science & Research Building 2, Suite #342, 3455 Cullen Bldv., Houston, TX 77204-5001, USA.

出版信息

Genes (Basel). 2021 Apr 5;12(4):527. doi: 10.3390/genes12040527.

DOI:10.3390/genes12040527
PMID:33916341
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8066263/
Abstract

In the last 15 years or so, soft selective sweep mechanisms have been catapulted from a curiosity of little evolutionary importance to a ubiquitous mechanism claimed to explain most adaptive evolution and, in some cases, most evolution. This transformation was aided by a series of articles by Daniel Schrider and Andrew Kern. Within this series, a paper entitled "Soft sweeps are the dominant mode of adaptation in the human genome" (Schrider and Kern, . , (8), 1863-1877) attracted a great deal of attention, in particular in conjunction with another paper (Kern and Hahn, . , (6), 1366-1371), for purporting to discredit the Neutral Theory of Molecular Evolution (Kimura 1968). Here, we address an alleged novelty in Schrider and Kern's paper, i.e., the claim that their study involved an artificial intelligence technique called supervised machine learning (SML). SML is predicated upon the existence of a training dataset in which the correspondence between the input and output is known empirically to be true. Curiously, Schrider and Kern did not possess a training dataset of genomic segments known to have evolved either neutrally or through soft or hard selective sweeps. Thus, their claim of using SML is thoroughly and utterly misleading. In the absence of legitimate training datasets, Schrider and Kern used: (1) simulations that employ many manipulatable variables and (2) a system of data cherry-picking rivaling the worst excesses in the literature. These two factors, in addition to the lack of negative controls and the irreproducibility of their results due to incomplete methodological detail, lead us to conclude that all evolutionary inferences derived from so-called SML algorithms (e.g., S/HIC) should be taken with a huge shovel of salt.

摘要

在过去的 15 年左右的时间里,软选择清除机制已经从一个几乎没有进化意义的好奇心,一跃成为一种无处不在的机制,据称这种机制可以解释大多数适应性进化,甚至在某些情况下可以解释大多数进化。这种转变得益于一系列由 Daniel Schrider 和 Andrew Kern 撰写的文章。在这一系列文章中,一篇题为“软清除是人类基因组中适应的主要模式”(Schrider 和 Kern,.,(8),1863-1877)的论文引起了极大的关注,尤其是与另一篇论文(Kern 和 Hahn,.,(6),1366-1371)结合起来看,因为这两篇论文声称否定了分子进化的中性理论(Kimura 1968)。在这里,我们讨论了 Schrider 和 Kern 的论文中据称的新颖之处,即他们的研究涉及一种称为监督机器学习(SML)的人工智能技术。SML 基于存在一个训练数据集,其中输入和输出之间的对应关系在经验上被证明是真实的。奇怪的是,Schrider 和 Kern 没有拥有一个已知经历过中性进化或软选择性清除或硬选择性清除的基因组片段的训练数据集。因此,他们声称使用 SML 是完全误导性的。在没有合法训练数据集的情况下,Schrider 和 Kern 使用了:(1)使用许多可操纵变量的模拟,以及(2)与文献中最严重的过度数据选择系统。这两个因素,加上缺乏负对照组以及由于方法学细节不完整而导致结果不可重现,使我们得出结论,所有从所谓的 SML 算法(例如 S/HIC)得出的进化推断都应该谨慎对待。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c27/8066263/0170b9677364/genes-12-00527-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c27/8066263/702c728959f7/genes-12-00527-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c27/8066263/0170b9677364/genes-12-00527-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c27/8066263/702c728959f7/genes-12-00527-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c27/8066263/0170b9677364/genes-12-00527-g002.jpg

相似文献

1
On the Unfounded Enthusiasm for Soft Selective Sweeps III: The Supervised Machine Learning Algorithm That Isn't.对软性选择清除的无根据热情 III:并非监督机器学习算法。
Genes (Basel). 2021 Apr 5;12(4):527. doi: 10.3390/genes12040527.
2
S/HIC: Robust Identification of Soft and Hard Sweeps Using Machine Learning.S/HIC:使用机器学习对软硬选择进行稳健识别
PLoS Genet. 2016 Mar 15;12(3):e1005928. doi: 10.1371/journal.pgen.1005928. eCollection 2016 Mar.
3
Soft Sweeps Are the Dominant Mode of Adaptation in the Human Genome.软清扫是人类基因组中适应性的主要模式。
Mol Biol Evol. 2017 Aug 1;34(8):1863-1877. doi: 10.1093/molbev/msx154.
4
Soft shoulders ahead: spurious signatures of soft and partial selective sweeps result from linked hard sweeps.前方的软肩:软选择清除和部分选择清除的虚假信号源于连锁的硬选择清除。
Genetics. 2015 May;200(1):267-84. doi: 10.1534/genetics.115.174912. Epub 2015 Feb 25.
5
Detecting Positive Selection in Populations Using Genetic Data.利用遗传数据检测群体中的正选择。
Methods Mol Biol. 2020;2090:87-123. doi: 10.1007/978-1-0716-0199-0_5.
6
Recent selective sweeps in North American Drosophila melanogaster show signatures of soft sweeps.北美黑腹果蝇最近的选择性清除显示出软清除的特征。
PLoS Genet. 2015 Feb 23;11(2):e1005004. doi: 10.1371/journal.pgen.1005004. eCollection 2015 Feb.
7
diploS/HIC: An Updated Approach to Classifying Selective Sweeps.diploS/HIC:一种用于分类选择性清除的更新方法。
G3 (Bethesda). 2018 May 31;8(6):1959-1970. doi: 10.1534/g3.118.200262.
8
Adaptation in structured populations and fuzzy boundaries between hard and soft sweeps.结构群体中的适应和硬、软选择之间的模糊边界。
PLoS Comput Biol. 2019 Nov 11;15(11):e1007426. doi: 10.1371/journal.pcbi.1007426. eCollection 2019 Nov.
9
Versatile Detection of Diverse Selective Sweeps with Flex-Sweep.利用 Flex-Sweep 实现多种选择清除的灵活检测。
Mol Biol Evol. 2023 Jun 1;40(6). doi: 10.1093/molbev/msad139.
10
Population genetic processes affecting the mode of selective sweeps and effective population size in influenza virus H3N2.影响流感病毒H3N2中选择性清除模式和有效种群大小的群体遗传过程。
BMC Evol Biol. 2016 Aug 3;16:156. doi: 10.1186/s12862-016-0727-8.

引用本文的文献

1
Not by Selection Alone: Expanding the Scope of Gene-Culture Coevolution.并非仅靠选择:拓展基因-文化共同进化的范畴
Evol Anthropol. 2025 Sep;34(3):e70007. doi: 10.1002/evan.70007.
2
Principal Component Analyses (PCA)-based findings in population genetic studies are highly biased and must be reevaluated.基于主成分分析(PCA)的群体遗传学研究结果存在高度偏差,必须重新评估。
Sci Rep. 2022 Aug 29;12(1):14683. doi: 10.1038/s41598-022-14395-4.

本文引用的文献

1
Discovery of Ongoing Selective Sweeps within Anopheles Mosquito Populations Using Deep Learning.利用深度学习发现疟蚊种群中的持续选择清除。
Mol Biol Evol. 2021 Mar 9;38(3):1168-1183. doi: 10.1093/molbev/msaa259.
2
A Comparative Study of Supervised Machine Learning Algorithms for the Prediction of Long-Range Chromatin Interactions.基于监督学习算法的长程染色质互作预测的比较研究
Genes (Basel). 2020 Aug 24;11(9):985. doi: 10.3390/genes11090985.
3
On the unfounded enthusiasm for soft selective sweeps II: Examining recent evidence from humans, flies, and viruses.
对软选择优势毫无根据的热情 II:审视来自人类、苍蝇和病毒的最新证据。
PLoS Genet. 2018 Dec 28;14(12):e1007859. doi: 10.1371/journal.pgen.1007859. eCollection 2018 Dec.
4
Detection and Classification of Hard and Soft Sweeps from Unphased Genotypes by Multilocus Genotype Identity.利用多位点基因型一致鉴定对未分相基因型的硬和软扫描的检测与分类
Genetics. 2018 Dec;210(4):1429-1452. doi: 10.1534/genetics.118.301502. Epub 2018 Oct 12.
5
Genenames.org: the HGNC and VGNC resources in 2019.Genenames.org:2019 年的 HGNC 和 VGNC 资源。
Nucleic Acids Res. 2019 Jan 8;47(D1):D786-D792. doi: 10.1093/nar/gky930.
6
Using Supervised Learning Methods for Gene Selection in RNA-Seq Case-Control Studies.在RNA测序病例对照研究中使用监督学习方法进行基因选择
Front Genet. 2018 Aug 3;9:297. doi: 10.3389/fgene.2018.00297. eCollection 2018.
7
A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots.关于如何不过度解读 STRUCTURE 和 ADMIXTURE 条形图的教程。
Nat Commun. 2018 Aug 14;9(1):3258. doi: 10.1038/s41467-018-05257-7.
8
Evolutionary genomic dynamics of Peruvians before, during, and after the Inca Empire.秘鲁人在印加帝国前后的进化基因组动态。
Proc Natl Acad Sci U S A. 2018 Jul 10;115(28):E6526-E6535. doi: 10.1073/pnas.1720798115. Epub 2018 Jun 26.
9
GeneMANIA update 2018.GeneMANIA 更新 2018.
Nucleic Acids Res. 2018 Jul 2;46(W1):W60-W64. doi: 10.1093/nar/gky311.
10
Adaptive Landscape of Protein Variation in Human Exomes.人类外显子组中蛋白质变异的适应景观。
Mol Biol Evol. 2018 Aug 1;35(8):2015-2025. doi: 10.1093/molbev/msy107.