• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于在 RADseq 研究中选择最佳聚类阈值的经验流水线。

An empirical pipeline for choosing the optimal clustering threshold in RADseq studies.

机构信息

Department of Ecology and Evolutionary Biology, La Kretz Center for California Conservation Science, and Institute of the Environment and Sustainability, University of California, Los Angeles, California.

Faculty of Arts and Science, Department of Biochemistry, Kütahya Dumlupınar University, Kutahya, Turkey.

出版信息

Mol Ecol Resour. 2019 Sep;19(5):1195-1204. doi: 10.1111/1755-0998.13029. Epub 2019 Jun 1.

DOI:10.1111/1755-0998.13029
PMID:31058458
Abstract

Genomic data are increasingly used for high resolution population genetic studies including those at the forefront of biological conservation. A key methodological challenge is determining sequence similarity clustering thresholds for RADseq data when no reference genome is available. These thresholds define the maximum permitted divergence among allelic variants and the minimum divergence among putative paralogues and are central to downstream population genomic analyses. Here we develop a novel set of metrics to determine sequence similarity thresholds that maximize the correct separation of paralogous regions and minimize oversplitting naturally occurring allelic variation within loci. These metrics empirically identify the threshold value at which true alleles at opposite ends of several major axes of genetic variation begin to incorrectly separate into distinct clusters, allowing researchers to choose thresholds just below this value. We test our approach on a recently published data set for the protected foothill yellow-legged frog (Rana boylii). The metrics recover a consistent pattern of roughly 96% similarity as a threshold above which genetic divergence and data missingness become increasingly correlated. We provide scripts for assessing different clustering thresholds and discuss how this approach can be applied across a wide range of empirical data sets.

摘要

基因组数据越来越多地用于高分辨率的群体遗传研究,包括处于生物学保护前沿的研究。当没有参考基因组时,确定 RADseq 数据的序列相似性聚类阈值是一个关键的方法学挑战。这些阈值定义了等位变异之间允许的最大差异以及假定的同源基因之间的最小差异,是下游群体基因组分析的核心。在这里,我们开发了一套新的指标来确定序列相似性阈值,这些阈值可以最大限度地正确分离同源区域,并最小化对自然发生的等位基因变异的过度分割。这些指标从经验上确定了在几个主要遗传变异轴的末端的真实等位基因开始错误地分为不同聚类的阈值值,允许研究人员选择略低于该值的阈值。我们在最近发表的保护黄腿蛙(Rana boylii)数据集上测试了我们的方法。这些指标恢复了一个一致的模式,即相似度约为 96%,超过这个阈值,遗传分化和数据缺失变得越来越相关。我们提供了评估不同聚类阈值的脚本,并讨论了如何将这种方法应用于广泛的经验数据集。

相似文献

1
An empirical pipeline for choosing the optimal clustering threshold in RADseq studies.一种用于在 RADseq 研究中选择最佳聚类阈值的经验流水线。
Mol Ecol Resour. 2019 Sep;19(5):1195-1204. doi: 10.1111/1755-0998.13029. Epub 2019 Jun 1.
2
Population genomic data reveal extreme geographic subdivision and novel conservation actions for the declining foothill yellow-legged frog.种群基因组数据揭示了衰落的丘陵黄腿蛙极端的地理分化和新的保护行动。
Heredity (Edinb). 2018 Aug;121(2):112-125. doi: 10.1038/s41437-018-0097-7. Epub 2018 Jun 25.
3
Breaking RAD: an evaluation of the utility of restriction site-associated DNA sequencing for genome scans of adaptation.打破常规:对限制性位点相关DNA测序在适应性基因组扫描中的效用评估
Mol Ecol Resour. 2017 Mar;17(2):142-152. doi: 10.1111/1755-0998.12635. Epub 2016 Dec 16.
4
How "simple" methodological decisions affect interpretation of population structure based on reduced representation library DNA sequencing: A case study using the lake whitefish.简单的方法学决策如何影响基于简化代表性文库 DNA 测序的群体结构解释:以湖白鲑为例的案例研究。
PLoS One. 2020 Jan 24;15(1):e0226608. doi: 10.1371/journal.pone.0226608. eCollection 2020.
5
PyRAD: assembly of de novo RADseq loci for phylogenetic analyses.PyRAD:用于系统发育分析的从头RADseq位点组装
Bioinformatics. 2014 Jul 1;30(13):1844-9. doi: 10.1093/bioinformatics/btu121. Epub 2014 Mar 5.
6
Molecular phylogenetics of western North American frogs of the Rana boylii species group.北美西部博伊德林蛙物种组蛙类的分子系统发育学
Mol Phylogenet Evol. 2001 Apr;19(1):131-43. doi: 10.1006/mpev.2000.0908.
7
Reference-based RADseq resolves robust relationships among closely related species of lichen-forming fungi using metagenomic DNA.基于参考的 RADseq 利用宏基因组 DNA 解决了亲缘关系密切的地衣真菌种间的可靠关系。
Sci Rep. 2017 Aug 29;7(1):9884. doi: 10.1038/s41598-017-09906-7.
8
RADProc: A computationally efficient de novo locus assembler for population studies using RADseq data.RADProc:一种基于 RADseq 数据的用于群体研究的计算高效从头基因座组装工具。
Mol Ecol Resour. 2019 Jan;19(1):272-282. doi: 10.1111/1755-0998.12954. Epub 2018 Dec 21.
9
Trade-offs and utility of alternative RADseq methods: reply to Puritz et al.替代RADseq方法的权衡与效用:对Puritz等人的回应
Mol Ecol. 2014 Dec;23(24):5943-6. doi: 10.1111/mec.12964.
10
Identifying the number of population clusters with structure: problems and solutions.识别具有结构的群体数量:问题与解决方案。
Mol Ecol Resour. 2016 May;16(3):601-3. doi: 10.1111/1755-0998.12521.

引用本文的文献

1
Speciation in the Peninsular Indian Flying Lizard (Draco dussumieri) Follows Climatic Transition and Not Physical Barriers.印度半岛飞蜥(Draco dussumieri)的物种形成遵循气候转变而非地理屏障。
Mol Ecol. 2025 Jun;34(12):e17800. doi: 10.1111/mec.17800. Epub 2025 May 20.
2
Secondary contact erodes Pleistocene diversification in a wide-ranging freshwater mussel (Quadrula).二次接触侵蚀了一种分布广泛的淡水贻贝(Quadrula)在更新世的多样化。
Mol Ecol. 2025 Jan;34(1):e17572. doi: 10.1111/mec.17572. Epub 2024 Nov 14.
3
Conservation genomics of the wild pumpkin Cucurbita radicans in Central Mexico: The influence of a changing environment on the genetic diversity and differentiation of a rare species.
中美洲野生南瓜 Cucurbita radicans 的保护基因组学:环境变化对稀有物种遗传多样性和分化的影响。
J Plant Res. 2024 Sep;137(5):799-813. doi: 10.1007/s10265-024-01552-1. Epub 2024 Jul 8.
4
Recent genetic, phenetic and ecological divergence across the Mesoamerican highlands: a study case with (Aves: Thraupidae).中美洲高地的遗传、表型和生态分化:以 (鸟纲:唐纳雀科)为例。
PeerJ. 2024 Mar 22;12:e16797. doi: 10.7717/peerj.16797. eCollection 2024.
5
Using genome-wide data to ascertain taxonomic status and assess population genetic structure for Houston toads (Bufo [= Anaxyrus] houstonensis).利用全基因组数据确定休斯顿蟾蜍(Bufo [= Anaxyrus] houstonensis)的分类地位并评估其种群遗传结构。
Sci Rep. 2024 Feb 8;14(1):3306. doi: 10.1038/s41598-024-53705-w.
6
Biogeographic factors contributing to the diversification of Euphoniinae (Aves, Passeriformes, Fringillidae): a phylogenetic and ancestral areas analysis.促成歌雀亚科(鸟类,雀形目,燕雀科)多样化的生物地理因素:系统发育和祖先分布区分析
Zookeys. 2024 Jan 8;1188:169-195. doi: 10.3897/zookeys.1188.107047. eCollection 2024.
7
2b or not 2b? 2bRAD is an effective alternative to ddRAD for phylogenomics.是2b还是非2b?对于系统发育基因组学而言,2bRAD是双酶切RAD(ddRAD)的一种有效替代方法。
Ecol Evol. 2023 Mar 8;13(3):e9842. doi: 10.1002/ece3.9842. eCollection 2023 Mar.
8
Evolution of a hybrid zone of two willow species ( L.) in the European Alps analyzed by RAD-seq and morphometrics.通过RAD测序和形态测量学分析欧洲阿尔卑斯山两种柳树(柳属)杂交区的演化
Ecol Evol. 2023 Jan 4;13(1):e9700. doi: 10.1002/ece3.9700. eCollection 2023 Jan.
9
Strategies for Intraspecific Genotyping of Duckweed: Comparison of Five Orthogonal Methods Applied to the Giant Duckweed .浮萍种内基因分型策略:应用于大浮萍的五种正交方法比较
Plants (Basel). 2022 Nov 9;11(22):3033. doi: 10.3390/plants11223033.
10
Geographical Patterns of Genetic Variation in Locoto Chile () in the Americas Inferred by Genome-Wide Data Analysis.通过全基因组数据分析推断美洲智利洛科托()的遗传变异地理模式。
Plants (Basel). 2022 Oct 29;11(21):2911. doi: 10.3390/plants11212911.