• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在限制位点相关 DNA 系统基因组学中的信息缺失模式,以及与一个富含物种的鳞翅目昆虫属的多位点 Sanger 数据的比较。

Information Dropout Patterns in Restriction Site Associated DNA Phylogenomics and a Comparison with Multilocus Sanger Data in a Species-Rich Moth Genus.

机构信息

Department of Ecology and Genetics, University of Oulu, Pentti Kaiteran katu 1, FI-90014, Oulu, Finland.

Department of Zoology, Institute of Ecology and Earth Sciences, University of Tartu, Vanemuise 46, EE-51014 Tartu, Estonia.

出版信息

Syst Biol. 2018 Nov 1;67(6):925-939. doi: 10.1093/sysbio/syy029.

DOI:10.1093/sysbio/syy029
PMID:29669013
Abstract

A rapid shift from traditional Sanger sequencing-based molecular methods to the phylogenomic approach with large numbers of loci is underway. Among phylogenomic methods, restriction site associated DNA (RAD) sequencing approaches have gained much attention as they enable rapid generation of up to thousands of loci randomly scattered across the genome and are suitable for nonmodel species. RAD data sets however suffer from large amounts of missing data and rapid locus dropout along with decreasing relatedness among taxa. The relationship between locus dropout and the amount of phylogenetic information retained in the data has remained largely uninvestigated. Similarly, phylogenetic hypotheses based on RAD have rarely been compared with phylogenetic hypotheses based on multilocus Sanger sequencing, even less so using exactly the same species and specimens. We compared the Sanger-based phylogenetic hypothesis (8 loci; 6172 bp) of 32 species of the diverse moth genus Eupithecia (Lepidoptera, Geometridae) to that based on double-digest RAD sequencing (3256 loci; 726,658 bp). We observed that topologies were largely congruent, with some notable exceptions that we discuss. The locus dropout effect was strong. We demonstrate that number of loci is not a precise measure of phylogenetic information since the number of single-nucleotide polymorphisms (SNPs) may remain low at very shallow phylogenetic levels despite large numbers of loci. As we hypothesize, the number of SNPs and parsimony informative SNPs (PIS) is low at shallow phylogenetic levels, peaks at intermediate levels and, thereafter, declines again at the deepest levels as a result of decay of available loci. Similarly, we demonstrate with empirical data that the locus dropout affects the type of loci retained, the loci found in many species tending to show lower interspecific distances than those shared among fewer species. We also examine the effects of the numbers of loci, SNPs, and PIS on nodal bootstrap support, but could not demonstrate with our data our expectation of a positive correlation between them. We conclude that RAD methods provide a powerful tool for phylogenomics at an intermediate phylogenetic level as indicated by its broad congruence with an eight-gene Sanger data set in a genus of moths. When assessing the quality of the data for phylogenetic inference, the focus should be on the distribution and number of SNPs and PIS rather than on loci.

摘要

从传统的基于 Sanger 测序的分子方法向具有大量基因座的系统发育基因组学方法的快速转变正在进行中。在系统发育基因组学方法中,限制性位点相关 DNA(RAD)测序方法引起了广泛关注,因为它们能够快速生成数千个随机散布在基因组中的基因座,并且适合非模式物种。然而,RAD 数据集存在大量缺失数据和快速基因座丢失,以及分类群之间的相关性降低。基因座丢失与数据中保留的系统发育信息量之间的关系在很大程度上尚未得到研究。同样,基于 RAD 的系统发育假设很少与基于多位点 Sanger 测序的系统发育假设进行比较,更不用说使用完全相同的物种和标本进行比较了。我们比较了 32 种不同的蛾类 Eupithecia 属(鳞翅目,尺蛾科)的基于 Sanger 的系统发育假设(8 个基因座;6172bp)和基于双酶切 RAD 测序的系统发育假设(3256 个基因座;726658bp)。我们观察到拓扑结构基本一致,但也有一些值得注意的例外,我们将在讨论中讨论。基因座丢失效应很强。我们证明,基因座数量并不是系统发育信息量的精确衡量标准,因为尽管基因座数量很大,但在非常浅的系统发育水平上单核苷酸多态性(SNP)的数量可能仍然很低。正如我们假设的那样,在浅系统发育水平上 SNP 和简约信息 SNP(PIS)的数量较低,在中等水平上达到峰值,然后由于可用基因座的衰减再次下降。同样,我们用实证数据证明了基因座丢失会影响保留的基因座类型,许多物种中发现的基因座往往比在较少物种中共享的基因座具有更低的种间距离。我们还检查了基因座数量、SNP 和 PIS 对节点自举支持的影响,但我们的数据无法证明我们期望它们之间存在正相关关系。我们的结论是,RAD 方法在一个中等的系统发育水平上为系统发育基因组学提供了一个强大的工具,这从它与蛾类一个属的八个基因 Sanger 数据集的广泛一致性中可以看出。在评估用于系统发育推断的数据质量时,重点应该放在 SNP 和 PIS 的分布和数量上,而不是基因座上。

相似文献

1
Information Dropout Patterns in Restriction Site Associated DNA Phylogenomics and a Comparison with Multilocus Sanger Data in a Species-Rich Moth Genus.在限制位点相关 DNA 系统基因组学中的信息缺失模式,以及与一个富含物种的鳞翅目昆虫属的多位点 Sanger 数据的比较。
Syst Biol. 2018 Nov 1;67(6):925-939. doi: 10.1093/sysbio/syy029.
2
Comparing species tree estimation with large anchored phylogenomic and small Sanger-sequenced molecular datasets: an empirical study on Malagasy pseudoxyrhophiine snakes.比较大型锚定系统发育基因组学和小型桑格测序分子数据集的物种树估计:马达加斯加伪蝰蛇的实证研究
BMC Evol Biol. 2015 Oct 12;15:221. doi: 10.1186/s12862-015-0503-1.
3
Assessing the potential of RAD-sequencing to resolve phylogenetic relationships within species radiations: The fly genus Chiastocheta (Diptera: Anthomyiidae) as a case study.评估RAD测序在解析物种辐射内系统发育关系方面的潜力:以果蝇属Chiastocheta(双翅目:花蝇科)为例进行研究。
Mol Phylogenet Evol. 2017 Sep;114:189-198. doi: 10.1016/j.ympev.2017.06.012. Epub 2017 Jun 21.
4
Identification and assessment of variable single-copy orthologous (SCO) nuclear loci for low-level phylogenomics: a case study in the genus Rosa (Rosaceae).用于低水平系统发生基因组学的可变单拷贝直系同源(SCO)核基因座的鉴定和评估:以蔷薇属(蔷薇科)为例。
BMC Evol Biol. 2019 Jul 24;19(1):152. doi: 10.1186/s12862-019-1479-z.
5
Inferring the shallow phylogeny of true salamanders (Salamandra) by multiple phylogenomic approaches.通过多种系统发育基因组学方法推断真螈属(蝾螈属)的浅层系统发育关系。
Mol Phylogenet Evol. 2017 Oct;115:16-26. doi: 10.1016/j.ympev.2017.07.009. Epub 2017 Jul 14.
6
Comparison of Target-Capture and Restriction-Site Associated DNA Sequencing for Phylogenomics: A Test in Cardinalid Tanagers (Aves, Genus: Piranga).用于系统发育基因组学的靶向捕获测序与限制性位点相关DNA测序的比较:以主红雀属唐纳雀(鸟类,主红雀属)为例的测试
Syst Biol. 2016 Jul;65(4):640-50. doi: 10.1093/sysbio/syw005. Epub 2016 Jan 28.
7
Misconceptions on Missing Data in RAD-seq Phylogenetics with a Deep-scale Example from Flowering Plants.RAD测序系统发育学中缺失数据的误解:以开花植物为例进行深度解析
Syst Biol. 2017 May 1;66(3):399-412. doi: 10.1093/sysbio/syw092.
8
Phylogenomics and loci dropout patterns of deeply diverged Zodarion ant-eating spiders suggest a high potential of RAD-seq for genus-level spider phylogenetics.系统发生基因组学和深分歧的食蚁蛛属蜘蛛的基因座缺失模式表明 RAD-seq 技术在蜘蛛属水平的系统发生学中有很大的潜力。
Cladistics. 2022 Jun;38(3):320-334. doi: 10.1111/cla.12493. Epub 2021 Oct 26.
9
Accounting for Uncertainty in Gene Tree Estimation: Summary-Coalescent Species Tree Inference in a Challenging Radiation of Australian Lizards.基因树估计中的不确定性考量:澳大利亚蜥蜴复杂辐射演化中的总结合并物种树推断
Syst Biol. 2017 May 1;66(3):352-366. doi: 10.1093/sysbio/syw089.
10
ddRAD-seq phylogenetics based on nucleotide, indel, and presence-absence polymorphisms: Analyses of two avian genera with contrasting histories.基于核苷酸、插入缺失和存在-缺失多态性的ddRAD-seq系统发育分析:对两个具有不同演化历史的鸟类属的分析
Mol Phylogenet Evol. 2016 Jan;94(Pt A):122-35. doi: 10.1016/j.ympev.2015.07.026. Epub 2015 Aug 13.

引用本文的文献

1
Linking large-scale genetic structure of three Argynnini butterfly species to geography and environment.将三种 Argynnini 蝴蝶物种的大规模遗传结构与地理和环境联系起来。
Mol Ecol. 2022 Aug;31(16):4381-4401. doi: 10.1111/mec.16594. Epub 2022 Jul 15.
2
Integrative taxonomy reveals overlooked cryptic diversity in the conifer feeding (Zeller, 1839) (Lepidoptera, Batrachedridae).综合分类学揭示了针叶树食叶蛾(泽勒,1839年)(鳞翅目,细蛾科)中被忽视的隐性多样性。
Zookeys. 2022 Feb 8;1085:165-182. doi: 10.3897/zookeys.1085.76853. eCollection 2022.
3
Incomplete lineage sorting and ancient admixture, and speciation without morphological change in ghost-worm cryptic species.
隐匿蠕虫隐性物种中的不完全世系分选与古代混合,以及无形态变化的物种形成
PeerJ. 2021 Feb 9;9:e10896. doi: 10.7717/peerj.10896. eCollection 2021.
4
Relevance of ddRADseq method for species and population delimitation of closely related and widely distributed wolf spiders (Araneae, Lycosidae).ddRADseq 方法在亲缘关系密切且广泛分布的狼蛛(蜘蛛目,狼蛛科)物种和种群界定中的相关性。
Sci Rep. 2021 Jan 26;11(1):2177. doi: 10.1038/s41598-021-81788-2.
5
Double-digest RAD-sequencing: do pre- and post-sequencing protocol parameters impact biological results?双酶切 RAD 测序:测序前后的协议参数是否会影响生物学结果?
Mol Genet Genomics. 2021 Mar;296(2):457-471. doi: 10.1007/s00438-020-01756-9. Epub 2021 Jan 20.
6
Revision of the genus Boursin, 1937 (Lepidoptera, Noctuidae, Xyleninae). I. (Goeze, 1781) and its sister species (Costantini, 1922) sp. rev. in Europe.布尔辛属的修订,1937年(鳞翅目,夜蛾科,木夜蛾亚科)。I. (戈泽,1781年)及其姐妹种(科斯坦蒂尼,1922年)在欧洲的新种修订。
Zookeys. 2020 Apr 16;927:75-97. doi: 10.3897/zookeys.927.51142. eCollection 2020.
7
Using genomic information for management planning of an endangered perennial, .利用基因组信息进行一种濒危多年生植物的管理规划
Ecol Evol. 2020 Feb 17;10(5):2638-2649. doi: 10.1002/ece3.6093. eCollection 2020 Mar.
8
Maximize Resolution or Minimize Error? Using Genotyping-By-Sequencing to Investigate the Recent Diversification of (Cistaceae).最大化分辨率还是最小化误差?利用简化基因组测序技术研究半日花科植物的近期分化
Front Plant Sci. 2019 Nov 11;10:1416. doi: 10.3389/fpls.2019.01416. eCollection 2019.
9
The conundrum of species delimitation: a genomic perspective on a mitogenetically super-variable butterfly.物种界定的难题:基于基因组学对一个具有高度线粒体变异性的蝴蝶的研究。
Proc Biol Sci. 2019 Sep 25;286(1911):20191311. doi: 10.1098/rspb.2019.1311. Epub 2019 Sep 18.
10
Genome-wide SNP Data Reveal an Overestimation of Species Diversity in a Group of Hawkmoths.全基因组 SNP 数据揭示一组 Hawk 蛾物种多样性的高估。
Genome Biol Evol. 2019 Aug 1;11(8):2136-2150. doi: 10.1093/gbe/evz113.