
Fragmentary Gene Sequences Negatively Impact Gene Tree and Species Tree Reconstruction.

Affiliations

Department of Electrical and Computer Engineering, University of California at San Diego, La Jolla, CA.

Department of Entomology, University of Illinois, Urbana, IL.

Publication information

Mol Biol Evol. 2017 Dec 1;34(12):3279-3291. doi: 10.1093/molbev/msx261.

DOI: 10.1093/molbev/msx261
PMID: 29029241
Abstract

Species tree reconstruction from genome-wide data is increasingly being attempted, in most cases using a two-step approach of first estimating individual gene trees and then summarizing them to obtain a species tree. The accuracy of this approach, which promises to account for gene tree discordance, depends on the quality of the inferred gene trees. At the same time, phylogenomic and phylotranscriptomic analyses typically use involved bioinformatics pipelines for data preparation. Errors and shortcomings resulting from these preprocessing steps may impact the species tree analyses at the other end of the pipeline. In this article, we first show that the presence of fragmentary data for some species in a gene alignment, as often seen on real data, can result in substantial deterioration of gene trees, and as a result, the species tree. We then investigate a simple filtering strategy where individual fragmentary sequences are removed from individual genes but the rest of the gene is retained. Both in simulations and by reanalyzing a large insect phylotranscriptomic data set, we show the effectiveness of this simple filtering strategy.
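
The filtering strategy described in the abstract (drop individual fragmentary sequences from a gene alignment, keep the rest of the gene) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The definition of "fragmentary" used here (fraction of non-gap characters relative to alignment length), the 0.5 threshold, the gap characters `-` and `?`, and the function names `parse_fasta`, `occupancy`, and `filter_fragments` are all assumptions chosen for illustration.

```python
from typing import Dict


def parse_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA-formatted text into a {name: aligned_sequence} mapping."""
    records: Dict[str, str] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token as the sequence name.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def occupancy(seq: str, gap_chars: str = "-?") -> float:
    """Fraction of alignment columns where the sequence has a non-gap character."""
    if not seq:
        return 0.0
    non_gap = sum(1 for ch in seq if ch not in gap_chars)
    return non_gap / len(seq)


def filter_fragments(alignment: Dict[str, str],
                     min_occupancy: float = 0.5) -> Dict[str, str]:
    """Drop sequences below the occupancy threshold; keep the rest of the gene.

    The threshold is a hypothetical default, not a value taken from the paper.
    """
    return {name: seq for name, seq in alignment.items()
            if occupancy(seq) >= min_occupancy}


# Toy alignment: taxonC covers only 2 of 10 columns and counts as fragmentary.
EXAMPLE = """>taxonA
ACGTACGTAC
>taxonB
ACGTACGTAA
>taxonC
------GT--
>taxonD
ACG-ACGTAC
"""

alignment = parse_fasta(EXAMPLE)
kept = filter_fragments(alignment, min_occupancy=0.5)
print(sorted(kept))  # ['taxonA', 'taxonB', 'taxonD']
```

In a two-step pipeline, a filter of this kind would run on each gene alignment before gene tree estimation, so that the gene still contributes a tree on its remaining taxa.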

Similar articles

1
Fragmentary Gene Sequences Negatively Impact Gene Tree and Species Tree Reconstruction.
Mol Biol Evol. 2017 Dec 1;34(12):3279-3291. doi: 10.1093/molbev/msx261.
2
To Include or Not to Include: The Impact of Gene Filtering on Species Tree Estimation Methods.
Syst Biol. 2018 Mar 1;67(2):285-303. doi: 10.1093/sysbio/syx077.
3
Estimating optimal species trees from incomplete gene trees under deep coalescence.
J Comput Biol. 2012 Jun;19(6):591-605. doi: 10.1089/cmb.2012.0037.
4
ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees.
BMC Bioinformatics. 2018 May 8;19(Suppl 6):153. doi: 10.1186/s12859-018-2129-y.
5
The effect of alignment uncertainty, substitution models and priors in building and dating the mammal tree of life.
BMC Evol Biol. 2019 Nov 6;19(1):203. doi: 10.1186/s12862-019-1534-9.
6
Multi-allele species reconstruction using ASTRAL.
Mol Phylogenet Evol. 2019 Jan;130:286-296. doi: 10.1016/j.ympev.2018.10.033. Epub 2018 Oct 26.
7
Unblended disjoint tree merging using GTM improves species tree estimation.
BMC Genomics. 2020 Apr 16;21(Suppl 2):235. doi: 10.1186/s12864-020-6605-1.
8
SVDquest: Improving SVDquartets species tree estimation using exact optimization within a constrained search space.
Mol Phylogenet Evol. 2018 Jul;124:122-136. doi: 10.1016/j.ympev.2018.03.006. Epub 2018 Mar 9.
9
Species Tree Estimation Using ASTRAL: How Many Genes Are Enough?
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1738-1747. doi: 10.1109/TCBB.2017.2757930. Epub 2017 Sep 29.
10
From phylogenetics to phylogenomics: the evolutionary relationships of insect endosymbiotic gamma-Proteobacteria as a test case.
Syst Biol. 2007 Feb;56(1):1-16. doi: 10.1080/10635150601109759.

Cited by

1
Genomic organization, domain assortments, and nucleotide-binding domain diversity of NLR proteins in Sordariales fungi.
PLoS Genet. 2025 Jul 7;21(7):e1011739. doi: 10.1371/journal.pgen.1011739. eCollection 2025 Jul.
2
Testing Phylogenetic Placement Accuracy of DNA Barcode Sequences on a Fish Backbone Tree: Implications of Backbone Tree Completeness and Species Representation.
Ecol Evol. 2025 Jan 7;15(1):e70817. doi: 10.1002/ece3.70817. eCollection 2025 Jan.
3
Efficient phylogenetic tree inference for massive taxonomic datasets: harnessing the power of a server to analyze 1 million taxa.
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae055.
4
Accurate, scalable, and fully automated inference of species trees from raw genome assemblies using ROADIES.
bioRxiv. 2024 Jun 1:2024.05.27.596098. doi: 10.1101/2024.05.27.596098.
5
The sweet tabaiba or there and back again: phylogeographical history of the Macaronesian Euphorbia balsamifera.
Ann Bot. 2024 May 10;133(5-6):883-904. doi: 10.1093/aob/mcae001.
6
Introgression Underlies Phylogenetic Uncertainty But Not Parallel Plumage Evolution in a Recent Songbird Radiation.
Syst Biol. 2024 May 27;73(1):12-25. doi: 10.1093/sysbio/syad062.
7
Generation of accurate, expandable phylogenomic trees with uDance.
Nat Biotechnol. 2024 May;42(5):768-777. doi: 10.1038/s41587-023-01868-8. Epub 2023 Jul 27.
8
Assembling a Reference Phylogenomic Tree of Bacteria and Archaea by Summarizing Many Gene Phylogenies.
Methods Mol Biol. 2022;2569:137-165. doi: 10.1007/978-1-0716-2691-7_7.
9
Recent progress on methods for estimating and updating large phylogenies.
Philos Trans R Soc Lond B Biol Sci. 2022 Oct 10;377(1861):20210244. doi: 10.1098/rstb.2021.0244. Epub 2022 Aug 22.
10
Phylogenomic Coalescent Analyses of Avian Retroelements Infer Zero-Length Branches at the Base of Neoaves, Emergent Support for Controversial Clades, and Ancient Introgressive Hybridization in Afroaves.
Genes (Basel). 2022 Jun 28;13(7):1167. doi: 10.3390/genes13071167.