• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ClipKIT:一种用于准确系统发育推断的多重序列比对修剪软件。

ClipKIT: A multiple sequence alignment trimming software for accurate phylogenomic inference.

机构信息

Vanderbilt University, Department of Biological Sciences, Nashville, Tennessee, United States of America.

Nashville, Tennessee, United States of America.

出版信息

PLoS Biol. 2020 Dec 2;18(12):e3001007. doi: 10.1371/journal.pbio.3001007. eCollection 2020 Dec.

DOI:10.1371/journal.pbio.3001007
PMID:33264284
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7735675/
Abstract

Highly divergent sites in multiple sequence alignments (MSAs), which can stem from erroneous inference of homology and saturation of substitutions, are thought to negatively impact phylogenetic inference. Thus, several different trimming strategies have been developed for identifying and removing these sites prior to phylogenetic inference. However, a recent study reported that doing so can worsen inference, underscoring the need for alternative alignment trimming strategies. Here, we introduce ClipKIT, an alignment trimming software that, rather than identifying and removing putatively phylogenetically uninformative sites, instead aims to identify and retain parsimony-informative sites, which are known to be phylogenetically informative. To test the efficacy of ClipKIT, we examined the accuracy and support of phylogenies inferred from 14 different alignment trimming strategies, including those implemented in ClipKIT, across nearly 140,000 alignments from a broad sampling of evolutionary histories. Phylogenies inferred from ClipKIT-trimmed alignments are accurate, robust, and time saving. Furthermore, ClipKIT consistently outperformed other trimming methods across diverse datasets, suggesting that strategies based on identifying and retaining parsimony-informative sites provide a robust framework for alignment trimming.

摘要

高度分化的多序列比对(MSA)位点可能源于同源性推断错误和替代饱和,被认为会对系统发育推断产生负面影响。因此,已经开发了几种不同的修剪策略,用于在进行系统发育推断之前识别和去除这些位点。然而,最近的一项研究报告称,这样做可能会恶化推断,这凸显了需要替代的对齐修剪策略。在这里,我们介绍了 ClipKIT,这是一种对齐修剪软件,它不是识别和去除推测的系统发育上无信息的位点,而是旨在识别和保留简约信息位点,这些位点已知是系统发育信息丰富的。为了测试 ClipKIT 的功效,我们检查了来自 ClipKIT 等 14 种不同对齐修剪策略推断的系统发育的准确性和支持率,这些策略涵盖了来自广泛进化历史样本的近 140,000 个对齐。从 ClipKIT 修剪对齐推断出的系统发育是准确的、稳健的且节省时间的。此外,ClipKIT 在不同数据集上始终优于其他修剪方法,这表明基于识别和保留简约信息位点的策略为对齐修剪提供了一个稳健的框架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0040/7735675/6e8b269e0a42/pbio.3001007.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0040/7735675/c91675cba31c/pbio.3001007.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0040/7735675/6e8b269e0a42/pbio.3001007.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0040/7735675/c91675cba31c/pbio.3001007.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0040/7735675/6e8b269e0a42/pbio.3001007.g002.jpg

相似文献

1
ClipKIT: A multiple sequence alignment trimming software for accurate phylogenomic inference.ClipKIT:一种用于准确系统发育推断的多重序列比对修剪软件。
PLoS Biol. 2020 Dec 2;18(12):e3001007. doi: 10.1371/journal.pbio.3001007. eCollection 2020 Dec.
2
BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments.BMGE(基于信息熵的块映射与聚集):一种从多序列比对中选择系统发育信息区域的新软件。
BMC Evol Biol. 2010 Jul 13;10:210. doi: 10.1186/1471-2148-10-210.
3
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
4
Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference.当前用于多序列比对自动过滤的方法常常会使单基因系统发育推断变差。
Syst Biol. 2015 Sep;64(5):778-91. doi: 10.1093/sysbio/syv033. Epub 2015 Jun 1.
5
Using ESTs for phylogenomics: can one accurately infer a phylogenetic tree from a gappy alignment?利用ESTs进行系统发育基因组学研究:能否从有缺口的比对中准确推断系统发育树?
BMC Evol Biol. 2008 Mar 26;8:95. doi: 10.1186/1471-2148-8-95.
6
The tree alignment problem.树对齐问题。
BMC Bioinformatics. 2012 Nov 9;13:293. doi: 10.1186/1471-2105-13-293.
7
SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.SATe-II:一种非常快速且准确的同时估计多个序列比对和系统发育树的方法。
Syst Biol. 2012 Jan;61(1):90-106. doi: 10.1093/sysbio/syr095. Epub 2011 Dec 1.
8
Phylogenomic inference of protein molecular function.蛋白质分子功能的系统发育基因组学推断
Curr Protoc Bioinformatics. 2005 Oct;Chapter 6:Unit 6.9. doi: 10.1002/0471250953.bi0609s11.
9
Fast-Evolving Alignment Sites Are Highly Informative for Reconstructions of Deep Tree of Life Phylogenies.快速进化的比对位点对重建生命之树的深层系统发育具有高度信息价值。
Microorganisms. 2023 Oct 5;11(10):2499. doi: 10.3390/microorganisms11102499.
10
A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments.一种对齐掩蔽方法,用于细化多重序列比对的系统发育信号。
Mol Biol Evol. 2013 Mar;30(3):689-712. doi: 10.1093/molbev/mss264. Epub 2012 Nov 27.

引用本文的文献

1
Expanding the diversity of Celavirus, the most divergent genus in the family Potyviridae.扩展芹菜病毒属(马铃薯Y病毒科中差异最大的属)的多样性。
Virus Genes. 2025 Sep 11. doi: 10.1007/s11262-025-02184-w.
2
Living Environment and Basic Features of the Nematodes Associated with Dung Beetle .蜣螂相关线虫的生存环境及基本特征
J Nematol. 2025 Aug 31;57(1):20250035. doi: 10.2478/jofnem-2025-0035. eCollection 2025 Feb.
3
Taxonomic quasi-primes: peptides charting lineage-specific adaptations and disease-relevant loci.分类学准素:描绘谱系特异性适应性和疾病相关基因座的肽段。

本文引用的文献

1
Genome-scale phylogeny and contrasting modes of genome evolution in the fungal phylum Ascomycota.基因组规模的系统发育和真菌门子囊菌门中截然不同的基因组进化模式。
Sci Adv. 2020 Nov 4;6(45). doi: 10.1126/sciadv.abd0079. Print 2020 Nov.
2
Phylogenetic tree building in the genomic age.基因组时代的系统发育树构建。
Nat Rev Genet. 2020 Jul;21(7):428-444. doi: 10.1038/s41576-020-0233-0. Epub 2020 May 18.
3
A Robust Phylogenomic Time Tree for Biotechnologically and Medically Important Fungi in the Genera and .具有生物技术和医学重要性的 和 属真菌的稳健系统基因组时间树。
Protein Sci. 2025 Sep;34(9):e70241. doi: 10.1002/pro.70241.
4
Conservation of bilaterian genome structure is the exception, not the rule.两侧对称动物基因组结构的保守是例外,而非普遍规律。
Genome Biol. 2025 Aug 18;26(1):247. doi: 10.1186/s13059-025-03732-1.
5
The evolutionary replacement of restriction-modification by Ssp antiviral systems is associated with the distribution of prophages in the major clonal group of .Ssp抗病毒系统对限制修饰的进化替代与原噬菌体在主要克隆群中的分布有关。
mBio. 2025 Sep 10;16(9):e0213525. doi: 10.1128/mbio.02135-25. Epub 2025 Aug 18.
6
The genomic origin of the unique chaetognath body plan.独特箭虫身体结构的基因组起源。
Nature. 2025 Aug 13. doi: 10.1038/s41586-025-09403-2.
7
Phyling: phylogenetic inference from annotated genomes.系统发育分析:从注释基因组进行系统发育推断。
bioRxiv. 2025 Aug 1:2025.07.30.666921. doi: 10.1101/2025.07.30.666921.
8
Evolution of Duplicated Hox Gene Clusters in Land Snails and Slugs.陆地蜗牛和蛞蝓中重复的Hox基因簇的进化
J Exp Zool B Mol Dev Evol. 2025 Sep;344(6):363-368. doi: 10.1002/jez.b.23322.
9
Vibrio pectenicida strain FHCF-3 is a causative agent of sea star wasting disease.栉孔扇贝弧菌FHCF-3菌株是海星消瘦病的病原体。
Nat Ecol Evol. 2025 Aug 4. doi: 10.1038/s41559-025-02797-2.
10
Fungal Planet description sheets: 1781-1866.真菌星球描述表:1781 - 1866年
Persoonia. 2025 Jun;54:327-587. doi: 10.3114/persoonia.2025.54.10. Epub 2025 Jul 8.
mBio. 2019 Jul 9;10(4):e00925-19. doi: 10.1128/mBio.00925-19.
4
Challenges and recommendations to improve the installability and archival stability of omics computational tools.提高组学计算工具可安装性和档案稳定性的挑战和建议。
PLoS Biol. 2019 Jun 20;17(6):e3000333. doi: 10.1371/journal.pbio.3000333. eCollection 2019 Jun.
5
integRATE: a desirability-based data integration framework for the prioritization of candidate genes across heterogeneous omics and its application to preterm birth.integRATE:一种基于理想性的数据整合框架,用于对异质组学中的候选基因进行优先级排序,并将其应用于早产研究。
BMC Med Genomics. 2018 Nov 19;11(1):107. doi: 10.1186/s12920-018-0426-y.
6
UFBoot2: Improving the Ultrafast Bootstrap Approximation.UFBoot2:改进超快bootstrap 逼近算法。
Mol Biol Evol. 2018 Feb 1;35(2):518-522. doi: 10.1093/molbev/msx281.
7
Reconstructing the Backbone of the Saccharomycotina Yeast Phylogeny Using Genome-Scale Data.利用基因组规模数据重建酵母亚门酵母系统发育的主干
G3 (Bethesda). 2016 Dec 7;6(12):3927-3939. doi: 10.1534/g3.116.034744.
8
A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.关于序列、功能和基于树的基因特性如何影响系统发育推断的全基因组规模研究。
Genome Biol Evol. 2016 Sep 2;8(8):2565-80. doi: 10.1093/gbe/evw179.
9
MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets.MEGA7:适用于更大数据集的分子进化遗传学分析版本7.0
Mol Biol Evol. 2016 Jul;33(7):1870-4. doi: 10.1093/molbev/msw054. Epub 2016 Mar 22.
10
Computing the Internode Certainty and Related Measures from Partial Gene Trees.从部分基因树计算节间确定性及相关度量。
Mol Biol Evol. 2016 Jun;33(6):1606-17. doi: 10.1093/molbev/msw040. Epub 2016 Feb 25.