• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将插入缺失信息纳入快速出现的病原体的系统发育估计中。

Incorporating indel information into phylogeny estimation for rapidly emerging pathogens.

作者信息

Redelings Benjamin D, Suchard Marc A

机构信息

Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27606, USA.

出版信息

BMC Evol Biol. 2007 Mar 14;7:40. doi: 10.1186/1471-2148-7-40.

DOI:10.1186/1471-2148-7-40
PMID:17359539
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1853084/
Abstract

BACKGROUND

Phylogenies of rapidly evolving pathogens can be difficult to resolve because of the small number of substitutions that accumulate in the short times since divergence. To improve resolution of such phylogenies we propose using insertion and deletion (indel) information in addition to substitution information. We accomplish this through joint estimation of alignment and phylogeny in a Bayesian framework, drawing inference using Markov chain Monte Carlo. Joint estimation of alignment and phylogeny sidesteps biases that stem from conditioning on a single alignment by taking into account the ensemble of near-optimal alignments.

RESULTS

We introduce a novel Markov chain transition kernel that improves computational efficiency by proposing non-local topology rearrangements and by block sampling alignment and topology parameters. In addition, we extend our previous indel model to increase biological realism by placing indels preferentially on longer branches. We demonstrate the ability of indel information to increase phylogenetic resolution in examples drawn from within-host viral sequence samples. We also demonstrate the importance of taking alignment uncertainty into account when using such information. Finally, we show that codon-based substitution models can significantly affect alignment quality and phylogenetic inference by unrealistically forcing indels to begin and end between codons.

CONCLUSION

These results indicate that indel information can improve phylogenetic resolution of recently diverged pathogens and that alignment uncertainty should be considered in such analyses.

摘要

背景

由于快速进化的病原体自分化以来在短时间内积累的替换数量较少,其系统发育关系可能难以解析。为了提高此类系统发育关系的解析度,我们建议除了使用替换信息外,还使用插入和缺失(indel)信息。我们通过在贝叶斯框架中对序列比对和系统发育进行联合估计来实现这一点,并使用马尔可夫链蒙特卡罗进行推断。序列比对和系统发育的联合估计通过考虑近乎最优比对的集合,避免了因基于单个比对而产生的偏差。

结果

我们引入了一种新颖的马尔可夫链转移核,通过提出非局部拓扑重排以及对序列比对和拓扑参数进行分块采样来提高计算效率。此外,我们扩展了之前的indel模型,通过将indel优先放置在较长分支上来增加生物学真实性。我们在宿主内病毒序列样本的实例中展示了indel信息增加系统发育解析度的能力。我们还证明了在使用此类信息时考虑序列比对不确定性的重要性。最后,我们表明基于密码子的替换模型可能会通过不切实际地迫使indel在密码子之间开始和结束,从而显著影响序列比对质量和系统发育推断。

结论

这些结果表明,indel信息可以提高近期分化病原体的系统发育解析度,并且在这类分析中应考虑序列比对的不确定性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/307b25126815/1471-2148-7-40-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/25326af9068b/1471-2148-7-40-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/c6db7f72d9d7/1471-2148-7-40-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/d1f050c4270d/1471-2148-7-40-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/59e08993e11a/1471-2148-7-40-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/7f8daa93efc2/1471-2148-7-40-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/be63e0ad6628/1471-2148-7-40-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/307b25126815/1471-2148-7-40-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/25326af9068b/1471-2148-7-40-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/c6db7f72d9d7/1471-2148-7-40-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/d1f050c4270d/1471-2148-7-40-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/59e08993e11a/1471-2148-7-40-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/7f8daa93efc2/1471-2148-7-40-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/be63e0ad6628/1471-2148-7-40-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/1853084/307b25126815/1471-2148-7-40-7.jpg

相似文献

1
Incorporating indel information into phylogeny estimation for rapidly emerging pathogens.将插入缺失信息纳入快速出现的病原体的系统发育估计中。
BMC Evol Biol. 2007 Mar 14;7:40. doi: 10.1186/1471-2148-7-40.
2
Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计
BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.
3
Joint Bayesian estimation of alignment and phylogeny.比对与系统发育的联合贝叶斯估计。
Syst Biol. 2005 Jun;54(3):401-18. doi: 10.1080/10635150590947041.
4
BayesCAT: Bayesian co-estimation of alignment and tree.贝叶斯CAT:比对和树的贝叶斯联合估计
Biometrics. 2018 Mar;74(1):270-279. doi: 10.1111/biom.12640. Epub 2017 Jan 18.
5
HandAlign: Bayesian multiple sequence alignment, phylogeny and ancestral reconstruction.HandAlign:贝叶斯多序列比对、系统发育和祖先重建。
Bioinformatics. 2012 Apr 15;28(8):1170-1. doi: 10.1093/bioinformatics/bts058. Epub 2012 Jan 28.
6
BAli-Phy: simultaneous Bayesian inference of alignment and phylogeny.BAli-Phy:比对和系统发育的同步贝叶斯推断
Bioinformatics. 2006 Aug 15;22(16):2047-8. doi: 10.1093/bioinformatics/btl175. Epub 2006 May 5.
7
StatAlign: an extendable software package for joint Bayesian estimation of alignments and evolutionary trees.StatAlign:一个用于比对和进化树联合贝叶斯估计的可扩展软件包。
Bioinformatics. 2008 Oct 15;24(20):2403-4. doi: 10.1093/bioinformatics/btn457. Epub 2008 Aug 27.
8
Bayesian phylogenetic model selection using reversible jump Markov chain Monte Carlo.使用可逆跳跃马尔可夫链蒙特卡罗方法进行贝叶斯系统发育模型选择。
Mol Biol Evol. 2004 Jun;21(6):1123-33. doi: 10.1093/molbev/msh123. Epub 2004 Mar 19.
9
Evolutionary inference via the Poisson Indel Process.通过泊松插入缺失过程进行进化推断。
Proc Natl Acad Sci U S A. 2013 Jan 22;110(4):1160-6. doi: 10.1073/pnas.1220450110. Epub 2012 Dec 28.
10
General continuous-time Markov model of sequence evolution via insertions/deletions: are alignment probabilities factorable?通过插入/缺失进行序列进化的一般连续时间马尔可夫模型:比对概率是否可分解?
BMC Bioinformatics. 2016 Aug 11;17:304. doi: 10.1186/s12859-016-1105-7.

引用本文的文献

1
Are reads required? High-precision variant calling from bacterial genome assemblies.是否需要读数?从细菌基因组组装中进行高精度变异检测。
Access Microbiol. 2025 May 28;7(5). doi: 10.1099/acmi.0.001025.v3. eCollection 2025.
2
Gene conversion and duplication contribute to genetic variation in an outbreak of .基因转换和重复导致了……爆发中的遗传变异。 (原文中“an outbreak of.”后面缺少具体内容)
Microb Genom. 2025 May;11(5). doi: 10.1099/mgen.0.001396.
3
Phyloformer: Fast, Accurate, and Versatile Phylogenetic Reconstruction with Deep Neural Networks.

本文引用的文献

1
Incorporating information from length-mutational events into phylogenetic analysis.将长度突变事件的信息纳入系统发育分析。
Mol Phylogenet Evol. 2006 Mar;38(3):667-76. doi: 10.1016/j.ympev.2005.07.011. Epub 2005 Aug 29.
2
Joint Bayesian estimation of alignment and phylogeny.比对与系统发育的联合贝叶斯估计。
Syst Biol. 2005 Jun;54(3):401-18. doi: 10.1080/10635150590947041.
3
Indel patterns of the plastid DNA trnL- trnF region within the genus Poa (Poaceae).早熟禾属(禾本科)植物质体DNA trnL - trnF区域的插入缺失模式
Phyloformer:使用深度神经网络进行快速、准确且通用的系统发育重建。
Mol Biol Evol. 2025 Apr 1;42(4). doi: 10.1093/molbev/msaf051.
4
Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications.插入和缺失:计算方法、进化动态和生物应用。
Mol Biol Evol. 2024 Sep 4;41(9). doi: 10.1093/molbev/msae177.
5
Reliable estimation of tree branch lengths using deep neural networks.利用深度神经网络可靠估计树枝长度。
PLoS Comput Biol. 2024 Aug 5;20(8):e1012337. doi: 10.1371/journal.pcbi.1012337. eCollection 2024 Aug.
6
COATi: Statistical Pairwise Alignment of Protein-Coding Sequences.COATi:蛋白质编码序列的统计成对比对。
Mol Biol Evol. 2024 Jul 3;41(7). doi: 10.1093/molbev/msae117.
7
Correlations between alignment gaps and nucleotide substitution or amino acid replacement.碱基排列差异与核苷酸替换或氨基酸替换的相关性。
Proc Natl Acad Sci U S A. 2022 Aug 23;119(34):e2204435119. doi: 10.1073/pnas.2204435119. Epub 2022 Aug 16.
8
Evaluation of various distance computation methods for construction of haplotype-based phylogenies from large MLST datasets.基于多位点序列分型大数据集构建单体型系统发育树的各种距离计算方法的评估。
Mol Phylogenet Evol. 2022 Dec;177:107608. doi: 10.1016/j.ympev.2022.107608. Epub 2022 Aug 11.
9
The GH19 Engineering Database: Sequence diversity, substrate scope, and evolution in glycoside hydrolase family 19.GH19 工程数据库:糖苷水解酶家族 19 的序列多样性、底物范围和进化。
PLoS One. 2021 Oct 26;16(10):e0256817. doi: 10.1371/journal.pone.0256817. eCollection 2021.
10
Indels in SARS-CoV-2 occur at template-switching hotspots.新冠病毒中的插入缺失发生在模板转换热点区域。
BioData Min. 2021 Mar 20;14(1):20. doi: 10.1186/s13040-021-00251-0.
J Plant Res. 2004 Oct;117(5):393-407. doi: 10.1007/s10265-004-0172-4. Epub 2004 Sep 10.
4
Timing and reconstruction of the most recent common ancestor of the subtype C clade of human immunodeficiency virus type 1.1型人类免疫缺陷病毒C亚型进化枝最近共同祖先的时间确定与重建
J Virol. 2004 Oct;78(19):10501-6. doi: 10.1128/JVI.78.19.10501-10506.2004.
5
Phylogenetic analysis of polyomavirus simian virus 40 from monkeys and humans reveals genetic variation.对来自猴子和人类的多瘤病毒猴空泡病毒40进行系统发育分析,揭示了基因变异。
J Virol. 2004 Sep;78(17):9306-16. doi: 10.1128/JVI.78.17.9306-9316.2004.
6
MUSCLE: multiple sequence alignment with high accuracy and high throughput.MUSCLE:具有高精度和高吞吐量的多序列比对。
Nucleic Acids Res. 2004 Mar 19;32(5):1792-7. doi: 10.1093/nar/gkh340. Print 2004.
7
Molecular evolution of the SARS coronavirus during the course of the SARS epidemic in China.中国SARS疫情期间SARS冠状病毒的分子进化
Science. 2004 Mar 12;303(5664):1666-9. doi: 10.1126/science.1092002. Epub 2004 Jan 29.
8
The causes and consequences of HIV evolution.HIV进化的原因及后果。
Nat Rev Genet. 2004 Jan;5(1):52-61. doi: 10.1038/nrg1246.
9
Iterative pass optimization of sequence data.序列数据的迭代传递优化
Cladistics. 2003 Jun;19(3):254-60.
10
Molecular evolution of insertions and deletion in the chloroplast genome of silene.麦瓶草叶绿体基因组中插入和缺失的分子进化
Mol Biol Evol. 2003 Nov;20(11):1737-40. doi: 10.1093/molbev/msg163. Epub 2003 Jun 27.