• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过最大熵核苷酸偏差模型解读病毒-宿主适应性密码

Deciphering the Code of Viral-Host Adaptation Through Maximum-Entropy Nucleotide Bias Models.

作者信息

Di Gioacchino Andrea, Lecce Ivan, Greenbaum Benjamin D, Monasson Rémi, Cocco Simona

机构信息

CNRS UMR 8023, Laboratory of Physics of the Ecole Normale Supérieure and PSL Research, Sorbonne Université, Paris, France.

Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.

出版信息

Mol Biol Evol. 2025 Jun 4;42(6). doi: 10.1093/molbev/msaf127.

DOI:10.1093/molbev/msaf127
PMID:40458044
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12205363/
Abstract

How viruses evolve largely depends on their hosts. To quantitatively characterize this dependence, we introduce Maximum Entropy Nucleotide Bias models (MENB) learned from single, di- and tri-nucleotide usage of viral sequences that infect a given host. We first use MENB to classify the viral family and the host of a virus from its genome, among four families of ssRNA viruses and three hosts. We show that both the viral family and the host leave a fingerprint in nucleotide motif usages that MENB models decode. Benchmarking our approach against state-of-the-art methods based on deep neural networks shows that MENB is rapid, interpretable and robust. Our approach is able to predict, with good accuracy, both the viral family and the host from a whole genomic sequence or a portion of it. MENB models also display promising out of sample generalization ability on viral sequences of new host taxa or new viral families. Our approach is also capable of identifying, within the limitations imposed by the three-host setting, intermediate hosts for well-known pathogenic strains of Influenza A subtypes and Human Coronavirus and recombinations and reassortments on specific genomic regions. Finally, MENB models can be used to track the adaptation to the new host, to shed light on the more relevant selective pressures that acted on motif usage during this process and to design new sequences with altered nucleotide usage at fixed amino-acid content.

摘要

病毒如何进化很大程度上取决于其宿主。为了定量描述这种依赖性,我们引入了最大熵核苷酸偏差模型(MENB),该模型是从感染给定宿主的病毒序列的单核苷酸、双核苷酸和三核苷酸使用情况中学习得到的。我们首先使用MENB从病毒基因组中对病毒家族和宿主进行分类,涉及四种单链RNA病毒家族和三种宿主。我们表明,病毒家族和宿主都会在MENB模型解码的核苷酸基序使用中留下印记。将我们的方法与基于深度神经网络的最先进方法进行基准测试表明,MENB快速、可解释且稳健。我们的方法能够从整个基因组序列或其一部分中准确预测病毒家族和宿主。MENB模型在新宿主分类群或新病毒家族的病毒序列上也显示出有前景的样本外泛化能力。我们的方法还能够在三宿主设置所施加的限制范围内,识别甲型流感亚型和人类冠状病毒的知名致病菌株的中间宿主以及特定基因组区域的重组和重配。最后,MENB模型可用于追踪对新宿主的适应性,揭示在此过程中作用于基序使用的更相关的选择压力,并设计在固定氨基酸含量下具有改变的核苷酸使用的新序列。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/0bd8ab8d219c/msaf127f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/24d4403888a1/msaf127f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/3e82ae817062/msaf127f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/f5584a8070f0/msaf127f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/02adc8be6310/msaf127f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/b6d7d6b29b0c/msaf127f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/420493245d4e/msaf127f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/762dba35f1a4/msaf127f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/5772642a12ec/msaf127f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/654c7760d2e1/msaf127f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/0a38ce1fdac4/msaf127f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/0bd8ab8d219c/msaf127f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/24d4403888a1/msaf127f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/3e82ae817062/msaf127f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/f5584a8070f0/msaf127f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/02adc8be6310/msaf127f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/b6d7d6b29b0c/msaf127f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/420493245d4e/msaf127f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/762dba35f1a4/msaf127f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/5772642a12ec/msaf127f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/654c7760d2e1/msaf127f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/0a38ce1fdac4/msaf127f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bf0/12205363/0bd8ab8d219c/msaf127f11.jpg

相似文献

1
Deciphering the Code of Viral-Host Adaptation Through Maximum-Entropy Nucleotide Bias Models.通过最大熵核苷酸偏差模型解读病毒-宿主适应性密码
Mol Biol Evol. 2025 Jun 4;42(6). doi: 10.1093/molbev/msaf127.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Sequence signatures within the genome of SARS-CoV-2 can be used to predict host source.SARS-CoV-2 基因组中的序列特征可用于预测宿主来源。
Microbiol Spectr. 2024 Apr 2;12(4):e0358423. doi: 10.1128/spectrum.03584-23. Epub 2024 Mar 4.
4
Physical interventions to interrupt or reduce the spread of respiratory viruses.物理干预措施以阻断或减少呼吸道病毒的传播。
Cochrane Database Syst Rev. 2023 Jan 30;1(1):CD006207. doi: 10.1002/14651858.CD006207.pub6.
5
Modelling the effects of phylogeny and body size on within-host pathogen replication and immune response.建立模型以探究系统发育和体型对宿主内病原体复制和免疫反应的影响。
J R Soc Interface. 2017 Nov;14(136). doi: 10.1098/rsif.2017.0479.
6
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
7
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
8
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.
9
Short-Term Memory Impairment短期记忆障碍
10
Incentives for preventing smoking in children and adolescents.预防儿童和青少年吸烟的激励措施。
Cochrane Database Syst Rev. 2017 Jun 6;6(6):CD008645. doi: 10.1002/14651858.CD008645.pub3.

本文引用的文献

1
Inferring effects of mutations on SARS-CoV-2 transmission from genomic surveillance data.从基因组监测数据推断突变对新冠病毒传播的影响。
Nat Commun. 2025 Jan 7;16(1):441. doi: 10.1038/s41467-024-55593-0.
2
Infer global, predict local: Quantity-relevance trade-off in protein fitness predictions from sequence data.从序列数据推断全局,预测局部:蛋白质适应性预测中的数量-相关性权衡。
PLoS Comput Biol. 2023 Oct 26;19(10):e1011521. doi: 10.1371/journal.pcbi.1011521. eCollection 2023 Oct.
3
Algorithm for optimized mRNA design improves stability and immunogenicity.
优化 mRNA 设计的算法可提高稳定性和免疫原性。
Nature. 2023 Sep;621(7978):396-403. doi: 10.1038/s41586-023-06127-z. Epub 2023 May 2.
4
Evolution of the SARS-CoV-2 Mutational Spectrum.SARS-CoV-2 突变谱的演变。
Mol Biol Evol. 2023 Apr 4;40(4). doi: 10.1093/molbev/msad085.
5
Compensatory epistasis maintains ACE2 affinity in SARS-CoV-2 Omicron BA.1.补偿性上位性维持 SARS-CoV-2 奥密克戎 BA.1 中 ACE2 的亲和力。
Nat Commun. 2022 Nov 16;13(1):7011. doi: 10.1038/s41467-022-34506-z.
6
Introducing the Bacterial and Viral Bioinformatics Resource Center (BV-BRC): a resource combining PATRIC, IRD and ViPR.推出细菌和病毒生物信息学资源中心(BV-BRC):一个整合 PATRIC、IRD 和 ViPR 的资源。
Nucleic Acids Res. 2023 Jan 6;51(D1):D678-D689. doi: 10.1093/nar/gkac1003.
7
SARS-CoV-2 Omicron variant emerged under immune selection.SARS-CoV-2 奥密克戎变异株是在免疫选择压力下出现的。
Nat Microbiol. 2022 Nov;7(11):1756-1761. doi: 10.1038/s41564-022-01246-1. Epub 2022 Oct 4.
8
Emu: species-level microbial community profiling of full-length 16S rRNA Oxford Nanopore sequencing data.鸸鹋:全长 16S rRNA Oxford Nanopore 测序数据的种水平微生物群落分析。
Nat Methods. 2022 Jul;19(7):845-853. doi: 10.1038/s41592-022-01520-4. Epub 2022 Jun 30.
9
Recent Progress in Recombinant Influenza Vaccine Development Toward Heterosubtypic Immune Response.近年来重组流感疫苗研发取得进展,以实现针对不同亚型的免疫应答。
Front Immunol. 2022 May 19;13:878943. doi: 10.3389/fimmu.2022.878943. eCollection 2022.
10
ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction.ScanNet:一种用于基于结构的蛋白质结合位点预测的可解释几何深度学习模型。
Nat Methods. 2022 Jun;19(6):730-739. doi: 10.1038/s41592-022-01490-7. Epub 2022 May 30.