Suppr超能文献

乙型肝炎病毒 (HBV) 序列变异图可改善比对和样本特异性共识序列构建。

A hepatitis B virus (HBV) sequence variation graph improves alignment and sample-specific consensus sequence construction.

机构信息

Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States of America.

Center for Biomedical Data Science, Yale School of Medicine, New Haven, CT, United States of America.

出版信息

PLoS One. 2024 Apr 26;19(4):e0301069. doi: 10.1371/journal.pone.0301069. eCollection 2024.

Abstract

Nearly 300 million individuals live with chronic hepatitis B virus (HBV) infection (CHB), for which no curative therapy is available. As viral diversity is associated with pathogenesis and immunological control of infection, improved methods to characterize this diversity could aid drug development efforts. Conventionally, viral sequencing data are mapped/aligned to a reference genome, and only the aligned sequences are retained for analysis. Thus, reference selection is critical, yet selecting the most representative reference a priori remains difficult. We investigate an alternative pangenome approach which can combine multiple reference sequences into a graph which can be used during alignment. Using simulated short-read sequencing data generated from publicly available HBV genomes and real sequencing data from an individual living with CHB, we demonstrate alignment to a phylogenetically representative 'genome graph' can improve alignment, avoid issues of reference ambiguity, and facilitate the construction of sample-specific consensus sequences more genetically similar to the individual's infection. Graph-based methods can, therefore, improve efforts to characterize the genetics of viral pathogens, including HBV, and have broader implications in host-pathogen research.

摘要

全球约有 3 亿人患有慢性乙型肝炎病毒 (HBV) 感染 (CHB),但目前尚无治愈方法。由于病毒多样性与感染的发病机制和免疫控制有关,因此改进对这种多样性进行描述的方法可能有助于药物研发工作。传统上,病毒测序数据被映射/对齐到参考基因组,只有对齐的序列才会被保留用于分析。因此,参考序列的选择至关重要,但预先选择最具代表性的参考序列仍然很困难。我们研究了一种替代的 pan 基因组方法,该方法可以将多个参考序列组合到一个图中,以便在对齐过程中使用。我们使用从公开可用的 HBV 基因组生成的模拟短读测序数据和来自患有 CHB 的个体的真实测序数据进行演示,表明对齐到具有系统发育代表性的“基因组图”可以改善对齐,避免参考模糊性问题,并有助于构建与个体感染更具遗传相似性的特定样本的共识序列。基于图的方法可以改进对病毒病原体(包括 HBV)遗传特征的描述,在宿主-病原体研究中也具有更广泛的意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97f0/11051683/3cb892f8f46f/pone.0301069.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验