Department of Infectious Diseases, Research Laboratory of Clinical Virology, Ruijin Hospital, Shanghai Jiaotong University, School of Medicine, Shanghai, People's Republic of China.
Department of Gastroenterology, Ruijin Hospital, Shanghai Jiaotong University, School of Medicine, Shanghai, People's Republic of China.
Emerg Microbes Infect. 2021 Dec;10(1):842-851. doi: 10.1080/22221751.2021.1919033.
Few non-invasive models were established for precisely identifying the immune tolerant (IT) phase from chronic hepatitis B (CHB). This study aimed to develop a novel approach that combined next-generation sequencing (NGS) and machine learning algorithms using our recently published viral quasispecies (QS) analysis package. 290 HBeAg positive patients from whom liver biopsies were taken were enrolled and divided into a training group ( = 148) and a validation group ( = 142). HBV DNA was extracted and QS sequences were obtained by NGS. Hierarchical clustering analysis (HCA) and principal component analysis (PCA) based on viral operational taxonomic units (OTUs) were performed to explore the correlations among QS and clinical phenotypes. Three machine learning algorithms, including K-nearest neighbour, support vector machine, and random forest algorithm, were used to construct diagnostic models for IT phase classification. Based on histopathology, 90 IT patients and 200 CHB patients were diagnosed. HBsAg titres for IT patients were higher than those of CHB patients ( < 0.001). HCA and PCA analysis grouped IT and CHB patients into two distinct clusters. The relative abundance of viral OTUs differed mainly within the BCP/precore/core region and was significantly correlated with liver inflammation and fibrosis. For the IT phase classification, all machine-learning models showed higher AUC values compared to models based on HBsAg, APRI, and FIB-4. The relative abundance of viral OTUs reflects the severity of liver inflammation and fibrosis. The novel QS quantitative analysis approach could be used to diagnose IT patients more precisely and reduce the need for liver biopsy.
目前尚未建立明确识别慢性乙型肝炎(CHB)免疫耐受(IT)期的非侵入性模型。本研究旨在开发一种新方法,该方法结合了下一代测序(NGS)和机器学习算法,并使用我们最近发表的病毒准种(QS)分析软件包。共纳入 290 名接受肝活检的 HBeAg 阳性患者,并将其分为训练组(n=148)和验证组(n=142)。提取 HBV DNA,通过 NGS 获取 QS 序列。基于病毒操作分类单元(OTUs)进行层次聚类分析(HCA)和主成分分析(PCA),以探索 QS 与临床表型之间的相关性。使用三种机器学习算法,包括 K-最近邻、支持向量机和随机森林算法,构建用于 IT 期分类的诊断模型。根据组织病理学,诊断 90 名 IT 患者和 200 名 CHB 患者。IT 患者的 HBsAg 滴度高于 CHB 患者(<0.001)。HCA 和 PCA 分析将 IT 和 CHB 患者分为两个不同的簇。病毒 OTUs 的相对丰度主要在 BCP/precore/core 区存在差异,且与肝炎症和纤维化显著相关。对于 IT 期分类,所有机器学习模型的 AUC 值均高于基于 HBsAg、APRI 和 FIB-4 的模型。病毒 OTUs 的相对丰度反映了肝炎症和纤维化的严重程度。新型 QS 定量分析方法可更准确地诊断 IT 患者,减少肝活检的需求。