KayvanJoo Amir Hossein, Ebrahimi Mansour, Haqshenas Gholamreza
Department of Biology, School of Basic Sciences, University of Qom, Qom, Iran.
BMC Res Notes. 2014 Aug 23;7:565. doi: 10.1186/1756-0500-7-565.
Hepatitis C virus (HCV) causes chronic hepatitis C in 2-3% of world population and remains one of the health threatening human viruses, worldwide. In the absence of an effective vaccine, therapeutic approach is the only option to combat hepatitis C. Interferon-alpha (IFN-alpha) and ribavirin (RBV) combination alone or in combination with recently introduced new direct-acting antivirals (DAA) is used to treat patients infected with HCV. The present study utilized feature selection methods (Gini Index, Chi Squared and machine learning algorithms) and other bioinformatics tools to identify genetic determinants of therapy outcome within the entire HCV nucleotide sequence.
Using combination of several algorithms, the present study performed a comprehensive bioinformatics analysis and identified several nucleotide attributes within the full-length nucleotide sequences of HCV subtypes 1a and 1b that correlated with treatment outcome. Feature selection algorithms identified several nucleotide features (e.g. count of hydrogen and CG). Combination of algorithms utilized the selected nucleotide attributes and predicted HCV subtypes 1a and 1b therapy responders from non-responders with an accuracy of 75.00% and 85.00%, respectively. In addition, therapy responders and relapsers were categorized with an accuracy of 82.50% and 84.17%, respectively. Based on the identified attributes, decision trees were induced to differentiate different therapy response groups.
The present study identified new genetic markers that potentially impact the outcome of hepatitis C treatment. In addition, the results suggest new viral genomic attributes that might influence the outcome of IFN-mediated immune response to HCV infection.
丙型肝炎病毒(HCV)在全球2%-3%的人口中引发慢性丙型肝炎,仍然是全球对人类健康构成威胁的病毒之一。在缺乏有效疫苗的情况下,治疗方法是对抗丙型肝炎的唯一选择。单独使用α干扰素(IFN-α)和利巴韦林(RBV)联合,或与最近推出的新型直接作用抗病毒药物(DAA)联合,用于治疗HCV感染患者。本研究利用特征选择方法(基尼指数、卡方检验和机器学习算法)及其他生物信息学工具,在整个HCV核苷酸序列中识别治疗结果的遗传决定因素。
本研究结合多种算法进行了全面的生物信息学分析,在HCV 1a和1b亚型的全长核苷酸序列中识别出几个与治疗结果相关的核苷酸特征。特征选择算法识别出几个核苷酸特征(如氢原子计数和CG)。算法组合利用所选的核苷酸特征,分别以75.00%和85.00%的准确率从无反应者中预测出HCV 1a和1b亚型的治疗反应者。此外,治疗反应者和复发者的分类准确率分别为82.50%和84.17%。基于识别出的特征,构建决策树以区分不同的治疗反应组。
本研究识别出可能影响丙型肝炎治疗结果的新遗传标记。此外,结果表明了可能影响IFN介导的针对HCV感染免疫反应结果的新病毒基因组特征。