The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel.
Max Planck Institute for Developmental Biology, Max Planck Society (MPG), Tübingen, Germany.
Front Immunol. 2021 Feb 10;11:619896. doi: 10.3389/fimmu.2020.619896. eCollection 2020.
The presence of pathogen-specific antibodies in an individual's blood-sample is used as an indication of previous exposure and infection to that specific pathogen (e.g., virus or bacterium). Measurement of the diagnostic antibodies is routinely achieved using solid phase immuno-assays such as ELISA tests and western blots. Here, we describe a sero-diagnostic approach based on phage-display of epitope arrays we term "Domain-Scan". We harness Next-generation sequencing (NGS) to measure the serum binding to dozens of epitopes derived from HIV-1 and HCV simultaneously. The distinction of healthy individuals from those infected with either HIV-1 or HCV, is modeled as a machine-learning classification problem, in which each determinant ("domain") is considered as a feature, and its NGS read-out provides values that correspond to the level of determinant-specific antibodies in the sample. We show that following training of a machine-learning model on labeled examples, we can very accurately classify unlabeled samples and pinpoint the domains that contribute most to the classification. Our experimental/computational Domain-Scan approach is general and can be adapted to other pathogens as long as sufficient training samples are provided.
个体血液样本中病原体特异性抗体的存在可作为其先前暴露于和感染特定病原体(例如病毒或细菌)的指示。通常使用固相免疫分析(如 ELISA 试验和免疫印迹)来测量诊断抗体。在这里,我们描述了一种基于噬菌体展示表位阵列的血清诊断方法,我们称之为“结构域扫描”。我们利用下一代测序(NGS)技术同时测量数十种源自 HIV-1 和 HCV 的表位与血清的结合。将健康个体与感染 HIV-1 或 HCV 的个体区分开来,可以建模为机器学习分类问题,其中每个决定簇(“结构域”)被视为一个特征,其 NGS 读出值对应于样本中特定决定簇抗体的水平。我们表明,在对标记示例进行机器学习模型训练之后,我们可以非常准确地对未标记的样本进行分类,并确定对分类贡献最大的结构域。我们的实验/计算结构域扫描方法是通用的,只要提供足够的训练样本,就可以适用于其他病原体。