Department of Statistics, University of Oxford, Oxford OX1 3LB, United Kingdom; and.
UCB Pharma, Slough SL1 3WE, United Kingdom.
J Immunol. 2018 Oct 15;201(8):2502-2509. doi: 10.4049/jimmunol.1800708. Epub 2018 Sep 14.
Abs are immune system proteins that recognize noxious molecules for elimination. Their sequence diversity and binding versatility have made Abs the primary class of biopharmaceuticals. Recently, it has become possible to query their immense natural diversity using next-generation sequencing of Ig gene repertoires (Ig-seq). However, Ig-seq outputs are currently fragmented across repositories and tend to be presented as raw nucleotide reads, which means nontrivial effort is required to reuse the data for analysis. To address this issue, we have collected Ig-seq outputs from 55 studies, covering more than half a billion Ab sequences across diverse immune states, organisms (primarily human and mouse), and individuals. We have sorted, cleaned, annotated, translated, and numbered these sequences and make the data available via our Observed Antibody Space (OAS) resource at http://antibodymap.org The data within OAS will be regularly updated with newly released Ig-seq datasets. We believe OAS will facilitate data mining of immune repertoires for improved understanding of the immune system and development of better biotherapeutics.
抗体是识别有害分子并将其清除的免疫系统蛋白。其序列多样性和结合的多功能性使抗体成为主要的生物制药类别。最近,使用下一代免疫球蛋白基因库测序(Ig-seq)技术,已经可以查询其巨大的自然多样性。然而,Ig-seq 的输出结果目前分散在各个存储库中,并且往往以原始核苷酸读取的形式呈现,这意味着需要付出相当大的努力才能重新使用这些数据进行分析。为了解决这个问题,我们收集了 55 项研究的 Ig-seq 输出结果,涵盖了来自不同免疫状态、生物体(主要是人类和小鼠)和个体的超过 5 亿个抗体序列。我们对这些序列进行了分类、清理、注释、翻译和编号,并通过我们的抗体观察空间(OAS)资源(http://antibodymap.org)提供这些数据。OAS 中的数据将定期更新,以包含新发布的 Ig-seq 数据集。我们相信,OAS 将促进免疫库的数据挖掘,从而更好地理解免疫系统并开发更好的生物疗法。