Sathiamurthy M, Hickman H D, Cavett J W, Zahoor A, Prilliman K, Metcalf S, Fernandez Vina M, Hildebrand W H
Department of Microbiology and Immunology, University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA.
Tissue Antigens. 2003 Jan;61(1):12-9. doi: 10.1034/j.1399-0039.2003.610102.x.
We have established an HLA ligand database to provide scientists and clinicians with access to Major Histocompatibility Complex (MHC) class I and II motif and ligand data. The HLA Ligand Database is available on the world wide web at http://hlaligand.ouhsc.edu and contains ligands that have been published in peer-reviewed journals. HLA peptide datasets prove useful in several areas: ligands are important as targets for various immune responses while algorithms built upon ligand datasets allow identification of new peptides without time-consuming experimental procedures. A review of the HLA class I ligands in the database identifies strengths and deficiencies in the database and, therefore, the utility of the dataset for identifying new peptides. For instance, 212 HLA-A phenotypes exist of which 23 have a motif determined and 43 have peptides characterized. In terms of number of ligands, HLA-A0201 has 258 characterized ligands, A1101 has 25 peptides, while the remaining two-thirds of the HLA-A phenotypes have less than 10 associated peptide sequences. Characterization of ligands and motifs remains roughly the same at the HLA-B locus while the peptides of the HLA-C locus tend to be less characterized. These data show that 74% of HLA class I molecules do not have ligands represented in the database and thus algorithms based on the dataset could not predict ligands for a majority of the US population. Building upon this dataset and knowledge of HLA allelic frequencies, it is possible to plan a systematic expansion of the HLA class I ligand database to better identify ligands useful throughout the population.
我们建立了一个HLA配体数据库,为科学家和临床医生提供获取主要组织相容性复合体(MHC)I类和II类基序及配体数据的途径。HLA配体数据库可通过万维网访问,网址为http://hlaligand.ouhsc.edu,其中包含已在同行评审期刊上发表的配体。HLA肽数据集在多个领域证明是有用的:配体作为各种免疫反应的靶点很重要,而基于配体数据集构建的算法可以在不进行耗时实验程序的情况下识别新的肽。对数据库中HLA I类配体的综述确定了数据库的优势和不足,因此也确定了该数据集用于识别新肽的效用。例如,存在212种HLA - A表型,其中23种已确定基序,43种已鉴定肽段。就配体数量而言,HLA - A0201有258个已鉴定的配体,A1101有25个肽段,而其余三分之二的HLA - A表型相关肽序列少于10个。HLA - B位点的配体和基序特征大致相同,而HLA - C位点的肽段特征往往较少。这些数据表明,74%的HLA I类分子在数据库中没有配体代表,因此基于该数据集的算法无法为大多数美国人群预测配体。基于该数据集以及HLA等位基因频率的知识,有可能计划对HLA I类配体数据库进行系统扩展,以更好地识别在整个人口中有用的配体。