Theiler James, Korber Bette
Los Alamos National Laboratory, Los Alamos, 87545, NM, U.S.A.
New Mexico Consortium, Los Alamos, 87545, NM, U.S.A.
Stat Med. 2018 Jan 30;37(2):181-194. doi: 10.1002/sim.7203. Epub 2017 Jan 29.
Epigraph is a recently developed algorithm that enables the computationally efficient design of single or multi-antigen vaccines to maximize the potential epitope coverage for a diverse pathogen population. Potential epitopes are defined as short contiguous stretches of proteins, comparable in length to T-cell epitopes. This optimal coverage problem can be formulated in terms of a directed graph, with candidate antigens represented as paths that traverse this graph. Epigraph protein sequences can also be used as the basis for designing peptides for experimental evaluation of immune responses in natural infections to highly variable proteins. The epigraph tool suite also enables rapid characterization of populations of diverse sequences from an immunological perspective. Fundamental distance measures are based on immunologically relevant shared potential epitope frequencies, rather than simple Hamming or phylogenetic distances. Here, we provide a mathematical description of the epigraph algorithm, include a comparison of different heuristics that can be used when graphs are not acyclic, and we describe an additional tool we have added to the web-based epigraph tool suite that provides frequency summaries of all distinct potential epitopes in a population. We also show examples of the graphical output and summary tables that can be generated using the epigraph tool suite and explain their content and applications. Published 2017. This article is a U.S. Government work and is in the public domain in the USA. Statistics in Medicine published by John Wiley & Sons Ltd.
Epigraph是一种最近开发的算法,它能够以计算高效的方式设计单抗原或多抗原疫苗,从而为多种病原体群体最大化潜在表位覆盖率。潜在表位被定义为蛋白质的短连续片段,其长度与T细胞表位相当。这个最优覆盖率问题可以用有向图来表述,候选抗原表示为遍历该图的路径。Epigraph蛋白质序列还可以用作设计肽的基础,用于对自然感染中针对高度可变蛋白质的免疫反应进行实验评估。Epigraph工具套件还能够从免疫学角度快速表征不同序列的群体。基本距离度量基于免疫学相关的共享潜在表位频率,而不是简单的汉明距离或系统发育距离。在这里,我们提供了Epigraph算法的数学描述,包括对在图不是无环图时可使用的不同启发式方法的比较,并且我们描述了我们添加到基于网络的Epigraph工具套件中的一个附加工具,该工具提供群体中所有不同潜在表位的频率汇总。我们还展示了使用Epigraph工具套件可以生成的图形输出和汇总表的示例,并解释了它们的内容和应用。发表于2017年。本文是美国政府作品,在美国属于公共领域。由John Wiley & Sons Ltd.出版的《医学统计学》