Miosge Lisa A, Field Matthew A, Sontani Yovina, Cho Vicky, Johnson Simon, Palkova Anna, Balakishnan Bhavani, Liang Rong, Zhang Yafei, Lyon Stephen, Beutler Bruce, Whittle Belinda, Bertram Edward M, Enders Anselm, Goodnow Christopher C, Andrews T Daniel
Immunogenomics Laboratory, John Curtin School of Medical Research, Australian National University, Canberra City, ACT 2601, Australia;
Immunogenomics Laboratory, John Curtin School of Medical Research, Australian National University, Canberra City, ACT 2601, Australia; Australian Phenomics Facility, John Curtin School of Medical Research, Australian National University, Canberra City, ACT 2601, Australia;
Proc Natl Acad Sci U S A. 2015 Sep 15;112(37):E5189-98. doi: 10.1073/pnas.1511585112. Epub 2015 Aug 12.
Each person's genome sequence has thousands of missense variants. Practical interpretation of their functional significance must rely on computational inferences in the absence of exhaustive experimental measurements. Here we analyzed the efficacy of these inferences in 33 de novo missense mutations revealed by sequencing in first-generation progeny of N-ethyl-N-nitrosourea-treated mice, involving 23 essential immune system genes. PolyPhen2, SIFT, MutationAssessor, Panther, CADD, and Condel were used to predict each mutation's functional importance, whereas the actual effect was measured by breeding and testing homozygotes for the expected in vivo loss-of-function phenotype. Only 20% of mutations predicted to be deleterious by PolyPhen2 (and 15% by CADD) showed a discernible phenotype in individual homozygotes. Half of all possible missense mutations in the same 23 immune genes were predicted to be deleterious, and most of these appear to become subject to purifying selection because few persist between separate mouse substrains, rodents, or primates. Because defects in immune genes could be phenotypically masked in vivo by compensation and environment, we compared inferences by the same tools with the in vitro phenotype of all 2,314 possible missense variants in TP53; 42% of mutations predicted by PolyPhen2 to be deleterious (and 45% by CADD) had little measurable consequence for TP53-promoted transcription. We conclude that for de novo or low-frequency missense mutations found by genome sequencing, half those inferred as deleterious correspond to nearly neutral mutations that have little impact on the clinical phenotype of individual cases but will nevertheless become subject to purifying selection.
每个人的基因组序列都有成千上万的错义变异。在缺乏详尽实验测量的情况下,对其功能意义的实际解读必须依赖于计算推断。在此,我们分析了这些推断在经N-乙基-N-亚硝基脲处理的小鼠第一代子代测序所揭示的33个新生错义突变中的有效性,这些突变涉及23个关键免疫系统基因。使用PolyPhen2、SIFT、MutationAssessor、Panther、CADD和Condel来预测每个突变的功能重要性,而实际效果则通过培育和检测纯合子以观察预期的体内功能丧失表型来衡量。只有20%被PolyPhen2预测为有害的突变(CADD预测的为15%)在个体纯合子中表现出可识别的表型。在相同的23个免疫基因中,所有可能的错义突变中有一半被预测为有害,并且其中大多数似乎受到纯化选择,因为在不同的小鼠亚系、啮齿动物或灵长类动物之间很少有此类突变持续存在。由于免疫基因的缺陷在体内可能因补偿作用和环境因素而在表型上被掩盖,我们将相同工具的推断结果与TP53中所有2314个可能的错义变异的体外表型进行了比较;被PolyPhen2预测为有害的突变中有42%(CADD预测的为45%)对TP53促进的转录几乎没有可测量的影响。我们得出结论,对于通过基因组测序发现的新生或低频错义突变,那些被推断为有害的突变中有一半对应于几乎中性的突变,这些突变对个体病例的临床表型影响很小,但仍会受到纯化选择。