Agarwal Deepesh, Caillouet Christelle, Coudert David, Cazals Frederic
From the ‖Inria Sophia-Antipolis (Algorithms-Biology-Structure), 06902 Sophia Antipolis, France;
‡Univ. Nice Sophia Antipolis, CNRS, I3S, UMR 7271, 06900 Sophia Antipolis, France; §Inria Sophia Antipolis (COATI), 06902 Sophia Antipolis, France.
Mol Cell Proteomics. 2015 Aug;14(8):2274-84. doi: 10.1074/mcp.M114.047779. Epub 2015 Apr 7.
Consider a set of oligomers listing the subunits involved in subcomplexes of a macromolecular assembly, obtained e.g. using native mass spectrometry or affinity purification. Given these oligomers, connectivity inference (CI) consists of finding the most plausible contacts between these subunits, and minimum connectivity inference (MCI) is the variant consisting of finding a set of contacts of smallest cardinality. MCI problems avoid speculating on the total number of contacts but yield a subset of all contacts and do not allow exploiting a priori information on the likelihood of individual contacts. In this context, we present two novel algorithms, MILP-W and MILP-WB. The former solves the minimum weight connectivity inference (MWCI), an optimization problem whose criterion mixes the number of contacts and their likelihood. The latter uses the former in a bootstrap fashion to improve the sensitivity and the specificity of solution sets.Experiments on three systems (yeast exosome, yeast proteasome lid, human eIF3), for which reference contacts are known (crystal structure, cryo electron microscopy, cross-linking), show that our algorithms predict contacts with high specificity and sensitivity, yielding a very significant improvement over previous work, typically a twofold increase in sensitivity.The software accompanying this paper is made available and should prove of ubiquitous interest whenever connectivity inference from oligomers is faced.
考虑一组列出参与大分子组装亚复合物的亚基的寡聚物,例如通过使用原生质谱法或亲和纯化获得。给定这些寡聚物,连通性推断(CI)包括找到这些亚基之间最合理的接触,而最小连通性推断(MCI)是由找到一组基数最小的接触组成的变体。MCI问题避免推测接触的总数,但会产生所有接触的一个子集,并且不允许利用关于单个接触可能性的先验信息。在此背景下,我们提出了两种新颖的算法,即MILP-W和MILP-WB。前者解决最小权重连通性推断(MWCI),这是一个优化问题,其标准混合了接触的数量及其可能性。后者以自举方式使用前者来提高解集的敏感性和特异性。在三个已知参考接触(晶体结构、冷冻电子显微镜、交联)的系统(酵母外泌体、酵母蛋白酶体盖子、人eIF3)上进行的实验表明,我们的算法能够以高特异性和敏感性预测接触,比以前的工作有非常显著的改进,通常敏感性提高两倍。本文附带的软件可供使用,并且每当面临从寡聚物进行连通性推断时,应该会引起广泛关注。