Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, California, United States of America.
PLoS Comput Biol. 2011 Mar;7(3):e1001111. doi: 10.1371/journal.pcbi.1001111. Epub 2011 Mar 24.
The multidimensional computations performed by many biological systems are often characterized with limited information about the correlations between inputs and outputs. Given this limitation, our approach is to construct the maximum noise entropy response function of the system, leading to a closed-form and minimally biased model consistent with a given set of constraints on the input/output moments; the result is equivalent to conditional random field models from machine learning. For systems with binary outputs, such as neurons encoding sensory stimuli, the maximum noise entropy models are logistic functions whose arguments depend on the constraints. A constraint on the average output turns the binary maximum noise entropy models into minimum mutual information models, allowing for the calculation of the information content of the constraints and an information theoretic characterization of the system's computations. We use this approach to analyze the nonlinear input/output functions in macaque retina and thalamus; although these systems have been previously shown to be responsive to two input dimensions, the functional form of the response function in this reduced space had not been unambiguously identified. A second order model based on the logistic function is found to be both necessary and sufficient to accurately describe the neural responses to naturalistic stimuli, accounting for an average of 93% of the mutual information with a small number of parameters. Thus, despite the fact that the stimulus is highly non-Gaussian, the vast majority of the information in the neural responses is related to first and second order correlations. Our results suggest a principled and unbiased way to model multidimensional computations and determine the statistics of the inputs that are being encoded in the outputs.
许多生物系统执行的多维计算通常具有输入和输出之间相关性的有限信息。考虑到这一限制,我们的方法是构建系统的最大噪声熵响应函数,从而得到一个与输入/输出矩的给定约束一致的闭式和最小偏差模型;结果相当于机器学习中的条件随机场模型。对于具有二进制输出的系统,例如编码感觉刺激的神经元,最大噪声熵模型是逻辑函数,其参数取决于约束条件。输出平均值的约束将二进制最大噪声熵模型转化为最小互信息模型,允许计算约束的信息量和系统计算的信息论特征。我们使用这种方法来分析猕猴视网膜和丘脑的非线性输入/输出函数;尽管这些系统以前被证明对两个输入维度有反应,但在这个简化空间中响应函数的函数形式尚未明确确定。基于逻辑函数的二阶模型被发现不仅是必要的,而且足以准确描述对自然刺激的神经反应,平均可以用少数参数解释 93%的互信息。因此,尽管刺激高度非高斯,但神经反应中的绝大多数信息都与一阶和二阶相关。我们的结果表明了一种有原则的、无偏的多维计算建模方法,并确定了在输出中编码的输入的统计信息。