Suppr超能文献

保守性有助于识别生物学上相关的晶体接触。

Conservation helps to identify biologically relevant crystal contacts.

作者信息

Valdar W S, Thornton J M

机构信息

Biomolecular Structure and Modelling Unit, Biochemistry and Molecular Biology Department, University College London, UK.

出版信息

J Mol Biol. 2001 Oct 19;313(2):399-416. doi: 10.1006/jmbi.2001.5034.

Abstract

Some crystal contacts are biologically relevant, most are not. We assess the utility of combining measures of size and conservation to discriminate between biological and non-biological contacts. Conservation and size information is calculated for crystal contacts in 53 families of homodimers and 65 families of monomers. Biological contacts are shown to be usually conserved and typically the largest contact in the crystal. A range of neural networks accepting different combinations and encodings of this information is used to answer the following questions: (1) is a given crystal contact biological, and (2) given all crystal contacts in a homodimer, which is the biological one? Predictions for (1) are performed on both homodimer and monomer datasets. The best performing neural network combined size and conservation inputs. For the homodimers, it correctly classified 48 out of 53 biological contacts and 364 out of 366 non-biological contacts, giving a combined accuracy of 98.3 %. A more robust performance statistic, the phi-coefficient, which accounts for imbalances in the dataset, gave a value of 0.92. Taking all 535 non-biological contacts from the 65 monomers, this predictor made erroneous classifications only 4.3 % of the time. Predictions for (2) were performed on homodimers only. The best performing network achieved a prediction accuracy of 98.1 % using size information alone. We conclude that in answering question (1) size and conservation combined discriminate biological from non-biological contacts better than either measure alone. For answering question (2), we conclude that in our dataset size is so powerful a discriminant that conservation adds little predictive benefit.

摘要

一些晶体接触具有生物学相关性,而大多数则没有。我们评估结合大小和保守性测量来区分生物学接触和非生物学接触的效用。计算了53个同二聚体家族和65个单体家族中晶体接触的保守性和大小信息。结果表明,生物学接触通常是保守的,并且通常是晶体中最大的接触。使用一系列接受此信息的不同组合和编码的神经网络来回答以下问题:(1)给定的晶体接触是否具有生物学相关性,以及(2)给定同二聚体中的所有晶体接触,哪一个是具有生物学相关性的?对(1)的预测在同二聚体和单体数据集上均进行。表现最佳的神经网络结合了大小和保守性输入。对于同二聚体,它正确分类了53个生物学接触中的48个以及366个非生物学接触中的364个,综合准确率为98.3%。一个更稳健的性能统计量,即考虑数据集中不平衡情况的phi系数,值为0.92。对于来自65个单体的所有535个非生物学接触,该预测器的错误分类率仅为4.3%。对(2)的预测仅在同二聚体上进行。表现最佳的网络仅使用大小信息就达到了98.1%的预测准确率。我们得出结论,在回答问题(1)时,结合大小和保守性比单独使用任何一种测量方法都能更好地区分生物学接触和非生物学接触。对于回答问题(2),我们得出结论,在我们的数据集中,大小是一个非常强大的判别因素,保守性几乎没有增加预测优势。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验