MUSI: an integrated system for identifying multiple specificity from very large peptide or nucleic acid data sets.

Affiliation

The Donnelly Centre, Banting and Best Department of Medical Research, University of Toronto, Toronto, ON, Canada M5S 3E1.

Publication information

Nucleic Acids Res. 2012 Mar;40(6):e47. doi: 10.1093/nar/gkr1294. Epub 2011 Dec 30.

Abstract

Peptide recognition domains and transcription factors play crucial roles in cellular signaling. They bind linear stretches of amino acids or nucleotides, respectively, with high specificity. Experimental techniques that assess the binding specificity of these domains, such as microarrays or phage display, can retrieve thousands of distinct ligands, providing detailed insight into binding specificity. In particular, the advent of next-generation sequencing has recently increased the throughput of such methods by several orders of magnitude. These advances have helped reveal the presence of distinct binding specificity classes that co-exist within a set of ligands interacting with the same target. Here, we introduce a software system called MUSI that can rapidly analyze very large data sets of binding sequences to determine the relevant binding specificity patterns. Our pipeline provides two major advances. First, it can detect previously unrecognized multiple specificity patterns in any data set. Second, it offers integrated processing of very large data sets from next-generation sequencing machines. The results are visualized as multiple sequence logos describing the different binding preferences of the protein under investigation. We demonstrate the performance of MUSI by analyzing recent phage display data for human SH3 domains as well as microarray data for mouse transcription factors.
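The sequence logos mentioned in the abstract summarize, for each position of a set of aligned ligands, how strongly particular residues are preferred. The sketch below illustrates that general idea only: it computes per-position frequencies and the standard information content (log2 of the alphabet size minus the Shannon entropy) for a handful of equal-length peptides. It is not MUSI's implementation, which the abstract does not describe; the function names and the toy peptides are hypothetical, and no small-sample correction is applied.

```python
import math
from collections import Counter


def position_frequencies(seqs):
    """Return one {symbol: frequency} dict per column of equal-length sequences."""
    if not seqs:
        raise ValueError("need at least one sequence")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must share one length")
    columns = []
    for i in range(length):
        counts = Counter(s[i] for s in seqs)
        columns.append({sym: n / len(seqs) for sym, n in counts.items()})
    return columns


def information_content(column, alphabet_size):
    """Bits at one column: log2(alphabet size) minus the Shannon entropy."""
    entropy = -sum(p * math.log2(p) for p in column.values() if p > 0)
    return math.log2(alphabet_size) - entropy


# Made-up peptides for illustration only (not data from the paper).
toy_ligands = ["PPLP", "PALP", "PPVP", "PSLP"]
freqs = position_frequencies(toy_ligands)
# 20 amino acids for peptides; use alphabet_size=4 for nucleotide sequences.
bits = [information_content(col, alphabet_size=20) for col in freqs]
```

A fully conserved column reaches the maximum of log2(20) bits, while a column with mixed residues scores lower. A data set containing several specificity classes would need one such profile per class, which is the situation the abstract says MUSI is designed to detect.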

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26de/3315295/93095b7ee7be/gkr1294f1.jpg
