Suppr超能文献

Logo2PWM:将序列 logo 转换为位置权重矩阵的工具。

Logo2PWM: a tool to convert sequence logo to position weight matrix.

机构信息

Department of Computer Science, The University of Texas at San Antonio, One UTSA Circle, San Antonio, 78249, TX, USA.

出版信息

BMC Genomics. 2017 Oct 3;18(Suppl 6):709. doi: 10.1186/s12864-017-4023-9.

Abstract

BACKGROUND

position weight matrix (PWM) and sequence logo are the most widely used representations of transcription factor binding site (TFBS) in biological sequences. Sequence logo - a graphical representation of PWM, has been widely used in scientific publications and reports, due to its easiness of human perception, rich information, and simple format. Different from sequence logo, PWM works great as a precise and compact digitalized form, which can be easily used by a variety of motif analysis software. There are a few available tools to generate sequence logos from PWM; however, no tool does the reverse. Such tool to convert sequence logo back to PWM is needed to scan a TFBS represented in logo format in a publication where the PWM is not provided or hard to be acquired. A major difficulty in developing such tool to convert sequence logo to PWM is to deal with the diversity of sequence logo images.

RESULTS

We propose logo2PWM for reconstructing PWM from a large variety of sequence logo images. Evaluation results on over one thousand logos from three sources of different logo format show that the correlation between the reconstructed PWMs and the original PWMs are constantly high, where median correlation is greater than 0.97.

CONCLUSION

Because of the high recognition accuracy, the easiness of usage, and, the availability of both web-based service and stand-alone application, we believe that logo2PWM can readily benefit the study of transcription by filling the gap between sequence logo and PWM.

摘要

背景

位置权重矩阵(PWM)和序列图是生物序列中最常用的转录因子结合位点(TFBS)表示形式。序列图是 PWM 的图形表示,由于其易于人类感知、信息丰富和格式简单,已在科学出版物和报告中得到广泛应用。与序列图不同,PWM 作为一种精确而紧凑的数字化形式,可由各种基序分析软件轻松使用,效果非常好。有一些可用的工具可以从 PWM 生成序列图,但没有工具可以反向操作。在出版物中,当没有提供或难以获取 PWM 时,需要这样的工具将序列图转换回 PWM,以便扫描以图形式表示的 TFBS。开发将序列图转换为 PWM 的此类工具的主要困难在于处理序列图图像的多样性。

结果

我们提出了 logo2PWM,用于从各种不同格式的序列图图像中重建 PWM。来自三个来源的一千多个徽标进行评估的结果表明,重建的 PWM 与原始 PWM 之间的相关性始终很高,中位数相关性大于 0.97。

结论

由于识别精度高、使用方便以及提供了基于网络的服务和独立应用程序,我们相信 logo2PWM 可以通过填补序列图和 PWM 之间的空白,为转录研究带来好处。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e08/5629559/106d41f53d94/12864_2017_4023_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验