Suppr超能文献

利用 SIMSApiper 进行大规模结构信息指导的蛋白质多重序列比对。

Large-scale structure-informed multiple sequence alignment of proteins with SIMSApiper.

机构信息

Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, 1050, Belgium.

Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, 1050, Belgium.

出版信息

Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae276.

Abstract

SUMMARY

SIMSApiper is a Nextflow pipeline that creates reliable, structure-informed MSAs of thousands of protein sequences faster than standard structure-based alignment methods. Structural information can be provided by the user or collected by the pipeline from online resources. Parallelization with sequence identity-based subsets can be activated to significantly speed up the alignment process. Finally, the number of gaps in the final alignment can be reduced by leveraging the position of conserved secondary structure elements.

AVAILABILITY AND IMPLEMENTATION

The pipeline is implemented using Nextflow, Python3, and Bash. It is publicly available on github.com/Bio2Byte/simsapiper.

摘要

摘要

SIMSApiper 是一个 Nextflow 管道,它比标准的基于结构的对齐方法更快地为数千个蛋白质序列创建可靠的、结构信息丰富的 MSAs。结构信息可以由用户提供,也可以由管道从在线资源中收集。可以通过基于序列同一性的子集进行并行化,从而显著加快对齐过程。最后,可以利用保守的二级结构元素的位置来减少最终对齐中的空位数量。

可用性和实现

该管道使用 Nextflow、Python3 和 Bash 实现。它在 github.com/Bio2Byte/simsapiper 上公开可用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d6f/11099654/fced1dfe1794/btae276f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验