Finding Protein and Nucleotide Similarities with FASTA.

Author information

Pearson William R

Affiliation

University of Virginia School of Medicine, Charlottesville, Virginia.

Publication information

Curr Protoc Bioinformatics. 2016 Mar 24;53:3.9.1-3.9.25. doi: 10.1002/0471250953.bi0309s53.

Abstract

The FASTA programs provide a comprehensive set of rapid similarity searching tools (fasta36, fastx36, tfastx36, fasty36, tfasty36), similar to those provided by the BLAST package, as well as programs for slower, optimal, local, and global similarity searches (ssearch36, ggsearch36), and for searching with short peptides and oligonucleotides (fasts36, fastm36). The FASTA programs use an empirical strategy for estimating statistical significance that accommodates a range of similarity scoring matrices and gap penalties, improving alignment boundary accuracy and search sensitivity. The FASTA programs can produce "BLAST-like" alignment and tabular output, for ease of integration into existing analysis pipelines, and can search small, representative databases, and then report results for a larger set of sequences, using links from the smaller dataset. The FASTA programs work with a wide variety of database formats, including MySQL and PostgreSQL databases. The programs also provide a strategy for integrating domain and active site annotations into alignments and highlighting the mutational state of functionally critical residues. These protocols describe how to use the FASTA programs to characterize protein and DNA sequences, using protein:protein, protein:DNA, and DNA:DNA comparisons.
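As an illustration of how the "BLAST-like" tabular output mentioned in the abstract might be consumed in an analysis pipeline, the following is a minimal Python sketch. It assumes the output follows the conventional 12-column, tab-separated BLAST tabular layout (query, subject, percent identity, alignment length, mismatches, gap opens, query start/end, subject start/end, E-value, bit score); the exact columns emitted by a given FASTA invocation should be checked against the FASTA documentation. The names `Hit`, `parse_tabular`, and `significant_hits`, the E-value cutoff, and the sample rows are all hypothetical and invented for this example.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple


class Hit(NamedTuple):
    """One row of 12-column BLAST-style tabular output (assumed layout)."""
    query: str
    subject: str
    percent_identity: float
    alignment_length: int
    mismatches: int
    gap_opens: int
    q_start: int
    q_end: int
    s_start: int
    s_end: int
    evalue: float
    bit_score: float


def parse_tabular(lines: Iterable[str]) -> Iterator[Hit]:
    """Yield a Hit per data row, skipping blank lines and '#' comment lines."""
    for row in csv.reader(lines, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue
        if len(row) < 12:
            raise ValueError(f"expected at least 12 columns, got {len(row)}")
        # Only the first 12 columns are read; any extra columns are ignored.
        yield Hit(
            row[0], row[1], float(row[2]),
            int(row[3]), int(row[4]), int(row[5]),
            int(row[6]), int(row[7]), int(row[8]), int(row[9]),
            float(row[10]), float(row[11]),
        )


def significant_hits(lines: Iterable[str], max_evalue: float = 1e-3) -> list:
    """Return hits whose E-value is at or below max_evalue (arbitrary cutoff)."""
    return [hit for hit in parse_tabular(lines) if hit.evalue <= max_evalue]


# Made-up rows for demonstration only; these are not real search results.
SAMPLE = (
    "# illustrative comment line\n"
    "query1\tsubjA\t45.2\t210\t100\t3\t1\t208\t5\t214\t2.1e-30\t120.5\n"
    "query1\tsubjB\t25.0\t80\t55\t2\t10\t88\t40\t119\t0.5\t28.1\n"
)

if __name__ == "__main__":
    for hit in significant_hits(io.StringIO(SAMPLE)):
        print(hit.subject, hit.evalue, hit.bit_score)
```

Reading the rows into a named tuple keeps column positions in one place, so a pipeline that already handles BLAST tabular output could, under the stated assumption, reuse the same filtering step for FASTA results.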
