Rebhan Michael
Head Bioinformatics Support, Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.
Methods Mol Biol. 2010;609:45-57. doi: 10.1007/978-1-60327-241-4_3.
Protein sequence databases do not contain just the sequence of the protein itself but also annotation that reflects our knowledge of its function and contributing residues. In this chapter, we will discuss various public protein sequence databases, with a focus on those that are generally applicable. Special attention is paid to issues related to the reliability of both sequence and annotation, as those are fundamental to many questions researchers will ask. Using both well-annotated and scarcely annotated human proteins as examples, it will be shown what information about the targets can be collected from freely available Internet resources and how this information can be used. The results are shown to be summarized in a simple graphical model of the protein's sequence architecture highlighting its structural and functional modules.
蛋白质序列数据库不仅包含蛋白质本身的序列,还包含反映我们对其功能和起作用残基了解的注释。在本章中,我们将讨论各种公共蛋白质序列数据库,重点关注那些普遍适用的数据库。特别关注与序列和注释可靠性相关的问题,因为这些对于研究人员提出的许多问题至关重要。以注释良好和注释稀少的人类蛋白质为例,将展示可以从免费的互联网资源中收集到关于目标的哪些信息,以及如何使用这些信息。结果显示总结在一个简单的蛋白质序列结构图形模型中,突出其结构和功能模块。